remove dulicay in df based on column code example
Example 1: python: remove duplicate in a specific column
df = df.drop_duplicates(subset=['Column1', 'Column2'], keep='first')
Example 2: drop duplicates pandas first column
import pandas as pd
data = pd.read_csv("employees.csv")
data.sort_values("First Name", inplace = True)
data.drop_duplicates(subset ="First Name",keep = False, inplace = True)
print(data)
Example 3: get duplicate and remove but keep last in python df
drop_duplicates(self, subset=None, keep="last", inplace=False)