pyspark drop duplicate columns after join code example
Example 1: python: remove duplicate in a specific column
df = df.drop_duplicates(subset=['Column1', 'Column2'], keep='first')
Example 2: remove duplicate columns python dataframe
df = df.loc[:,~df.columns.duplicated()]