How to pull a subset from a pandas DataFrame based on a condition: code example
Example 1: pandas - subset based on column value
import pandas as pd
import numpy as np
df = pd.DataFrame({'A': 'foo bar foo bar foo bar foo foo'.split(),
                   'B': 'one one two three two two one three'.split(),
                   'C': np.arange(8), 'D': np.arange(8) * 2})
print(df)
# Boolean mask: keep only the rows where column A equals 'foo'
sub_df = df.loc[df['A'] == 'foo']
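The boolean-mask pattern from Example 1 also covers combined conditions and membership tests. The following is a minimal sketch that rebuilds the same frame so it runs on its own. The result names sub_and, sub_isin and sub_query are illustrative and do not appear in the original snippet.

```python
import numpy as np
import pandas as pd

# Same frame as in Example 1, rebuilt so this snippet is self-contained.
df = pd.DataFrame({
    'A': 'foo bar foo bar foo bar foo foo'.split(),
    'B': 'one one two three two two one three'.split(),
    'C': np.arange(8),
    'D': np.arange(8) * 2,
})

# Combine conditions with & (and) or | (or); each condition needs its own parentheses.
sub_and = df.loc[(df['A'] == 'foo') & (df['C'] > 2)]

# Keep rows whose B value is in a list of allowed values.
sub_isin = df.loc[df['B'].isin(['one', 'three'])]

# The same filter as sub_and, written as a query string.
sub_query = df.query("A == 'foo' and C > 2")

print(sub_and)
print(sub_isin)
```

Each of these returns a new DataFrame that keeps the original row labels. Call .reset_index(drop=True) on the result if a fresh 0-based index is needed.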
Example 2: python - subset dataframe based on unique value of a column
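# Alternative 1: keep the first row for each value of my_var (default keep='first')
my_df = my_df.drop_duplicates(subset=['my_var'])
# Alternative 2: keep the last row for each value of my_var
my_df = my_df.drop_duplicates(subset=['my_var'], keep='last')
# Alternative 3: drop every row whose my_var value occurs more than once
my_df = my_df.drop_duplicates(subset=['my_var'], keep=False)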
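To see how the three keep options differ, here is a small sketch on made-up data. The contents of my_df, the value column, and the result names first_rows, last_rows and singletons are assumptions for illustration only. Each call starts from the same unmodified frame, so the results can be compared side by side.

```python
import pandas as pd

# Hypothetical sample data: 'a' and 'b' each appear twice, 'c' appears once.
my_df = pd.DataFrame({
    'my_var': ['a', 'b', 'a', 'c', 'b'],
    'value': [1, 2, 3, 4, 5],
})

# Default keep='first': the first row seen for each my_var value survives.
first_rows = my_df.drop_duplicates(subset=['my_var'])

# keep='last': the last row seen for each my_var value survives.
last_rows = my_df.drop_duplicates(subset=['my_var'], keep='last')

# keep=False: only values that occur exactly once survive.
singletons = my_df.drop_duplicates(subset=['my_var'], keep=False)

print(first_rows)
print(last_rows)
print(singletons)
```

drop_duplicates returns a new DataFrame and leaves my_df unchanged unless the result is assigned back to it.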