pandas remove rows where column is nan code example
Example 1: pandas drop row with nan
import pandas as pd
df = pd.DataFrame({'values_1': ['700','ABC','500','XYZ','1200'],
'values_2': ['DDD','150','350','400','5000']
})
df = df.apply (pd.to_numeric, errors='coerce')
df = df.dropna()
df = df.reset_index(drop=True)
print (df)
Example 2: pandas drop rows with nan in a particular column
In [30]: df.dropna(subset=[1])
Out[30]:
0 1 2
1 2.677677 -1.466923 -0.750366
2 NaN 0.798002 -0.906038
3 0.672201 0.964789 NaN
5 -1.250970 0.030561 -2.678622
6 NaN 1.036043 NaN
7 0.049896 -0.308003 0.823295
9 -0.310130 0.078891 NaN
Example 3: delete nans in df python
df[~np.isnan(df)]
Example 4: threshold meaning in pandas dropna
>>>df = pd.DataFrame({"name": ['Alfred', 'Batman', 'Catwoman'],
"toy": [np.nan, 'Batmobile', 'Bullwhip'],
"born": [pd.NaT, pd.Timestamp("1940-04-25"),
pd.NaT]})
df.dropna(thresh=2)
name toy born
1 Batman Batmobile 1940-04-25
2 Catwoman Bullwhip NaT