How to delete all rows in a dataframe?

df.drop(df.index,inplace=True) 

This line will delete all rows, while keeping the column names.


The latter is possible and strongly recommended - "inserting" rows row-by-row is highly inefficient. A sketch could be

>>> import numpy as np
>>> import pandas as pd
>>> index = np.arange(0, 10)
>>> df = pd.DataFrame(index=index, columns=['foo', 'bar'])
>>> df
Out[268]: 
   foo  bar
0  NaN  NaN
1  NaN  NaN
2  NaN  NaN
3  NaN  NaN
4  NaN  NaN
5  NaN  NaN
6  NaN  NaN
7  NaN  NaN
8  NaN  NaN
9  NaN  NaN

If you have an existing DataFrame with the columns you want then extract the column names into a list comprehension then create an empty DataFrame with your column names.

# Creating DataFrame from a CSV file with desired headers
csv_a = "path/to/my.csv"
df_a = pd.read_csv(csv_a)

# Extract column names into a list
names = [x for x in df_a.columns]

# Create empty DataFrame with those column names
df_b = pd.DataFrame(columns=names)

Here's another method if you have an existing DataFrame that you'd like to empty without recreating the column information:

df_empty = df[0:0]

df_empty is a DataFrame with zero rows but with the same column structure as df

Tags:

Python

Pandas