Add column in dataframe from list

A solution improving on the great one from @sparrow.

Let df be your dataset, and mylist the list of values you want to add to the dataframe.

Suppose you want to call the new column new_column.

First make the list into a Series:

import pandas as pd

column_values = pd.Series(mylist)

Then use the insert method to add the column. Its advantage is that it lets you choose where the new column goes. In the following example, the new column is placed first from the left (by setting loc=0):

df.insert(loc=0, column='new_column', value=column_values)
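The two steps above can be put together as a small runnable sketch. The sample data and the `existing` column name are made up for illustration. Note that `insert` aligns a Series on its index, so this sketch assumes df has the default 0..n-1 index.

```python
import pandas as pd

# Hypothetical sample data, for illustration only
df = pd.DataFrame({"existing": [10, 20, 30]})
mylist = ["a", "b", "c"]

# Wrap the list in a Series (it gets a default 0..n-1 index)
column_values = pd.Series(mylist)

# Insert as the first column; insert modifies df in place
df.insert(loc=0, column="new_column", value=column_values)

print(df.columns.tolist())
```

If df has a different index, passing the list itself (or `column_values.values`) as `value` assigns by position instead of by label.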

First, let's create the dataframe you had. I'll ignore columns B and C, as they are not relevant.

import pandas as pd

df = pd.DataFrame({'A': [0, 4, 5, 6, 7, 7, 6, 5]})

And the mapping that you desire:

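# Build {position: value} from the list: {0: 2, 1: 5, 2: 6, ...}
mapping = dict(enumerate([2, 5, 6, 8, 12, 16, 26, 32]))

# Look up each value of column A in the mapping
df['D'] = df['A'].map(mapping)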

Done!

print(df)

Output:

   A   D
0  0   2
1  4  12
2  5  16
3  6  26
4  7  32
5  7  32
6  6  26
7  5  16
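One thing worth knowing about the map approach: a value in A that has no key in the mapping produces NaN rather than an error. The sketch below shows this with a made-up frame in which 9 is deliberately out of range. The data is an assumption for illustration, not the asker's.

```python
import pandas as pd

# Hypothetical data: 9 is deliberately not a key in the mapping below
df = pd.DataFrame({"A": [0, 4, 5, 9]})
mapping = dict(enumerate([2, 5, 6, 8, 12, 16, 26, 32]))  # keys 0..7

df["D"] = df["A"].map(mapping)

# Rows whose value had no matching key come back as NaN
missing = df["D"].isna()
print(df[missing])
```

Checking `isna()` after the map is a cheap way to catch values that fell outside the list.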

IIUC, if you make your (unfortunately named) List into an ndarray, you can simply index into it naturally.

>>> import numpy as np
>>> m = np.arange(16)*10
>>> m[df.A]
array([  0,  40,  50,  60, 150, 150, 140, 130])
>>> df["D"] = m[df.A]
>>> df
    A   B   C    D
0   0 NaN NaN    0
1   4 NaN NaN   40
2   5 NaN NaN   50
3   6 NaN NaN   60
4  15 NaN NaN  150
5  15 NaN NaN  150
6  14 NaN NaN  140
7  13 NaN NaN  130

Here I built a new m, but if you use m = np.asarray(List), the same thing should work: the values in df.A will pick out the appropriate elements of m.

Note that if you're using an old version of numpy, you might have to use m[df.A.values] instead. In the past, numpy didn't play well with others, and some refactoring in pandas caused some headaches. Things have improved now.
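As a script rather than a REPL session, the same idea might look like the sketch below. The contents of `List` are a stand-in (the real list is whatever the question had), and `.to_numpy()` is used here as the current spelling of `.values`; that substitution is mine, not part of the original answer.

```python
import numpy as np
import pandas as pd

# Stand-in for the asker's list; the values are assumed for illustration
List = [v * 10 for v in range(16)]
df = pd.DataFrame({"A": [0, 4, 5, 6, 15, 15, 14, 13]})

m = np.asarray(List)

# Use the integers in column A as positions into m
df["D"] = m[df["A"].to_numpy()]

print(df)
```

Unlike the dict-based map, an out-of-range position here raises an IndexError instead of yielding NaN.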

Just assign the list directly:

df['new_col'] = mylist
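Direct assignment requires the list to have exactly as many elements as the dataframe has rows. A short sketch with made-up data shows both the working case and the ValueError raised on a mismatch:

```python
import pandas as pd

df = pd.DataFrame({"A": [1, 2, 3]})  # hypothetical data

# Same length as df: assigned by position
df["new_col"] = ["x", "y", "z"]

error_message = None
try:
    df["too_short"] = ["x", "y"]  # 2 values for 3 rows
except ValueError as exc:
    error_message = str(exc)
    print(error_message)
```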

Alternative
Convert the list to a series or array and then assign:

se = pd.Series(mylist)
df['new_col'] = se.values

df['new_col'] = np.array(mylist)
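The `.values` in the Series variant matters when df does not have the default index: assigning a Series aligns on index labels, while assigning an array goes by position. A sketch with a made-up, non-default index (the column names `aligned` and `positional` are illustrative):

```python
import pandas as pd

# Hypothetical frame with a non-default index
df = pd.DataFrame({"A": [1, 2, 3]}, index=[10, 20, 30])
mylist = ["x", "y", "z"]

se = pd.Series(mylist)  # index is 0, 1, 2

# Assigning the Series aligns on labels; none of 0, 1, 2 match 10, 20, 30
df["aligned"] = se

# Assigning the underlying array ignores labels and goes by position
df["positional"] = se.values

print(df)
```

Here the `aligned` column is all NaN, while `positional` holds the list values in order.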
