binning boundaries pandas code example

Example 1: difference between cut and qcut pandas

pd.cut(df['ext price'], bins=4).value_counts() #bin range size afre equal

#res
(87998.212, 120263.375]     12 #different no. of observation
(55603.989, 87998.212]       5
(120263.375, 152528.538]     2
(152528.538, 184793.7]       1
Name: ext price, dtype: int64

#If you want equal distribution of the items in your bins, use qcut . If you want to define your own numeric bin ranges, then use cut .

Example 2: difference between cut and qcut pandas

interval_range = pd.interval_range(start=0, freq=10000, end=200000)
df['cut_ex2'] = pd.cut(df['ext price'], bins=interval_range, labels=[1,2,3])
df.head()

#There is a downside to using interval_range . You can not define custom labels.