split dataset into train and test python code example

Example 1: pandas split dataframe to train and test

train=df.sample(frac=0.8,random_state=200) #random state is a seed value
test=df.drop(train.index)

Example 2: pandas split train test

from sklearn.model_selection import train_test_split


y = df.pop('output')
X = df

X_train,X_test,y_train,y_test = train_test_split(X.index,y,test_size=0.2)
X.iloc[X_train] # return dataframe train

Example 3: pandas split train test

from sklearn.model_selection import train_test_split

train, test = train_test_split(df, test_size=0.2)

Example 4: split data train, test by id python

train_inds, test_inds = next(GroupShuffleSplit(test_size=.20, n_splits=2, random_state = 7).split(df, groups=df['Group_Id']))

train = df.iloc[train_inds]
test = df.iloc[test_inds]