use sklearn's train_test_split function with a test_size = 0.2 and random_state = 42 code example
Example 1: sklearn split train test
import numpy as np
from sklearn.model_selection import train_test_split
X, y = np.arange(10).reshape((5, 2)), range(5)
X_train, X_test, y_train, y_test = train_test_split(
X, y, test_size=0.33, random_state=42)
X_train
y_train
X_test
y_test
Example 2: train_test_split example
train_test_split example
Example 3: splitting data into training and testing sklearn
train_features, test_features, train_labels, test_labels =
train_test_split(features, labels)