how to apply Pearson correlation for feature selection in python code example
Example 1: scikit learn to identify highly correlated features
# Create correlation matrix
corr_matrix = df.corr().abs()
# Select upper triangle of correlation matrix
upper = corr_matrix.where(np.triu(np.ones(corr_matrix.shape), k=1).astype(np.bool))
# Find index of feature columns with correlation greater than 0.95
to_drop = [column for column in upper.columns if any(upper[column] > 0.95)]
Example 2: how to print correlation to a feature in pyhton
df[df.columns[1:]].corr()['LoanAmount'][:]