More on Pipelines — Applied Machine Learning in Python

PHOTO EMBED

Tue Oct 24 2023 19:42:56 GMT+0000 (Coordinated Universal Time)

Saved by @elham469

# GOOD!
scores = []
select = SelectPercentile(percentile=5)
for train, test in KFold().split(X, y):
    select.fit(X[train], y[train])
    X_sel_train = select.transform(X[train])
    knn = KNeighborsClassifier().fit(X_sel_train, y[train])
    X_sel_test = select.transform(X[test])
    score = knn.score(X_sel_test, y[test])
    scores.append(score)
content_copyCOPY

https://amueller.github.io/aml/01-ml-workflow/12-pipelines-gridsearch.html