Revise Machine Learning Tutorial and Scaling Workflow by amandalin047 · Pull Request #3 · school-brainhack/module_machine_learning_neuroimaging

amandalin047 · 2026-05-15T23:54:49Z

This PR revises the machine learning tutorial notebook to improve conceptual clarity for beginners.
Main changes:

Clarified the explanation of linear SVC.
Reframed standard scaling as preprocessing rather than "model tweaking."
Added explanation of why scaling should be performed inside each CV fold using a Pipeline.
Clarified that matching predicted labels after scaling does not imply identical model behavior, since the decision-function values may still differ.
Added explanation of the standard workflow: cross-validation on the training set, choose a final pipeline, refit on the full training set, and evaluate once on the held-out test set.
Revised comments around final pipeline refitting and test-set evaluation.
Clarified that linear SVC coefficients / weights should not be interpreted as straightforward feature importance. They define the decision boundary in the transformed feature space, and their interpretation depends on scaling, correlations among features, preprocessing choices, regularization, and model geometry.

No hyperparameter tuning was added in this revision.

amandalin047 added 6 commits May 15, 2026 06:10

added machine-learning-with-nilearn.ipynb

b0f57a7

need to fix markdown

cca118b

final version

46895db

fixed typo

3b9293a

fixed typo

d838346

deleted unused imports

f866537

Provide feedback