Benchmarks¶

River benchmarks track the timing performance of models across git history using airspeed velocity (asv).

The benchmarks cover three tracks:

Binary classification — Logistic regression, AMF, ALMA on Bananas, Phishing
Multiclass classification — Naive Bayes, Hoeffding Trees, Random Forests, ensembles, KNN on ImageSegments
Regression — Linear models, trees, forests, ensembles, MLP on ChickWeights, TrumpApproval

Each benchmark runs full progressive validation (evaluate.iter_progressive_val_score) and measures wall-clock time. See the benchmarks README for instructions on running benchmarks locally.