0.25.0 - 2026-05-31¶

Breaking changes¶

The native Rust extension moved from river.stats._rust_stats to river._river_rust, split into submodules stats, drift, tree, and vectordict. Pickles produced with prior versions no longer load directly. To convert existing pickles, use this migration script (the new river must be installed in the conversion env).
pandas is no longer a hard dependency of River. The core online interface (learn_one / predict_one) works with pip install river alone. The mini-batch interface (learn_many, predict_many, predict_proba_many, transform_many) still requires pandas; install with pip install "river[pandas]". Calling a *_many method without pandas raises an ImportError pointing to the extra.
Renamed drift.binary.HDDM_A → drift.binary.HDDMA and drift.binary.HDDM_W → drift.binary.HDDMW to comply with PEP-8 CapWords class naming.

cluster¶

Fixed cluster.TextClust corrupting its own parameters: __init__ was overwriting self.micro_distance / self.macro_distance with runtime distance instances, breaking clone and repr round-trips. The runtime instances are now stored on _micro_distance / _macro_distance. Internal camelCase identifiers (clusterId, microToMacro, numClusters, updateMacroClusters, _calculateIDF) were renamed to snake_case, and the nested helper classes tfcontainer, microcluster, distances were renamed to TfContainer, MicroCluster, Distances.

imblearn¶

Fixed imblearn.HardSamplingClassifier / imblearn.HardSamplingRegressor storing references to user-supplied feature dictionaries in their buffer; the buffered triplets now hold shallow copies so callers can safely mutate x after learn_one.

naive_bayes¶

Marked predict_many/predict_proba_many checks as skipped on BaseNB subclasses (MultinomialNB, BernoulliNB, ComplementNB) via _unit_test_skips. joint_log_likelihood_many's output is mis-aligned with the input batch when the model is trained via learn_one rather than learn_many, so the new mini-batch consistency checks fail. Tracked separately.

neighbors¶

Fixed neighbors.KNNClassifier / neighbors.KNNRegressor storing references to the input feature dicts in their search window; learn_one now stores a shallow copy.

preprocessing¶

Fixed preprocessing.RobustScaler.transform_one crashing with TypeError when called before any learn_one (the running median returned None); transform now passes the value through unchanged when centering statistics are not yet available.

tree¶

Fixed tree.mondrian.MondrianTreeRegressor.learn_one storing the input feature dict by reference on self._x; it now stores a shallow copy so callers can safely mutate x after learn_one. Knock-on fix for forest.AMFRegressor.
Breaking: Renamed tree.iSOUPTreeRegressor → tree.ISOUPTreeRegressor to comply with PEP-8 CapWords class naming.

tooling¶

Enabled the pep8-naming ruleset (N801, N802, N804) in ruff so that future class, function, and classmethod-first-argument naming violations are caught at lint time. N803 (argument names) and N806 (local variable names) were intentionally left out — X: pd.DataFrame, A_numpy = ..., and similar scientific-Python conventions are pervasive in the codebase.

docs¶

Fixed corrupted markdown cells in the Hoeffding Trees notebook example that caused blank page titles and invisible sidebar navigation. Fixes #1847.
Bumped zensical to 0.0.40 and enabled strict mode with link and footnote validation.
Fixed doc generation to escape bare brackets in type annotations and descriptions, produce proper footnote definitions, and use fenced code blocks for notebook outputs.
Fixed the "Releases" entry in the docs nav 404'ing when no unreleased.md exists: the update_releases_nav script no longer hard-codes an unreleased: releases/unreleased.md line in mkdocs.yml, instead only emitting it when the file is present on disk.

feature_extraction¶

Added feature_extraction.RandomTreesEmbedding, an online random-tree leaf embedding transformer for feeding sparse tree features into downstream models. Addresses #1386.

neural_net¶

Deprecated river.neural_net; importing it now emits a DeprecationWarning and users are encouraged to use deep-river for neural networks. Addresses #1828.

drift¶

Reimplemented drift.ADWIN's inner AdaptiveWindowing in Rust. The Cython sources are removed; output is bit-identical to the Cython baseline (width, total, variance, n_detections, drift_detected) over a 3.8k-step parity fuzz. Rust is 1.3-3.5x faster than the previous Cython implementation across clock settings.

compat¶

Fixed compat.SKL2RiverClassifier.predict_proba_many raising a TypeError whenever the wrapped estimator was already fitted: it incorrectly built a pd.Series(..., columns=...) instead of a pd.DataFrame. Test coverage previously only exercised the not-fitted branch. SKL2RiverClassifier and SKL2RiverRegressor are now also exercised by the generic estimator-check suite via _unit_test_params.
Fixed compat.SKL2RiverClassifier._multiclass advertising multi-class support unconditionally; it now reflects len(classes) > 2.

misc¶

Added misc.ZstdClassifier, a compression-based text classifier that scores documents by the size of their zstd-compressed output under per-class prefix dictionaries built from a sliding byte window. Requires Python 3.14 (compression.zstd). See Zstd-based text classification.

metrics¶

Sped up metrics.Silhouette by switching the centroid distance computations from the utils.math.minkowski_distance Python wrapper to a direct call into the Rust euclidean_distance_dict.
Reimplemented the inner expected_mutual_info routine (used by metrics.AdjustedMutualInfo) in Rust. The Cython sources are removed and the new implementation is roughly twice as fast as the old one across all tested contingency-table sizes.
Reimplemented metrics.RollingROCAUC and metrics.RollingPRAUC in Rust. The C++ implementation is removed. Output is bit-identical to the C++ version on all tested inputs and a latent bug in revert() with a non-default pos_val is also fixed.

utils¶

Reimplemented utils.VectorDict (and the helper functions euclidean_distance_dict, euclidean_distance_tuple, lazy_search_euclidean) in Rust. The Cython sources are removed; the public API is unchanged. Element-wise operations are faster across the board: vec + scalar and vec * scalar are ~18% faster on 20-key dicts and ~14% faster on 1000-key dicts; vec + vec is 4-5% faster, vec @ vec (dot product) is 4-10% faster. The constructor and __setitem__ are within 1-4% of the Cython baseline (~2 ns absolute, dominated by PyO3 object-allocation overhead).

anomaly¶

Sped up anomaly.HalfSpaceTrees.learn_one and score_one by replacing the generic recursive tree.base.Branch.walk traversal with an iterative tight loop specialised for HST, caching the (constant) size_limit and tree height as locals, and pivoting node masses through a precomputed flat node list. Output is unchanged. On a synthetic 10-feature stream score+learn is ~3.0× faster (27.9k → 85.2k obs/s), learn_one ~2.6×, and score_one ~3.8×; in a MinMaxScaler | HalfSpaceTrees pipeline on CreditCard the end-to-end pipeline is ~2.0× faster (20.5k → 40.5k obs/s).

cluster¶

Sped up cluster.DBSTREAM by replacing the per-cleanup copy.deepcopy of the micro-cluster dict with an in-place pop, replacing the deepcopy in the offline reclustering step with a direct micro-cluster construction, hoisting the Gaussian neighborhood factor out of the per-feature center update (it does not vary across dimensions), and folding the nested try/except KeyError shared-density update into a plain dict.get. Output is unchanged. On the 15k-sample synthetic-sklearn workload, learn_one is ~6.1× faster (0.516 s → 0.084 s) and learn_one + predict_one is ~4.3× faster (0.872 s → 0.204 s).
Sped up cluster.DenStream._merge by replacing the speculative copy.copy + insert + radius check with a non-mutating radius_with(x) that computes the would-be radius directly from linear_sum, squared_sum and N. Cached each micro-cluster's center (it reduces to linear_sum / N once the fading factor is cancelled algebraically), and switched the per-candidate distance lookup in _get_closest_cluster_key from the utils.math.minkowski_distance Python wrapper to a direct call into the Rust euclidean_distance_dict. On a 20k-sample 10-feature synthetic stream, learn_one is ~1.7× faster (6.4 µs/point → 3.8 µs/point). The change also fixes a latent shallow-copy bug: the previous code shared linear_sum/squared_sum between the copy.copy and the original, so a failed radius check left the original cluster with the candidate point's contributions added in (without bumping N).
Sped up cluster.CluStream.learn_one by caching each micro-cluster's center dict on the micro-cluster itself (invalidated on insert / __iadd__), materializing the center list once at the top of _maintain_micro_clusters instead of rebuilding it inside the n² pairwise scan, replacing the deepcopy-heavy Var.__add__ calls in CluStreamMicroCluster.__iadd__ with in-place Var.__iadd__, and switching _distance from the utils.math.minkowski_distance Python wrapper to a direct call into the Rust euclidean_distance_dict. The fix removes ~36M redundant center dict rebuilds and 367M Mean.get calls on a 5k-sample 10-feature synthetic stream. End-to-end learn_one is ~3.9× faster at d=10 (25.5 s → 6.5 s for 5k points), ~3.8× at d=20 and ~3.8× at d=50.

anomaly¶

Sped up api.anomaly.LocalOutlierFactor by replacing the default functools.partial([utils.math.minkowski_distance](../api/utils/math/minkowski-distance), p=2) distance function with a direct call into the Rust euclidean_distance_dict, removing the Python-level dispatch.

cluster¶

Sped up cluster.STREAMKMeans.predict_one by switching the per-center distance from the utils.math.minkowski_distance Python wrapper to a direct call into the Rust euclidean_distance_dict.

preprocessing¶

Sped up preprocessing.OneHotEncoder.transform_one by ~8x and learn_one + transform_one by ~5.5x (on 100k rows × 5 features with cardinality 20). The previous implementation rebuilt the all-zeros dict via {f"{i}_{v}": 0 ...} on every call; the encoder now maintains an incremental cache of that zero-dict and transform_one copies it instead of rebuilding. Output is unchanged.
Sped up preprocessing.StandardScaler by ~15% on learn_one and learn_one + transform_one by hoisting the self.counts/self.means/self.vars dict references out of the inner loop, splitting the with_std=True and with_std=False paths, and folding the safe_div call in transform_one into an inline branch (eliminating ~1M function calls per 100k samples × 10 features). The Welford update formula is unchanged.
Sped up preprocessing.MinMaxScaler.transform_one by ~1.3x by caching each feature's self.min[i].get() and self.max[i].get() results in locals (previously self.min[i].get() was called twice per feature) and inlining safe_div. preprocessing.MaxAbsScaler.transform_one benefits from the same safe_div inlining. learn_one is also slightly faster thanks to hoisting self.min/self.max/self.abs_max out of the loop. .update()/.get() on stats.Min/stats.Max/stats.AbsMax remain the only paths into those objects.

compose¶

Sped up compose.Pipeline end-to-end throughput by 1.3x–1.9x (e.g. scaler|lr 7.4 µs → 5.7 µs/event, (sel+sel)|scaler|lr 12.5 µs → 6.7 µs/event on TrumpApproval) by precomputing an execution plan (kind/_supervised flags) for each step at construction time, eliminating per-event isinstance checks via the EstimatorMeta.__instancecheck__ metaclass (~180k → 0 calls per 20k events) and repeated _supervised property lookups. The plan is invalidated on _add_step. The lazy _anomaly_filter_cls / _anomaly_detector_cls imports are now functools.cached.
Sped up compose.TransformerUnion.transform_one by replacing the dict(collections.ChainMap(*outputs)) merge with a single dict.update loop over reversed transformer outputs (~10x faster on the merge alone). Semantics are preserved (earlier transformers win on duplicate keys).
Sped up compose.Prefixer / compose.Suffixer transform_one by inlining the prefix/suffix concatenation in the dict comprehension instead of going through the _rename method on each key.

tree¶

Fixed MondrianNodeClassifier.replant not copying the counts attribute when promoting a leaf to a branch, leaving the new branch with n_samples != 0 but empty class counts. The fix mirrors the regressor's _mean copy and matches the reference onelearn implementation. Addresses #1823.
Fixed Mondrian tree leaf nodes losing their bounding box ranges during splits. Previously, when a leaf was split, the new child nodes did not inherit the memory_range_min and memory_range_max attributes, which caused incorrect range extension calculations. Fixes #1801
Fixed MondrianNodeClassifier.replant copying min and max bounds by reference instead of by value during a split. The fix ensures these arrays are explicitly copied by value so the bounds are correctly preserved. Fixed #1834
Skipped the expensive range_extension_c call for pure nodes in the Mondrian classifier's downward pass when split_pure=False (default). Benchmarks show ~3–5% speedup on datasets with 50+ features.
Reimplemented the Mondrian tree numerical helpers (tree.mondrian._mondrian_ops) in Rust. The Cython sources are removed; the helpers are now exposed via river.stats._rust_stats. Output matches the Cython baseline (Bananas accuracy unchanged at 70.64%). The leaf-to-root _go_upwards walk and the predict tree-walk also moved into Rust as single FFI calls, eliminating ~360k Python frame setups per 20k-sample run. End-to-end MondrianTreeClassifier learn+predict is ~28% faster (~23 µs/iter vs ~32 µs/iter Cython); MondrianTreeRegressor is ~21% faster (~31 µs/iter vs ~39 µs/iter) on a 20k-sample 10-feature synthetic stream.