0.4.1 - 2019-10-23
base
Tests are now much more extensive, thanks mostly to the newly added estimator tags.
compose
datasets
Added fetch_kdd99_http
.
Added fetch_sms
.
Added fetch_trec07p
.
ensemble
Removed ensemble.HedgeBinaryClassifier
because it's performance was subpar.
Removed ensemble.GroupRegressor
, as this should be a special case of ensemble.StackingRegressor
.
Fixed a bug where feature_extraction.CountVectorizer
and feature_extraction.TFIDFVectorizer
couldn't be pickled.
linear_model
linear_model.LogisticRegression
and linear_model.LinearRegression
now have an intercept_lr
parameter.
metrics
Metrics can now be composed using the +
operator, which is useful for evaluating multiple metrics at the same time.
Added metrics.Rolling
, which eliminates the need for a specific rolling implementation for each metric.
Each metric can now be passed a sample_weight
argument.
Added metrics.WeightedF1
.
Added metrics.WeightedFBeta
.
Added metrics.WeightedPrecision
.
Added metrics.WeightedRecall
.
neighbors
Added neighbors.KNeighborsRegressor
.
Added neighbors.KNeighborsClassifier
.
optim
Added optim.AdaMax
.
The optim
module has been reorganized into submodules; namely optim.schedulers
, optim.initializers
, and optim.losses
. The top-level now only contains optimizers. Some classes have been renamed accordingly. See the documentation for details.
Renamed optim.VanillaSGD
to optim.SGD
.
stats
Added stats.IQR
.
Added stats.RollingIQR
.
Cythonized stats.Mean
and stats.Var
.
stream
Added stream.shuffle
.
stream.iter_csv
now has fraction
and seed
parameters to sample rows, deterministically or not.
Renamed stream.iter_numpy
to stream.iter_array
.
stream.iter_csv
can now read from gzipped files.
time_series
time_series.Detrender
now has a window_size
parameter for detrending with a rolling mean.
tree
Added tree.RandomForestClassifier
.
utils
Fixed a bug where utils.dot
could take longer than necessary.