0.5.0 - 2020-03-13¶
compat¶
- Added
compat.PyTorch2CremeRegressor
. compat.SKL2CremeRegressor
andcompat.SKL2CremeClassifier
now have an optionalbatch_size
parameter in order to perform mini-batching.
compose¶
- Renamed
compose.Whitelister
tocompose.Select
. - Renamed
compose.Blacklister
tocompose.Discard
.
facto¶
- Added
facto.FFMClassifier
. - Added
facto.FFMRegressor
. - Added
facto.FwFMClassifier
. - Added
facto.FwFMRegressor
. - Added
facto.HOFMClassifier
. - Added
facto.HOFMRegressor
. - Refactored
facto.FMClassifier
. - Refactored
facto.FMRegressor
.
feature_selection¶
- Added
feature_selection.PoissonInclusion
. - Removed
feature_selection.RandomDiscarder
as it didn't make much sense.
feature_extraction¶
- Renamed
feature_extraction.CountVectorizer
tofeature_extraction.BagOfWords
. - Renamed
feature_extraction.TFIDFVectorizer
tofeature_extraction.TFIDF
. - Added
preprocessor
andngram_range
parameters tofeature_extraction.BagOfWords
. - Added
preprocessor
andngram_range
parameters tofeature_extraction.TFIDF
.
datasets¶
- The
datasets
module has been overhauled. Each dataset is now a class (e.g.fetch_electricity
has becomedatasets.Elec2
). - Added
datasets.TrumpApproval
. - Added
datasets.MaliciousURL
. - Added
datasets.gen.SEA
. - Added
datasets.Higgs
. - Added
datasets.MovieLens100K
. - Added
datasets.Bananas
. - Added
datasets.Taxis
. - Added
datasets.ImageSegments
. - Added
datasets.SMTP
impute¶
- Added
impute.PreviousImputer
.
linear_model¶
linear_model.FMClassifier
has been moved to thefacto
module.linear_model.FMRegressor
has been moved to thefacto
module.- Added
linear_model.ALMAClassifier
.
metrics¶
- Added
metrics.ClassificationReport
. - Added
metrics.TimeRolling
. - The implementation of
metrics.ROCAUC
was incorrect. Using the trapezoidal rule instead of Simpson's rule seems to be more robust. metrics.PerClass
has been removed; it is recommended that you usemetrics.ClassificationReport
instead as it gives a better overview.
meta¶
- Moved
meta.TransformedTargetRegressor
andmeta.BoxCoxRegressor
to this module (they were previously in thecompose
module). - Added
meta.PredClipper
model_selection¶
- Added
model_selection.expand_param_grid
to generate a list of models from a grid of parameters. - Added the
model_selection.successive_halving
method for selecting hyperparameters. - The
online_score
andonline_qa_score
methods have been merged into a single method namedmodel_selection.progressive_val_score
.
preprocessing¶
- Added
preprocessing.RBFSampler
. - Added
preprocessing.MaxAbsScaler
. - Added
preprocessing.RobustScaler
. - Added
preprocessing.Binarizer
. - Added
with_mean
andwith_std
parameters topreprocessing.StandardScaler
.
optim¶
- Added
optim.losses.BinaryFocalLoss
. - Added the
optim.AMSGrad
optimizer. - Added the
optim.Nadam
optimizer. - Added
optim.losses.Poisson
. - Fixed a performance bug in
optim.NesterovMomentum
.
reco¶
- Added
reco.FunkMF
. - Renamed
reco.SVD
toreco.BiasedMF
. - Renamed
reco.SGDBaseline
toreco.Baseline
. - Models now expect a
dict
input withuser
anditem
fields.
sampling¶
- Added
sampling.RandomUnderSampler
. - Added
sampling.RandomOverSampler
. - Added
sampling.RandomSampler
. - Added
sampling.HardSamplingClassifier
. - Added
sampling.HardSamplingRegressor
.
stats¶
- Added
stats.AbsMax
. - Added
stats.RollingAbsMax
.
stream¶
- Added
stream.iter_libsvm
. stream.iter_csv
now supports reading from '.zip' files.- Added
stream.Cache
. - Added a
drop
parameter tostream.iter_csv
to discard fields.