0.5.0 - 2020-03-13
compat
- Added
compat.PyTorch2CremeRegressor.
compat.SKL2CremeRegressor and compat.SKL2CremeClassifier now have an optional batch_size parameter in order to perform mini-batching.
compose
- Renamed
compose.Whitelister to compose.Select.
- Renamed
compose.Blacklister to compose.Discard.
facto
- Added
facto.FFMClassifier.
- Added
facto.FFMRegressor.
- Added
facto.FwFMClassifier.
- Added
facto.FwFMRegressor.
- Added
facto.HOFMClassifier.
- Added
facto.HOFMRegressor.
- Refactored
facto.FMClassifier.
- Refactored
facto.FMRegressor.
feature_selection
- Added
feature_selection.PoissonInclusion.
- Removed
feature_selection.RandomDiscarder as it didn't make much sense.
- Renamed
feature_extraction.CountVectorizer to feature_extraction.BagOfWords.
- Renamed
feature_extraction.TFIDFVectorizer to feature_extraction.TFIDF.
- Added
preprocessor and ngram_range parameters to feature_extraction.BagOfWords.
- Added
preprocessor and ngram_range parameters to feature_extraction.TFIDF.
datasets
- The
datasets module has been overhauled. Each dataset is now a class (e.g. fetch_electricity has become datasets.Elec2).
- Added
datasets.TrumpApproval.
- Added
datasets.MaliciousURL.
- Added
datasets.gen.SEA.
- Added
datasets.Higgs.
- Added
datasets.MovieLens100K.
- Added
datasets.Bananas.
- Added
datasets.Taxis.
- Added
datasets.ImageSegments.
- Added
datasets.SMTP
impute
- Added
impute.PreviousImputer.
linear_model
linear_model.FMClassifier has been moved to the facto module.
linear_model.FMRegressor has been moved to the facto module.
- Added
linear_model.ALMAClassifier.
metrics
- Added
metrics.ClassificationReport.
- Added
metrics.TimeRolling.
- The implementation of
metrics.ROCAUC was incorrect. Using the trapezoidal rule instead of Simpson's rule seems to be more robust.
metrics.PerClass has been removed; it is recommended that you use metrics.ClassificationReport instead as it gives a better overview.
- Moved
meta.TransformedTargetRegressor and meta.BoxCoxRegressor to this module (they were previously in the compose module).
- Added
meta.PredClipper
model_selection
- Added
model_selection.expand_param_grid to generate a list of models from a grid of parameters.
- Added the
model_selection.successive_halving method for selecting hyperparameters.
- The
online_score and online_qa_score methods have been merged into a single method named model_selection.progressive_val_score.
preprocessing
- Added
preprocessing.RBFSampler.
- Added
preprocessing.MaxAbsScaler.
- Added
preprocessing.RobustScaler.
- Added
preprocessing.Binarizer.
- Added
with_mean and with_std parameters to preprocessing.StandardScaler.
optim
- Added
optim.losses.BinaryFocalLoss.
- Added the
optim.AMSGrad optimizer.
- Added the
optim.Nadam optimizer.
- Added
optim.losses.Poisson.
- Fixed a performance bug in
optim.NesterovMomentum.
reco
- Added
reco.FunkMF.
- Renamed
reco.SVD to reco.BiasedMF.
- Renamed
reco.SGDBaseline to reco.Baseline.
- Models now expect a
dict input with user and item fields.
sampling
- Added
sampling.RandomUnderSampler.
- Added
sampling.RandomOverSampler.
- Added
sampling.RandomSampler.
- Added
sampling.HardSamplingClassifier.
- Added
sampling.HardSamplingRegressor.
stats
- Added
stats.AbsMax.
- Added
stats.RollingAbsMax.
stream
- Added
stream.iter_libsvm.
stream.iter_csv now supports reading from '.zip' files.
- Added
stream.Cache.
- Added a
drop parameter to stream.iter_csv to discard fields.