0.6.0 - 2020-06-09
base
Added a new base class called SupervisedTransformer, from which supervised transformers inherit. Before this, supervised transformers had an is_supervised property.
compose
Added compose.SelectType, which allows selecting feature subsets based on their type.
Added a score_one method to compose.Pipeline so that estimators from the anomaly module can be pipelined.
Added compose.Grouper, which allows applying transformers within different subgroups.
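As a rough illustration of type-based feature selection, the sketch below filters a dict of features by the Python type of each value. The helper name select_by_type and the dict-of-features representation are assumptions made for this example; they are not the library's API.

```python
def select_by_type(x, *types):
    """Keep only the features whose value is an instance of one of `types`.

    Hypothetical helper for illustration only. Note that in Python,
    bool is a subclass of int, so selecting int also keeps booleans.
    """
    return {name: value for name, value in x.items() if isinstance(value, types)}


x = {"age": 42, "height": 1.8, "country": "France"}
numeric = select_by_type(x, int, float)   # numeric features only
textual = select_by_type(x, str)          # string features only
```

Splitting features this way makes it possible to route numeric and categorical values to different transformers.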
datasets
Added datasets.Music, which is a dataset for multi-output binary classification.
Added datasets.synth.Friedman, which is a synthetic regression dataset.
The datasets.gen module has been renamed to datasets.synth.
Each dataset now has a __repr__ method which displays some descriptive information.
Added datasets.Insects, which has 10 variants.
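To give an idea of what a synthetic regression stream can look like, here is a minimal generator sketch. It assumes the classic Friedman #1 benchmark (10 uniform features, of which only the first 5 influence the target); whether datasets.synth.Friedman uses exactly this formulation is an assumption, and friedman_stream is a hypothetical name.

```python
import math
import random


def friedman_stream(n, seed=42, noise_std=1.0):
    """Yield n (features, target) pairs following the Friedman #1 function.

    Hypothetical sketch: features are indexed 0..9 and drawn uniformly
    from [0, 1); Gaussian noise with standard deviation `noise_std` is added.
    """
    rng = random.Random(seed)
    for _ in range(n):
        x = {i: rng.random() for i in range(10)}
        y = (
            10 * math.sin(math.pi * x[0] * x[1])
            + 20 * (x[2] - 0.5) ** 2
            + 10 * x[3]
            + 5 * x[4]
            + rng.gauss(0, noise_std)
        )
        yield x, y


for x, y in friedman_stream(3):
    print(round(y, 3))
```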
feature_extraction.Differ has been deprecated. We might put it back in the future if we find a better design.
impute
impute.StatImputer has been completely refactored.
metrics
In metrics.SMAPE, instead of raising a ZeroDivisionError, the convention is now to use 0 when both y_true and y_pred are equal to 0.
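The zero-denominator convention described above can be sketched in plain Python. This uses the common definition of sMAPE, 100 * mean(2 * |y_true - y_pred| / (|y_true| + |y_pred|)); the exact scaling used by metrics.SMAPE is an assumption here, and smape is a stand-alone helper rather than the library's class.

```python
def smape(y_true, y_pred):
    """Symmetric mean absolute percentage error, in percent.

    Illustrative sketch: a pair where both values are 0 contributes 0
    instead of triggering a division by zero.
    """
    total = 0.0
    n = 0
    for yt, yp in zip(y_true, y_pred):
        denom = abs(yt) + abs(yp)
        # denom is 0 only when both yt and yp are 0
        total += 0.0 if denom == 0 else 2.0 * abs(yt - yp) / denom
        n += 1
    return 100.0 * total / n if n else 0.0


score = smape([1, 0], [3, 0])  # the (0, 0) pair counts as a perfect prediction
```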
model_selection
Added the possibility to configure how progress is printed in model_selection.progressive_val_score. For instance, progress can now be printed to a file by providing the file argument.
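The idea of redirecting progress output can be sketched with a toy progressive validation loop that predicts first, then learns, and forwards a file argument to print. The function progressive_mae, its print_every parameter, and the running-mean "model" are all hypothetical and only stand in for the real function.

```python
import io


def progressive_mae(stream, print_every=0, file=None):
    """Toy progressive validation of a running-mean predictor.

    Hypothetical sketch: `file` is passed straight to print(), so None
    means standard output and any writable text object is accepted.
    """
    mean, n, abs_err = 0.0, 0, 0.0
    for i, y in enumerate(stream, start=1):
        abs_err += abs(y - mean)      # evaluate the prediction first
        n += 1
        mean += (y - mean) / n        # then update the model
        if print_every and i % print_every == 0:
            print(f"[{i}] MAE: {abs_err / i:.4f}", file=file)
    return abs_err / n if n else 0.0


buffer = io.StringIO()  # an open file handle would work the same way
final_score = progressive_mae([1.0, 3.0, 2.0, 2.0], print_every=2, file=buffer)
```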
multiclass
Added multiclass.OutputCodeClassifier.
Added multiclass.OneVsOneClassifier.
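For readers unfamiliar with the one-vs-one strategy, the sketch below shows its two core steps: enumerating one binary problem per pair of classes, then predicting by majority vote. The helper names and the tie-breaking rule are assumptions for illustration; the changelog does not describe how multiclass.OneVsOneClassifier resolves ties.

```python
from collections import Counter
from itertools import combinations


def ovo_pairs(classes):
    """Return the k * (k - 1) / 2 class pairs, one per binary classifier."""
    return list(combinations(sorted(classes), 2))


def ovo_vote(pairwise_winners):
    """Pick the class that wins the most pairwise duels.

    `pairwise_winners` maps each (a, b) pair to the class chosen by the
    binary model trained on that pair. Ties go to the smallest label,
    which is an arbitrary choice made for this sketch.
    """
    votes = Counter(pairwise_winners.values())
    return max(sorted(votes), key=lambda c: votes[c])


pairs = ovo_pairs(["c", "a", "b"])
winner = ovo_vote({("a", "b"): "b", ("a", "c"): "a", ("b", "c"): "b"})
```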
multioutput
Fixed a bug where multioutput.ClassifierChain and multioutput.RegressorChain could not be pickled.
stats
Added stats.Shift, which can be used to compute statistics over a shifted version of a variable.
Added stats.Link, which can be used to compose univariate statistics. Univariate statistics can now be composed via the | operator.
Renamed stats.Covariance to stats.Cov.
Renamed stats.PearsonCorrelation to stats.PearsonCorr.
Renamed stats.AutoCorrelation to stats.AutoCorr.
Added stats.RollingCov, which computes the covariance between two variables over a window.
Added stats.RollingPearsonCorr, which computes the Pearson correlation over a window.
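The composition of univariate statistics with the | operator, mentioned in the Shift and Link entries above, can be illustrated with a small stand-alone sketch. The classes below reuse the names Shift, Link and Mean for readability, but they are simplified re-implementations written for this example; their update order and internals are assumptions, not the library's code.

```python
from collections import deque


class Stat:
    def __or__(self, other):
        # `a | b` feeds the output of a into b
        return Link(self, other)


class Shift(Stat):
    """Returns the value seen `amount` updates before the latest one."""

    def __init__(self, amount=1):
        self.amount = amount
        self.buffer = deque(maxlen=amount + 1)

    def update(self, x):
        self.buffer.append(x)
        return self

    def get(self):
        # None until enough values have been observed
        return self.buffer[0] if len(self.buffer) > self.amount else None


class Mean(Stat):
    def __init__(self):
        self.n, self.mean = 0, 0.0

    def update(self, x):
        self.n += 1
        self.mean += (x - self.mean) / self.n
        return self

    def get(self):
        return self.mean


class Link(Stat):
    def __init__(self, left, right):
        self.left, self.right = left, right

    def update(self, x):
        self.left.update(x)
        value = self.left.get()
        if value is not None:
            self.right.update(value)
        return self

    def get(self):
        return self.right.get()


lagged_mean = Shift(1) | Mean()
for x in [1, 2, 3]:
    lagged_mean.update(x)
# lagged_mean.get() is now the mean of 1 and 2: the latest value is excluded
```

Excluding the most recent value in this way is useful when a statistic is used as a feature and must not leak the current observation.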
stream
Added a stream.iter_sql utility function to work with SQLAlchemy.
The target_name parameter of stream.iter_csv has been renamed to target. It can now be passed a list of values in order to support multi-output scenarios.
Added stream.iter_arff for handling ARFF files.
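To show what passing a list as the target could mean in a multi-output setting, here is a small sketch built on the standard csv module. The function iter_rows is hypothetical, and returning the targets as a dict keyed by column name is an assumption about a reasonable design, not a description of stream.iter_csv.

```python
import csv
import io


def iter_rows(buffer, target):
    """Yield (features, target) pairs from a CSV text buffer.

    Hypothetical sketch: `target` is either one column name or a list of
    column names; in the latter case the target is a dict. Values are
    left as strings, with no type conversion.
    """
    for row in csv.DictReader(buffer):
        if isinstance(target, (list, tuple)):
            y = {name: row.pop(name) for name in target}
        else:
            y = row.pop(target)
        yield row, y


data = "a,b,y1,y2\n1,2,0,1\n"
rows = list(iter_rows(io.StringIO(data), target=["y1", "y2"]))
```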
tree
Removed the behavior where tree.DecisionTreeRegressor would raise an exception when no split was found.