Skip to content

BanditClassifier

Bandit-based model selection for classification.

Each model is associated with an arm. At each learn_one call, the policy decides which arm/model to pull. The reward is the performance of the model on the provided sample. The predict_one and predict_proba_one methods use the current best model.

Parameters

Attributes

  • best_model

  • models

Examples

from river import bandit
from river import datasets
from river import evaluate
from river import linear_model
from river import metrics
from river import model_selection
from river import optim
from river import preprocessing

models = [
    linear_model.LogisticRegression(optimizer=optim.SGD(lr=lr))
    for lr in [0.0001, 0.001, 1e-05, 0.01]
]

dataset = datasets.Phishing()
model = (
    preprocessing.StandardScaler() |
    model_selection.BanditClassifier(
        models,
        metric=metrics.Accuracy(),
        policy=bandit.EpsilonGreedy(
            epsilon=0.1,
            decay=0.001,
            burn_in=20,
            seed=42
        )
    )
)
metric = metrics.Accuracy()

evaluate.progressive_val_score(dataset, model, metric)
Accuracy: 88.96%

Methods

learn_one

Update the model with a set of features x and a label y.

Parameters

  • x
  • y

Returns

self

predict_one

Predict the label of a set of features x.

Parameters

  • x'dict'
  • kwargs

Returns

base.typing.ClfTarget | None: The predicted label.

predict_proba_one

Predict the probability of each label for a dictionary of features x.

Parameters

  • x

Returns

A dictionary that associates a probability which each label.