A context manager for training pipelines during a warm-up phase.
You don't have to worry about anything when you call `learn_one` with a pipeline in a training loop. The methods at each step of the pipeline will be called in the correct order.
However, during a warm-up phase, you might just be calling `learn_one` because you don't need the out-of-sample predictions. In this case, the unsupervised estimators in the pipeline won't be updated, because they are usually updated when `predict_one` is called.
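To make this concrete, here is a toy illustration, not River's actual code, of why an unsupervised step is normally updated when `predict_one` is called rather than `learn_one`. The `Scaler`, `Classifier`, and `Pipeline` classes below are hypothetical stand-ins:

```python
calls = []  # records which methods get invoked, in order

class Scaler:  # unsupervised: needs no label to learn
    def learn_one(self, x):
        calls.append("Scaler.learn_one")
    def transform_one(self, x):
        calls.append("Scaler.transform_one")
        return x

class Classifier:  # supervised: learns from (x, y) pairs
    def learn_one(self, x, y):
        calls.append("Classifier.learn_one")
    def predict_one(self, x):
        calls.append("Classifier.predict_one")
        return 0

class Pipeline:
    def __init__(self, scaler, clf):
        self.scaler, self.clf = scaler, clf
    def predict_one(self, x):
        # The unsupervised step learns from the raw features at prediction
        # time, so learn_one doesn't have to update it again afterwards.
        self.scaler.learn_one(x)
        return self.clf.predict_one(self.scaler.transform_one(x))
    def learn_one(self, x, y):
        # No scaler.learn_one here: skipping predict_one skips the update.
        self.clf.learn_one(self.scaler.transform_one(x), y)

pipe = Pipeline(Scaler(), Classifier())
pipe.predict_one(1.0)      # scaler learns, then transforms
pipe.learn_one(1.0, True)  # scaler only transforms
print(calls)
```

If you only ever call `learn_one`, as during a warm-up phase, the scaler in this sketch never learns anything.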
This context manager allows you to override that behavior and make it so that unsupervised estimators are updated when `learn_one` is called.
Let's first see which methods are called if we just call `learn_one`.
```python
import io
import logging

from river import anomaly
from river import compose
from river import datasets
from river import preprocessing
from river import utils

model = compose.Pipeline(
    preprocessing.MinMaxScaler(),
    anomaly.HalfSpaceTrees()
)

class_condition = lambda x: x.__class__.__name__ in ('MinMaxScaler', 'HalfSpaceTrees')

logger = logging.getLogger()
logger.setLevel(logging.DEBUG)

logs = io.StringIO()
sh = logging.StreamHandler(logs)
sh.setLevel(logging.DEBUG)
logger.addHandler(sh)

with utils.log_method_calls(class_condition):
    for x, y in datasets.CreditCard().take(1):
        model = model.learn_one(x)

print(logs.getvalue())
```
Now let's use the context manager and see what methods get called.
```python
logs = io.StringIO()
sh = logging.StreamHandler(logs)
sh.setLevel(logging.DEBUG)
logger.addHandler(sh)

with utils.log_method_calls(class_condition), compose.warm_up_mode():
    for x, y in datasets.CreditCard().take(1):
        model = model.learn_one(x)

print(logs.getvalue())
```

```
MinMaxScaler.learn_one
MinMaxScaler.transform_one
HalfSpaceTrees.learn_one
```
We can see that the scaler got updated before transforming the data.