Skip to content

0.17.0 - 2023-05-27

bandit

  • Bandit policies now return a single arm when the pull method is called, instead of yielding or one more arms at a time. This is simpler to understand. We will move back to multi-armed pulls in the future.
  • Added bandit.Exp3.
  • bandit.UCB and bandit.Exp3 have an extra reward_scaler parameter, which can be any object that inherits from compose.TargetTransformRegressor. This allows scaling rewards before updating arms.

compose

model_selection

  • Added model_selection.BanditClassifier, which is the classification equivalent to bandit.BanditRegressor. Both are methods to perform online model selection via a bandit policy.

multioutput

preprocessing