package prbnmcn-ucb1
Legend:
Page
Library
Module
Module type
Parameter
Class
Class type
Source
Page
Library
Module
Module type
Parameter
Class
Class type
Source
Module type Ucb1.S
Source
The type of arms (i.e. actions)
The state of a bandit.
Create a fresh bandit with given arms.
Select the UCB1-optimal action to play. The bandit expects a reward.
Assign a reward to the bandit.
Total rewards obtained by the bandit.
find_best_arm bandit f
returns the arm that maximizes f
, together with the maximizing value.
Pretty-print useful statistics on the bandit, for debugging purposes.