package prbnmcn-ucb1
Legend:
Page
Library
Module
Module type
Parameter
Class
Class type
Source
Page
Library
Module
Module type
Parameter
Class
Class type
Source
Module Ucb1
Source
The UCB1 module is parameterised by a finite set of actions presented as an array of abstract "arms", each arm corresponding to an action.
Phantom types used to tag the state of the bandit.
awaiting_rewards
tags bandits from which an arm was selected and are awaiting the reward associated to this arm.
ready_to_move
tags bandits that are ready to perform another action.
Source
type arm_statistics = {
number_of_activations : float;
(*Number of times this arm was activated.
*)cumulative_reward : float;
(*Total reward gathered by this arm.
*)empirical_reward : float;
(*Average reward gathered by this arm.
*)
}
Statistics of a given arm.