package prbnmcn-ucb1
val create : arm array -> ready_to_move t
Create a fresh bandit with given arms.
val next_action : ready_to_move t -> arm * awaiting_reward t
Select the UCB1-optimal action to play. The returned bandit expects a reward, to be supplied with set_reward.
val set_reward : awaiting_reward t -> float -> ready_to_move t
Assign a reward to the bandit.
val total_rewards : ready_to_move t -> float
Total rewards obtained by the bandit.
val find_best_arm : 'state t -> (arm_statistics -> float) -> arm * float
find_best_arm bandit f returns the arm that maximizes f, together with the maximizing value.
val pp_stats : Format.formatter -> 'state t -> unit
Pretty-print useful statistics on the bandit, for debugging purposes.
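The signatures above encode a two-state protocol in the type parameter of t: next_action only accepts a ready_to_move bandit, and set_reward only accepts an awaiting_reward one, so a reward cannot be skipped or given twice. The sketch below illustrates that calling pattern. It does not use the library itself: Toy_ucb1 is a hypothetical stand-in written for this example, with arm fixed to int and arm_statistics given two assumed fields (pulls, cumulative_reward). The library's actual module path, arm type, and statistics fields may differ.

```ocaml
(* Hypothetical stand-in that mirrors the documented signatures.
   This is NOT the library's implementation. *)
module Toy_ucb1 : sig
  type arm = int (* assumption: arms are plain integers here *)
  type arm_statistics = { pulls : float; cumulative_reward : float }
  type ready_to_move
  type awaiting_reward
  type 'state t

  val create : arm array -> ready_to_move t
  val next_action : ready_to_move t -> arm * awaiting_reward t
  val set_reward : awaiting_reward t -> float -> ready_to_move t
  val total_rewards : ready_to_move t -> float
  val find_best_arm : 'state t -> (arm_statistics -> float) -> arm * float
end = struct
  type arm = int
  type arm_statistics = { pulls : float; cumulative_reward : float }
  type ready_to_move
  type awaiting_reward

  (* 'state is a phantom parameter: it only tracks the protocol state. *)
  type 'state t = {
    arms : arm array;
    stats : arm_statistics array;
    last : int; (* index of the arm most recently selected *)
    total : float;
  }

  let create arms =
    if Array.length arms = 0 then invalid_arg "create: empty arm array";
    let stats = Array.map (fun _ -> { pulls = 0.; cumulative_reward = 0. }) arms in
    { arms; stats; last = 0; total = 0. }

  let next_action b =
    (* n is the total number of plays so far. *)
    let n = Array.fold_left (fun acc s -> acc +. s.pulls) 0. b.stats in
    (* Textbook UCB1 index: empirical mean plus exploration bonus;
       arms never played get priority. *)
    let index s =
      if s.pulls = 0. then infinity
      else (s.cumulative_reward /. s.pulls) +. sqrt (2. *. log n /. s.pulls)
    in
    let best = ref 0 in
    Array.iteri
      (fun i s -> if index s > index b.stats.(!best) then best := i)
      b.stats;
    (b.arms.(!best),
     { arms = b.arms; stats = b.stats; last = !best; total = b.total })

  let set_reward b r =
    (* Credit the reward to the arm chosen by the preceding next_action. *)
    let update i s =
      if i = b.last then
        { pulls = s.pulls +. 1.; cumulative_reward = s.cumulative_reward +. r }
      else s
    in
    { arms = b.arms; stats = Array.mapi update b.stats; last = b.last;
      total = b.total +. r }

  let total_rewards b = b.total

  let find_best_arm b f =
    let best = ref 0 in
    let best_value = ref (f b.stats.(0)) in
    Array.iteri
      (fun i s ->
        let v = f s in
        if v > !best_value then begin best := i; best_value := v end)
      b.stats;
    (b.arms.(!best), !best_value)
end

let () =
  (* Deterministic payoffs per arm, chosen only for illustration. *)
  let payoff = [| 0.2; 0.9; 0.5 |] in
  let rec play bandit rounds =
    if rounds = 0 then bandit
    else
      let arm, waiting = Toy_ucb1.next_action bandit in
      play (Toy_ucb1.set_reward waiting payoff.(arm)) (rounds - 1)
  in
  let bandit = play (Toy_ucb1.create [| 0; 1; 2 |]) 200 in
  (* Score each arm by its empirical mean reward. *)
  let mean (s : Toy_ucb1.arm_statistics) =
    if s.Toy_ucb1.pulls = 0. then 0.
    else s.Toy_ucb1.cumulative_reward /. s.Toy_ucb1.pulls
  in
  let best, value = Toy_ucb1.find_best_arm bandit mean in
  assert (best = 1);
  Printf.printf "best arm: %d (mean %.2f), total reward: %.1f\n"
    best value (Toy_ucb1.total_rewards bandit)
```

Because the state lives in the type, calling next_action twice in a row on the same value, or calling set_reward on a bandit fresh from create, is rejected by the type checker rather than failing at run time.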