package prbnmcn-ucb1

You can search for identifiers within the package.

in-package search v0.2.0

Legend:
Page
Library
Module
Module type
Parameter
Class
Class type
Source

Module type `Ucb1.S`Source

Sourcetype arm

The type of arms (i.e. actions)

Sourcetype 'state t

The state of a bandit.

Sourceval create : arm array -> ready_to_move t

Create a fresh bandit with given arms.

Sourceval next_action : ready_to_move t -> arm * awaiting_reward t

Select the UCB1-optimal action to play. The bandit expects a reward.

Sourceval set_reward : awaiting_reward t -> float -> ready_to_move t

Assign a reward to the bandit.

Sourceval total_rewards : ready_to_move t -> float

Total rewards obtained by the bandit.

Sourceval find_best_arm : 'state t -> (arm_statistics -> float) -> arm * float

find_best_arm bandit f returns the arm that maximizes f, together with the maximizing value.

Sourceval pp_stats : Format.formatter -> 'state t -> unit

Pretty-print useful statistics on the bandit, for debugging purposes.