obandit

Ocaml Multi-Armed Bandits
IN THIS PACKAGE
Module type Obandit . AlphaPhiUCBParam
val k : int

The number of actions $ K $ .

val alpha : float

The number of actions $ K $ .

The $ \alpha $ parameter.

val invLFPhi : float -> float

The $ \alpha $ parameter.

The inverse of the Legendre-Fenchel transform of $ \psi $.