package fehu

Module Fehu_algorithms

Reinforcement learning algorithms for Fehu.

This library provides production-ready implementations of standard RL algorithms. Each algorithm follows a consistent interface: create an agent with a policy network and configuration, train with learn, and use the trained policy with predict.

Available Algorithms

Policy Gradient Methods

  • Reinforce: Monte Carlo Policy Gradient (REINFORCE)

Value-Based Methods

  • Dqn: Deep Q-Network (DQN)

Usage Pattern

All algorithms follow this pattern:

  open Fehu

  (* 1. Create policy network *)
  let policy_net = Kaun.Layer.sequential [...] in

  (* 2. Initialize algorithm *)
  let agent = Algorithm.create
    ~policy_network:policy_net
    ~n_actions:n
    ~rng:(Rune.Rng.key 42)
    Algorithm.default_config
  in

  (* 3. Train *)
  let agent = Algorithm.learn agent ~env ~total_timesteps:100_000 () in

  (* 4. Use trained policy *)
  let action = Algorithm.predict agent obs ~training:false |> fst

Choosing an Algorithm

  • REINFORCE: Simple Monte Carlo policy gradient. Works for small discrete action spaces, but requires complete episodes before each update. Good for learning the basics, though sample-inefficient.
  • DQN: Off-policy value-based method with experience replay. Suited to discrete action spaces and more sample-efficient than REINFORCE.
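REINFORCE's need for complete episodes comes from its Monte Carlo return: each update weights log-probabilities by G_t = r_t + gamma * G_{t+1}, which can only be computed once the episode has terminated. A minimal, library-independent sketch of that computation (the function name is illustrative, not part of Fehu's API):

```ocaml
(* Discounted Monte Carlo returns: G_t = r_t + gamma * G_{t+1}.
   Computed backwards from the final step, which is why REINFORCE
   must wait for the episode to finish before updating. *)
let discounted_returns ~gamma rewards =
  let n = Array.length rewards in
  let returns = Array.make n 0.0 in
  let acc = ref 0.0 in
  for t = n - 1 downto 0 do
    acc := rewards.(t) +. (gamma *. !acc);
    returns.(t) <- !acc
  done;
  returns

let () =
  let g = discounted_returns ~gamma:0.9 [| 1.0; 0.0; 1.0 |] in
  Array.iter (fun x -> Printf.printf "%.3f " x) g
  (* prints 1.810 0.900 1.000 *)
```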

Future algorithms:

  • PPO: More sample efficient, supports continuous actions, industry standard
  • SAC: Off-policy actor-critic, excellent for continuous control
module Reinforce : sig ... end

Reinforce algorithm implementation.

module Dqn : sig ... end

Dqn algorithm implementation.
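DQN's sample efficiency comes from experience replay: transitions are stored as they occur, and training batches are drawn uniformly at random from the buffer, breaking temporal correlation and letting each transition be reused. A minimal, library-independent sketch of the idea (not Fehu's actual implementation; all names here are illustrative):

```ocaml
(* Fixed-capacity circular replay buffer with uniform random sampling. *)
type 'a replay_buffer = {
  data : 'a option array;
  mutable next : int;   (* index of the next write *)
  mutable size : int;   (* number of stored transitions *)
}

let create capacity = { data = Array.make capacity None; next = 0; size = 0 }

(* Store a transition, overwriting the oldest one when full. *)
let push buf x =
  buf.data.(buf.next) <- Some x;
  buf.next <- (buf.next + 1) mod Array.length buf.data;
  buf.size <- min (buf.size + 1) (Array.length buf.data)

(* Draw a batch uniformly at random (with replacement). *)
let sample buf ~batch_size =
  Array.init batch_size (fun _ ->
      match buf.data.(Random.int buf.size) with
      | Some x -> x
      | None -> assert false)
```

In a DQN training loop, each environment step would `push` a (state, action, reward, next_state, done) tuple and periodically `sample` a batch to fit the Q-network against its bootstrapped targets.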