package fehu

You can search for identifiers within the package.

in-package search v0.2.0

On This Page

Benefits
Usage

package fehu

fehu
- CHANGES
- README
- Library fehu
  - Fehu
    
    Errors
    
    Info
    
    Metadata
    
    Render
    
    Pixel
    
    Space
    
    Value
    
    Discrete
    
    Box
    
    Multi_binary
    
    Multi_discrete
    
    Tuple
    
    Dict
    
    Sequence
    
    Text
    
    Env
    
    Wrapper
    
    Vector_env
    
    Buffer
    
    Replay
    
    Rollout
    
    Training
    
    Policy
    
    Trajectory
- Library fehu.algorithms
  - Fehu_algorithms
    
    Reinforce
    
    Dqn
- Library fehu.envs
  - Fehu_envs
    
    Random_walk
    
    Grid_world
    
    Cartpole
    
    Mountain_car
- Library fehu.visualize
  - Fehu_visualize
    
    Overlay
    
    Video
    
    Sink
- Sources
  - fehu
    
    buffer.ml
    
    env.ml
    
    errors.ml
    
    fehu.ml
    
    fehu__.ml
    
    info.ml
    
    metadata.ml
    
    policy.ml
    
    render.ml
    
    space.ml
    
    training.ml
    
    trajectory.ml
    
    vector_env.ml
    
    wrapper.ml
  - fehu.algorithms
    
    dqn.ml
    
    fehu_algorithms.ml
    
    fehu_algorithms__.ml
    
    reinforce.ml
  - fehu.envs
    
    cartpole.ml
    
    fehu_envs.ml
    
    fehu_envs__.ml
    
    grid_world.ml
    
    mountain_car.ml
    
    random_walk.ml
  - fehu.visualize
    
    fehu_visualize.ml
    
    fehu_visualize__.ml
    
    overlay.ml
    
    sink.ml
    
    utils.ml
    
    wrapper_video.ml

Legend:
Page
Library
Module
Module type
Parameter
Class
Class type
Source

Module `Fehu.Vector_env`Source

Vectorized environments for parallel interaction.

Vectorization enables simultaneous stepping of multiple environments, essential for efficient on-policy data collection. See Vector_env for batched operations.

Vectorized environments for parallel interaction.

Vectorized environments run multiple environment instances in parallel, enabling efficient data collection. This module follows Gymnasium's vectorization API, batching observations, actions, and rewards across environments.

Benefits

Collect trajectories faster by stepping multiple environments simultaneously
Amortize policy inference costs across batched observations
Essential for on-policy algorithms that need large amounts of data per update

Usage

Create a vectorized environment from multiple instances:

  let envs = List.init 8 (fun _ -> make_env ()) in
  let vec_env = Vector_env.create_sync ~envs () in
  let observations, infos = Vector_env.reset vec_env () in
  let actions = (* compute batched actions *) in
  let step = Vector_env.step vec_env actions

With autoreset enabled (default), terminated environments automatically reset on the next step, returning their initial observation. This ensures continuous data collection without manual intervention.

Sourcetype autoreset_mode =

| Next_step
(*
Reset terminated environments on the next step call
*)
| Disabled
(*
Do not automatically reset; requires manual intervention
*)

Autoreset behavior for terminated episodes.

With Next_step, when an environment terminates or truncates, the next call to step returns its initial observation instead of requiring an explicit reset. This maintains a constant number of active environments.

Sourcetype ('obs, 'act, 'render) step = {

observations : 'obs array;
(*
Observations from each environment
*)
rewards : float array;
(*
Rewards from each environment
*)
terminations : bool array;
(*
Natural termination flags
*)
truncations : bool array;
(*
Artificial truncation flags
*)
infos : Info.t array;
(*
Info dictionaries from each environment
*)

}

Batched step result from all environments.

Sourcetype ('obs, 'act, 'render) t

Vectorized environment handle managing multiple environment instances.

Source

val create_sync : 
  ?autoreset_mode:autoreset_mode ->
  envs:('obs, 'act, 'render) Env.t list ->
  unit ->
  ('obs, 'act, 'render) t

create_sync ~autoreset_mode ~envs () creates a synchronous vectorized environment.

Wraps envs to provide batched operations. All environments are stepped sequentially in the current process. For true parallelism, consider asynchronous implementations (not yet provided).

Parameters:

autoreset_mode: Controls automatic resetting of terminated environments (default: Next_step)
envs: List of environment instances to vectorize. Must be non-empty and share compatible observation/action spaces

raises Invalid_argument
if envs is empty.

Sourceval num_envs : ('obs, 'act, 'render) t -> int

num_envs vec_env returns the number of parallel environments.

Sourceval observation_space : ('obs, 'act, 'render) t -> Space.packed

observation_space vec_env returns the observation space of the vectorized environment.

All constituent environments share the same observation space.

Sourceval action_space : ('obs, 'act, 'render) t -> Space.packed

action_space vec_env returns the action space of the vectorized environment.

All constituent environments share the same action space.

Sourceval metadata : ('obs, 'act, 'render) t -> Metadata.t

metadata vec_env returns the metadata of the vectorized environment.

Sourceval reset : ('obs, 'act, 'render) t -> unit -> 'obs array * Info.t array

reset vec_env () resets all environments.

Returns (observations, infos) where each array has length num_envs, containing the initial observation and info from each environment.

Sourceval step : ('obs, 'act, 'render) t -> 'act array -> ('obs, 'act, 'render) step

step vec_env actions executes actions in all environments.

Takes an array of actions with length num_envs, steps each environment, and returns batched results.

If autoreset is enabled (Next_step), terminated environments automatically reset and return their initial observation. The terminations and truncations arrays indicate which environments ended before resetting. Infos for terminated environments include a `final_observation` key with the structured final observation encoded as an Info.value.

raises Invalid_argument
if actions length doesn't match num_envs.

Sourceval render : ('obs, 'act, 'render) t -> 'render option array

render vec_env calls Env.render on each underlying environment.

Returns an array of render outputs (or None if an environment does not produce a frame).

Sourceval envs : ('obs, 'act, 'render) t -> ('obs, 'act, 'render) Env.t array

envs vec_env returns the array of underlying environments. Mutating the array updates the vector environment in place.

Sourceval close : ('obs, 'act, 'render) t -> unit

close vec_env closes all constituent environments.

Releases resources held by all environments. Subsequent operations will fail.

package fehu

Module Fehu.Vector_envSource

Benefits

Usage

Module `Fehu.Vector_env`Source