package kaun
Install
dune-project
Dependency
Authors
Maintainers
Sources
sha256=96d35ce03dfbebd2313657273e24c2e2d20f9e6c7825b8518b69bd1d6ed5870f
sha512=90c5053731d4108f37c19430e45456063e872b04b8a1bbad064c356e1b18e69222de8bfcf4ec14757e71f18164ec6e4630ba770dbcb1291665de5418827d1465
doc/kaun.hf/Kaun_hf/index.html
Module Kaun_hfSource
HuggingFace Hub integration.
Download pretrained model weights and configuration files from the HuggingFace Hub. Supports single-file and sharded SafeTensors checkpoints, caching, authentication, and offline mode.
let config =
Kaun_hf.load_config ~model_id:"bert-base-uncased" ()
in
let weights =
Kaun_hf.load_weights ~model_id:"bert-base-uncased" ()
in
(* weights : (string * Kaun.Ptree.tensor) list *)Types
The type for repository revisions.
Downloading
val download_file :
?token:string ->
?cache_dir:string ->
?offline:bool ->
?revision:revision ->
model_id:string ->
filename:string ->
unit ->
stringdownload_file ~model_id ~filename () is the local path to filename from the repository model_id.
The file is downloaded to the cache on first access and served from there on subsequent calls.
token is a HuggingFace API token for private repositories. Defaults to the value of HF_TOKEN.
cache_dir defaults to {RAVEN_CACHE_ROOT}/huggingface, or {XDG_CACHE_HOME}/raven/huggingface when unset.
offline defaults to false. When true, only cached files are returned.
revision defaults to Main.
Raises Failure if the download fails or the file is not cached in offline mode.
Loading
val load_config :
?token:string ->
?cache_dir:string ->
?offline:bool ->
?revision:revision ->
model_id:string ->
unit ->
Jsont.jsonload_config ~model_id () is the parsed config.json from model_id.
Parameters are the same as download_file.
Raises Failure on download or JSON parse errors.
val load_weights :
?token:string ->
?cache_dir:string ->
?offline:bool ->
?revision:revision ->
model_id:string ->
unit ->
(string * Kaun.Ptree.tensor) listload_weights ~model_id () is the list of (name, tensor) pairs from model_id's SafeTensors checkpoint.
Handles sharded checkpoints transparently: when model.safetensors.index.json is present, all referenced shards are downloaded and merged. Falls back to model.safetensors when no index exists.
Tensor names are the raw keys from the SafeTensors file (e.g. "bert.encoder.layer.0.attention.self.query.weight"). Model code is responsible for mapping these to its own parameter structure.
Parameters are the same as download_file.
Raises Failure if no SafeTensors files are found, or on download/parse errors.
Cache management
clear_cache () removes all cached files.
When model_id is given, only that model's cache is removed.