package doi2bib

  1. Overview
  2. Docs
CLI tool to get bibtex entries from DOI/arXiv/PubMed IDs

Install

dune-project
 Dependency

Authors

Maintainers

Sources

doi2bib-0.7.9.tbz
sha256=22c68e7390cedcff3acc87a3638e22dbf6df630a89bcb831bab34b0a22b48fe6
sha512=7c6c9bab1831112888a3c5c7d10092fe1eb211a3cfb53d5e9712c2b57ac30e48038bdf5fed8c1e12f2b384023f2a69ea1e818b2a8f8822283ed135fbe6c5ba43

Description

Published: 23 Oct 2025

README

doi2bib Build status

Small CLI tools to work with bibtex entries: get entries from DOI/arXiv/PubMed IDs and format bibtex files.

Just so you know, there is now Zotero BIB on the browser that can do this (and more). I will keep maintaining doi2bib though, since it is an integral part of my workflow.

Tools

This package provides three CLI tools:

  1. doi2bib - Get bibtex entries from DOI, arXiv ID, or PubMed ID (pretty printed with bibfmt)
  2. bibfmt - Pretty print and format bibtex files (using very few dependencies)
  3. bibdedup - Deduplicate BibTeX entries across multiple files

doi2bib Usage

$ doi2bib --help=plain
NAME
   doi2bib - A little CLI tool to get the bibtex entry for a given DOI,
   arXiv or PubMed ID.

SYNOPSIS
   doi2bib [OPTION]... [ID]

ARGUMENTS
   ID  A DOI, an arXiv ID or a PubMed ID. The tool tries to automatically
       infer what kind of ID you are using. You can force the cli to
       lookup a DOI by using the form 'doi:ID' or an arXiv ID by using
       the form 'arXiv:ID'. PubMed IDs always start with 'PMC'.

OPTIONS
   -f FILE, --file=FILE
       With this flag, the tool reads the file and processes its lines
       sequentially, treating them as DOIs, arXiv IDs or PubMed IDs.
       Errors will be printed on standard error but will not terminate
       the operation.

   --help[=FMT] (default=auto)
       Show this help in format FMT. The value FMT must be one of `auto',
       `pager', `groff' or `plain'. With `auto', the format is `pager` or
       `plain' whenever the TERM env var is `dumb' or undefined.

   -o OUTPUT, --output=OUTPUT (absent=stdout)
       Append the bibtex output to the specified file. It will create the
       file if it does not exist.

   --version
       Show version information.

EXIT STATUS
   doi2bib exits with the following status:

   0   on success.

   124 on command line parsing errors.

   125 on unexpected internal errors (bugs).

BUGS
   Report bugs to https://github.com/mseri/doi2bib/issues

The tool retrieves the bibtex entry using published details when possible.

bibfmt Usage

$ bibfmt --help=plain
NAME
   bibfmt - A little CLI tool to pretty print bibtex files.

SYNOPSIS
   bibfmt [--file=FILE] [--output=OUTPUT] [OPTION]...

OPTIONS
   -f FILE, --file=FILE
       Reads the bib content from the specified file instead of the
       standard input.

   --force
       Force mode: ignore parsing errors and output only successfully
       parsed entries.

   -o OUTPUT, --output=OUTPUT (absent=stdout)
       Saves the pretty printed bib to the specified file.

   -q, --quiet
       Quiet mode: suppress all output except warnings and errors.

   -s, --strict
       Enable strict parsing mode that rejects BibTeX files with
       duplicate fields.

   --help[=FMT] (default=auto)
       Show this help in format FMT. The value FMT must be one of `auto',
       `pager', `groff' or `plain'. With `auto', the format is `pager` or
       `plain' whenever the TERM env var is `dumb' or undefined.

   --version
       Show version information.

EXIT STATUS
   bibfmt exits with the following status:

   0   on success.

   123 on indiscriminate errors reported on standard error.

   124 on command line parsing errors.

   125 on unexpected internal errors (bugs).

BUGS
   Report bugs to https://github.com/mseri/doi2bib/issues

bibdedup Usage

$ bibdedup --help=plain
NAME
   bibdedup - Deduplicate BibTeX entries across multiple files.

SYNOPSIS
   bibdedup [OPTION]... FILES...

ARGUMENTS
   FILES  BibTeX files to deduplicate.

OPTIONS
   -i, --interactive
       Enable interactive mode to resolve conflicts. If not set,
       automatically keeps the first occurrence of conflicting fields.

   -k KEYS, --keys=KEYS (absent=title,author,year)
       Comma-separated list of field names to use for duplicate
       detection. Special key 'citekey' matches on citation keys.
       Default: title,author,year

   -o OUTPUT, --output=OUTPUT (absent=stdout)
       Output file for deduplicated BibTeX. If not specified, writes to
       stdout.

   -s, --strict
       Enable strict mode that checks for and reports duplicate fields
       in entries.

   --help[=FMT] (default=auto)
       Show this help in format FMT. The value FMT must be one of `auto',
       `pager', `groff' or `plain'. With `auto', the format is `pager` or
       `plain' whenever the TERM env var is `dumb' or undefined.

   --version
       Show version information.

EXIT STATUS
   bibdedup exits with the following status:

   0   on success.

   124 on command line parsing errors.

   125 on unexpected internal errors (bugs).

BUGS
   Report bugs to https://github.com/mseri/doi2bib/issues

Examples

doi2bib Examples

Print bibtex entry to standard output:

$ doi2bib 10.1007/s10569-019-9946-9
$ doi2bib 1902.00436
$ doi2bib arXiv:1609.01724
$ doi2bib PMC2883744

Save bibtex entry to a file:

$ doi2bib doi:10.4171/JST/226 -o "bibliography.bib"

This will create the file if not present or append the bibliography to the existing file.

You can batch-process lists of entries by listing them line by line in a file and using the --file option.

bibfmt Examples

Format a bibtex file and print to stdout:

$ bibfmt -f bibliography.bib

Format a bibtex file and save to a new file:

$ bibfmt -f messy.bib -o clean.bib

Format bibtex content from stdin:

$ echo "@article{key, title={My Title}, author={John Doe}}" | bibfmt

Format with strict mode to check for duplicate fields (these can be removed with bibdedup):

$ bibfmt -f bibliography.bib --strict -q

You can use quiet mode to suppress normal output and only see warnings/errors:

$ bibfmt -f messy.bib --quiet

Force formatting even with parsing errors, by removing all the problematic entries (only do this after careful consideration):

$ bibfmt -f problematic.bib --force -o partial.bib

bibdedup Examples

Deduplicate entries from multiple files:

$ bibdedup file1.bib file2.bib -o merged.bib

Use custom keys for duplicate detection:

$ bibdedup --keys doi papers1.bib papers2.bib -o output.bib
$ bibdedup --keys title,year lib1.bib lib2.bib -o combined.bib

Deduplicate using citation keys:

$ bibdedup --keys citekey old.bib new.bib -o updated.bib

Interactive mode for conflict resolution:

$ bibdedup --interactive --keys title,author,year *.bib -o curated.bib

Enable strict mode to check for duplicate fields:

$ bibdedup --strict --keys doi papers.bib -o clean.bib

Installation

Each release comes with attached binaries for Windows, Mac, and Linux. You can simply unpack the binaries (doi2bib or bibfmt) and place them in a folder accessible by your terminal.

Building from Source

To build the package yourself, use opam:

$ opam install doi2bib    # or bibfmt if you only need the pretty printer

This will install both doi2bib and bibfmt tools, since the latter is a dependency of doi2bib.

To run the tests, clone this repository and from the root of the project run:

$ opam install --deps-only .    # first time only
$ dune runtest

Troubleshooting

If on macOS you get a Library not loaded: /usr/local/opt/gmp/lib/libgmp.10.dylib failure, you will need to install gmp:

  • MacPorts users: port install gmp
  • Homebrew users: brew install gmp

Editor Integration

Zed Configuration

Use the following to configure bibfmt as your bibtex formatter in Zed:

"languages": {
  "BibTeX": {
    "formatter": {
      "external": {
        "command": "/path/to/bibfmt"
      }
    }
  }
}

Replace /path/to/bibfmt with the actual path to your bibfmt binary.

Other Editors

Since bibfmt reads from stdin and writes to stdout by default, it can be easily integrated with other editors that support external formatters. The tool will preserve the content if parsing errors are encountered, making it safe to use in automated workflows.

API References

Dependencies (13)

  1. re >= "1.0.0"
  2. tls-lwt
  3. tls >= "0.12.0" & < "0.16"
  4. bigstringaf >= "0.2.0"
  5. lwt >= "5.5.0"
  6. ezxmlm >= "1.1.0"
  7. clz >= "0.1.0"
  8. cmdliner >= "1.1.0"
  9. cohttp-lwt-unix >= "2.5.0"
  10. astring >= "0.8.0"
  11. bibfmt = version
  12. ocaml >= "4.08"
  13. dune >= "2.8"

Dev Dependencies (1)

  1. odoc with-doc

Used by

None

Conflicts

None