package unstrctrd

  1. Overview
  2. Docs
Unstructured parser

Install

Dune Dependency

Authors

Maintainers

Sources

unstrctrd-v0.1.tbz
sha256=a7940f56f118383770bbd31a6dcca1282831031a60c2fd57c4801eca6484435d
sha512=95d5ad7090c9de99f28f1b7d95c39fce506239d702f0001e85e59a05186e76099893b8af925b2f5e1db50bb17b817cc58add40003c4f111147e272f91b8d5bb6

Description

Fast and compliant Unstructured parser according RFC 822

Published: 30 Jan 2020

README

Unstrctrd

An header of an email has a formal format described by RFC5322. After mrmime, it reveals that the more general form for any values of fields (like a date, an email address, etc.) is the unstructured form.

This library wants to provide the first ground of how to parse an email header. From that, we want to post-process unstructured values to cast them to any expected values like email address.

The idea behind this library is to handle a common format which can be found into several standards, the folding-whitespace. For example, the *.deb* file uses this kind of format where it's possible that one field can be associated to a multiline value.

Description: my superb Debian
 package!

This library wants to fold the value and, by this way, delete insignificant folding-whitespace to be able to apply a post-process like: parse an email address (eg. emile for more examples).

To: my.valid.mail
 (comment) @x25519.net
API

unstrctrd comes with several post-processes like:

  • val fold_fws : t -> t

  • val without_comments : t -> (t, [> error ]) result

  • val split_at : index:int -> t -> t * t

  • val split_on : on:[ WSP | FWS | Uchar of Uchar.t | Char of char | LF | CR ] -> t -> (t * t) option

Of course, it provides processes to manipulate a string and to convert an unstructured value to an UTF-8 string. With that, we can imply that unstrctrd handles UTF-8 encoding (and only UTF-8 according RFC 6532).

The API gives you a way to craft an unstructured value and ensures that this value is correct (and invalid any unstructured values which produces CRLF terminating token)

Angstrom

unstrctrd.parser provides an angstrom parser which can be safely composed with others angstrom parsers. It requires the allocation of an internal buffer used by ocamllex and ensure the safety.

Dependencies (5)

  1. angstrom >= "0.11.0"
  2. fmt
  3. uutf
  4. dune >= "1.10"
  5. ocaml >= "4.07.0"

Dev Dependencies (6)

  1. hxd with-test & = "0.1.0"
  2. bigstringaf with-test
  3. ke with-test
  4. alcotest with-test
  5. rresult with-test
  6. ocaml with-test & < "5.2"

Used by (2)

  1. mrmime >= "0.2.0" & < "0.3.1"
  2. multipart_form-lwt

Conflicts

None