package fmlib_pretty
Install
    
    dune-project
 Dependency
Authors
Maintainers
Sources
sha256=53cc6d31ce38c0de01b5921a7980219ba7dc6ba019c126d970834f838fde4a32
    
    
  md5=8a3862f8f0b47de77ac738b834c4e6df
    
    
  doc/pretty.html
Pretty Printing Overview
Basics
The pretty printer allows to print nicely formatted ascii text. The user generates a document with break hints. The primitives to generate documents are
- emptyEmpty document.
- text strDocument which contains the string- str.- strshould not contain newlines in order not to interfere with the formatter.
- break strBreak hint with alternative text- str.
- doc2 <+> doc2Concatenation of the documents- doc1and- doc2
- nest indent docIndented document.
- group docTreat all top level break hints of- docconsistently i.e. either print all break hints with their alternative text or as a newline.
With these primitives a surprisingly rich set of formattings can be made.
The user generates documents not only by using the primitives. There are a lot of convenience functions to make document generation easy.
Document creation is done lazily. Only very few resources are consumed in producing a document. The work starts with the layout function. The layout function does the layout and never buffers more than one line.
Layout is done lazily as well. The layout generates a stream of characters. Lines are formatted only if the characters of the line are pulled out of the stream.
If you just create a document and layout it but you never use the stream, then no work is done.
Term Printing
The usage of the pretty printer is best explained by an example. Suppose we want to print the function application f a b (g c d) e where the function names and arguments might have different length. We create a document which represents the structure by
let doc =
    group (
        text "f" <+> space <+>
        indent
            2
            (stack_or_pack
                " "
                [text "a";
                 text "b";
                 group (
                     text "(g" <+> space <+>
                     indent
                        2
                        (stack_or_pack " " [text "c"; text "d"])
                     <+> text ")");
                 text "e"])
    )where text "blabla" is a document with some unbreakable text, <+> concatenates two documents, space is a break hint whose alternative text is a blank, stack_or_pack atxt [...] stacks a list of documents separated by a break hint with the alternative text atxt.
The command
let stream = layout 5 doc creates a stream of characters which is nicely formatted using a desired line width of 5 characters. Since 5 characters are not enough to put any of the subterms completely on a line, the output is
123456789012345
f
  a
  b
  (g
    c
    d)
  di.e. each break hint is printed as a newline.
If we give the pretty printer a line width of 10, it could pack the application g c d on a line and print
123456789012345
f
  a
  b
  (g c d)
  dIf the pretty printer has enough line width e.g. a line width of 15, it can put the whole expression on a line.
123456789012345
f a b (g c d) dBy using stack_or_pack we instructed the pretty printer to either print all break hints as newlines or all break hints with their alternative texts. If we use pack instead of stack_or_pack, the pretty printer tries to pack as many arguments as possible on a line.
E.g. with a line width of 11 and using pack instead of stack_or_pack we get the output
123456789012345
f
  a b
  (g c d) dWith a line width of 10 and using pack we get
123456789012345
f
  a b
  (g c d)
  dbecause the pretty printer cannot pack (g c d) and d on a single line.
Character Stream
The basic type t of the pretty printer is a lazy character stream. I.e. characters are only generated if needed. The pretty printer implements the interface Fmlib_std.Interfaces.SOURCE to represent a character stream. You can ask the stream has_more r whether there are more characters in the stream and peek r to get the next character. The instruction advance r returns the stream r advanced by one character position.
The pretty printer has a function string_of r to return a string representation of the character stream.
However you very rarely need a string representation of a character stream. All functions in Fmlib are able to handle character streams.
Formatted Paragraphs
There are functions to generate formatted paragraphs with indentation.
    let words =
        wrap_words "bla bla bla bla bla bla bla" <+> cut
    let doc = paragraphs [
        words;
        words;
        nest 4 words;
        words;
    ]
    let stream = layout 16 docThe stream produces the following output
  12345678901234567890
    bla bla bla bla
    bla bla bla
    bla bla bla bla
    bla bla bla
        bla bla bla
        bla bla bla
        bla
    bla bla bla bla
    bla bla blaGenerate Documents
Clearly, it is tedious to write documents by hand. Usually you have some tree like structure and you want to generate a document from the tree structure.
Let's assume you have a tree structure like
type tree =
    { name: string; children: tree list; }
let leaf (name: string): tree =
    {name; children = [] }
let tree (name: string) (children: tree list): tree =
    {name; children}Write a function which converts the tree structure to a document.
let doc_of_tree (tree: tree): doc =
    let rec doc is_top tree =
        match tree.children with
        | [] ->
            text tree.name
        | _ ->
            let d =
                parent_child
                    " " 2
                    (text tree.name)
                    (children tree.children ())
            in
            if is_top then
                d
            else
                char '(' <+> d <+> char ')'
    and children lst () =
        match lst with
        | [last] ->
            doc false last
        | head :: tail ->
            doc false head <+> space
            >> children tail    (* Lazy concatenation!! *)
        | [] ->
            assert false (* 'lst' is never empty *)
    in
    doc true treeThen the simple command
tree
    "f"
    [leaf "a";
     leaf "b";
     tree "g" [leaf "c"; leaf "d"];
     leaf "e"]
|> layout 10generates the character stream
123456789012345
f
  a
  b
  (g c d)
  eNote the usage of the lazy concatentation operator >> in the recursive part of the function handling the children. This makes sure that even if the tree structure is hugh, the iteration over it is done only on demand. I.e. recursive calls are made only if the corresponding characters are needed when processing the character stream.