A collection of tools for processing PDF files.


LTS Haskell 22.30: 0.1.4
Stackage Nightly 2024-07-24: 0.1.4
Latest on Hackage: 0.1.4


BSD-3-Clause licensed by Yuras Shumovich
Maintained by Yuras Shumovich

Mid level tools for processing PDF files.

Level of abstraction: document, catalog, page


  • fix compilation on ghc 7.4, 7.6 and 7.8
  • fix xobject handling in text extraction

  • support xobjects in text extraction

  • switch to errors-2.0

  • support ghc-7.10.1

  • support crypto handler version 4 (V2 and AESV2)

  • extracting text: try to insert spaces and newlines
  • fix attoparsec module deprecation warnings
  • fix AMP warnings