pdf-toolbox-document

A collection of tools for processing PDF files.

https://github.com/Yuras/pdf-toolbox

Version on this page: 0.1.2
LTS Haskell 22.43: 0.1.4
Stackage Nightly 2024-12-04: 0.1.4
Latest on Hackage: 0.1.4

BSD-3-Clause licensed by Yuras Shumovich
Maintained by Yuras Shumovich
This version can be pinned in stack with: pdf-toolbox-document-0.1.2@sha256:4fb6512c5061d80970c3533e98cb609dc6df5982a0d79fa133319e8f210be175,2343
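
A pin like this normally goes under the extra-deps key of a project's stack.yaml. The fragment below is a minimal sketch, not taken from the package documentation. The resolver line is an assumption chosen for illustration (the snapshot named on this page), so substitute the snapshot your project actually uses:

```yaml
# stack.yaml (sketch; the resolver value is an assumed example)
resolver: lts-22.43

extra-deps:
  # Overrides the snapshot's version with 0.1.2, verified by hash and cabal file size
  - pdf-toolbox-document-0.1.2@sha256:4fb6512c5061d80970c3533e98cb609dc6df5982a0d79fa133319e8f210be175,2343
```

Pinning by hash makes stack use exactly this revision of the package's cabal file rather than whatever revision is the latest on Hackage.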

Mid-level tools for processing PDF files.

Level of abstraction: document, catalog, page

Changes

unreleased

0.1.2

  • add missing file in test suite (#71)
  • reexport Info from Pdf.Document.Info (#73)

0.1.1

  • rework API
  • support ghc from 8.0 to 8.10 and drop older versions
  • lots of improvements to text extraction
  • interpret unknown xref stream entry type as reference to null object
  • support crypto handler version 4 (V2 and AESV2)

0.0.4.0

  • extracting text: try to insert spaces and newlines
  • fix attoparsec module deprecation warnings
  • fix AMP warnings