pdf-toolbox-core

A collection of tools for processing PDF files.

Version on this page:	0.1.1@rev:1
LTS Haskell 24.36:	0.1.3
Stackage Nightly 2025-07-14:	0.1.3
Latest on Hackage:	0.1.3

See all snapshots pdf-toolbox-core appears in

BSD-3-Clause licensed by Yuras Shumovich

Maintained by Yuras Shumovich

This version can be pinned in stack with:pdf-toolbox-core-0.1.1@sha256:9f3a9eea11420982f4f84addda9994d6ee756e9c9ed5c1691214ab0fcc80b6c0,3944

Module documentation for 0.1.1

Pdf
- Pdf.Core
  - Pdf.Core.Encryption
  - Pdf.Core.Exception
  - Pdf.Core.File
  - Pdf.Core.IO
    - Pdf.Core.IO.Buffer
  - Pdf.Core.Name
  - Pdf.Core.Object
    - Pdf.Core.Object.Builder
    - Pdf.Core.Object.Util
  - Pdf.Core.Parsers
  - Pdf.Core.Stream
    - Pdf.Core.Stream.Filter
      - Pdf.Core.Stream.Filter.FlateDecode
      - Pdf.Core.Stream.Filter.Type
  - Pdf.Core.Types
  - Pdf.Core.Util
  - Pdf.Core.Writer
  - Pdf.Core.XRef

Depends on 14 packages(full list with versions):

attoparsec, base, base16-bytestring, bytestring, cipher-aes, cipher-rc4, containers, crypto-api, cryptohash, hashable, io-streams, scientific, unordered-containers, vector

Used by 2 packages in lts-21.25(full list with versions):

pdf-toolbox-content, pdf-toolbox-document

Low level tools for processing PDF files.

Level of abstraction: cross reference, trailer, indirect object, object

The API is based on random access input streams, and is designed to be memory efficient. We don't need to parse the entire PDF file and store it in memory when you need just one page or two. Usually it is also leads to time efficiency, but we don't try optimize performance by e.g. maintaining xref or object cache. Higher level layers should take care of it.

The library is low level. It may mean that you need to be familiar with PDF file internals to actually use it.

Changes

unreleased

0.1.1

rework API
support ghc from 8.0 to 8.10 and drop older versions
interpret unknown xref stream entry type as reference to null object
support 1- and 2-digit escapes sequence in literal string

0.0.3.0

add Functor and Applicative instances to fix AMP warnings
fix attoparsec module deprication warnings
add scientific dependency latest attoparsec uses it for numbers