Apache-2.0 licensed by Composewell Technologies and Contributors
Maintained by [email protected]
This version can be pinned in stack with:unicode-data-names-0.6.0@sha256:4783ba5d4db463a2f722527cc9aba7b9825ddabaedc10360af0a81785e25ae14,5723

Module documentation for 0.6.0

  • Unicode
    • Unicode.Char
    • Unicode.Internal
      • Unicode.Internal.Char
        • Unicode.Internal.Char.Label
        • Unicode.Internal.Char.Names
          • Unicode.Internal.Char.Names.Version
        • Unicode.Internal.Char.UnicodeData
          • Unicode.Internal.Char.UnicodeData.DerivedName
          • Unicode.Internal.Char.UnicodeData.NameAliases
Depends on 2 packages(full list with versions):
Used by 1 package in nightly-2026-06-05(full list with versions):

README

unicode-data-names provides Haskell APIs to efficiently access the Unicode character names and aliases from the Unicode character database.

There are 3 APIs:

  • String API: enabled by default.
  • ByteString API: enabled via the package flag has-bytestring.
  • Text API: enabled via the package flag has-text.

The Haskell data structures are generated programmatically from the Unicode character database (UCD) files. The latest Unicode version supported by this library is 17.0.0.

Please see the Haddock documentation for reference documentation.

Comparing with ICU

We can compare the implementation against ICU. This requires working with the source repository, as we need the internal package icu.

Warning: An ICU version with the exact same Unicode version is required.

cabal run -O2 --flag dev-has-icu unicode-data-names:tests -- -m ICU

Comparing with Python

In order to check Unicode implementation in Haskell, we compare the results obtained with Python.

Warning: A Python version with the exact same Unicode version is required.

cabal run -O2 -f "export-all-chars" -v0 export-all-chars > ./test/all_chars.csv
python3 ./test/check.py -v ./test/all_chars.csv

Licensing

unicode-data-names is an open source project available under a liberal Apache-2.0 license.

Contributing

As an open project we welcome contributions.

Changes

Changelog

0.6.0 (September 2025)

0.5.0 (September 2025)

0.4.0 (July 2024)

  • Updated to Unicode 15.1.0.
  • Added label and nameOrLabel to Unicode.Char.General.Names.

0.3.0 (July 2024)

  • Improved performance.
  • Added opional support for ByteString API. Use the package flag has-bytestring to enable it.
  • Added opional support for Text API. Use the package flag has-text to enable it.
  • Added unicodeVersion to Unicode.Char.General.Names.
  • Fixed the inlining of Addr# literals and reduce their size. This results in a sensible decrease of the executable size.

0.2.0 (September 2022)

0.1.0 (June 2022)

  • Initial release