Nerf, the named entity recognition tool based on linear-chain CRFs

Latest on Hackage:0.5.3

This package is not currently in any snapshots. If you're interested in using it, we recommend adding it to Stackage Nightly. Doing so will make builds more reliable, and allow to host generated Haddocks.

BSD3 licensed by Jakub Waszczuk

The package provides the named entity recognition (NER) tool divided into a back-end library (see the NLP.Nerf module) and the front-end tool nerf. Using the library you can model and recognize named entities (NEs) which, for a particular sentence, take the form of forest with NE category values kept in internal nodes and sentence words kept in forest leaves.

To model NE forests we combine two different techniques. The IOB codec is used to translate to and fro between the original, forest representation of NEs and the sequence of atomic labels. In other words, it provides two isomorphic functions for encoding and decoding between both representations. Linear-chain conditional random fields, on the other hand, provide the framework for label modelling and tagging.

comments powered byDisqus