HERD

HERD (Hajen Entity Recognizer and Disambiguator) is a tool for automatically recognizing names in text (entity recognition) and specifying who is meant (disambiguation).

It is written in Java, and depends on Solr Text Tagger, by David Smiley and has a lot of inspiration from the Tulip project by Marek Lipczak et al.

The code will not run as is. It contains static paths to directories on my machine, and needs Wikipedia to be processed a couple of times to generate said files. It can be interesting for someone to read though.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

HERD

Files

README.md

Latest commit

History

README.md

File metadata and controls

HERD