Eihera
Short description:
Named entity recognizer and classifier for Basque
Authors (IXA members):
Link (demo):
Contact:
i.alegria@ehu.es
Description:
Eihera is a system for named entity recognition and classification (NERC) in written Basque. The system was built in four steps: first, a recognizer was developed from linguistic information encoded in finite-state transducers; second, semi-automatically annotated corpora were generated from the output of these transducers; third, different machine learning (ML) techniques were trained on these corpora to obtain the best possible recognizer; and finally, the resulting recognizers were combined.
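The final step, combining several recognizers, can be illustrated with a small sketch. The description does not say how Eihera combines its recognizers, so the token-level majority vote below is only an assumption chosen for illustration; the function name `combine_by_vote`, the label names (`PER`, `ORG`, `LOC`, `O`) and the example predictions are likewise hypothetical and not taken from the system.

```python
from collections import Counter
from typing import List, Sequence


def combine_by_vote(predictions: Sequence[Sequence[str]]) -> List[str]:
    """Combine per-token labels from several recognizers by majority vote.

    Assumption of this sketch: each recognizer emits one label per token.
    Ties are resolved in favour of the recognizer listed first.
    """
    if not predictions:
        return []
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all recognizers must label the same tokens")

    combined = []
    for labels in zip(*predictions):
        counts = Counter(labels)
        best = max(counts.values())
        # Pick the first label, in recognizer order, that reaches the top count.
        combined.append(next(label for label in labels if counts[label] == best))
    return combined


# Hypothetical outputs for the tokens "Jon Bilbon bizi da".
rule_based = ["PER", "LOC", "O", "O"]  # e.g. the finite-state recognizer
ml_model_a = ["PER", "O", "O", "O"]    # e.g. one ML recognizer
ml_model_b = ["ORG", "LOC", "O", "O"]  # e.g. another ML recognizer

result = combine_by_vote([rule_based, ml_model_a, ml_model_b])
print(result)
```

Listing the rule-based recognizer first means it wins ties, which is one simple way to express a preference among recognizers; weighted voting or a trained meta-classifier would be equally plausible alternatives.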
Functionality:
Eihera classifies named entities into three classes: person, organization, and location.
Technology:
Finite-state technology and machine learning.
Modules:
Rule-based recognition, ML-based recognition, rule-based classification, ML-based classification. Eustagger is applied beforehand as a preprocessing step.
Innovation:
It is the first NERC system for Basque.
Development:
Developed within several projects funded by the Basque Government and the Spanish R&D agency.
Publications (papers):