Deep Cross-Lingual Coreference Resolution for Less-ResourcedLanguages: The Case of Basque

In this paper, we present a cross-lingual neural coreference resolution system for a less-resourced language such as Basque. To begin with, we build the first neural coreferenceresolution system for Basque, training it with the relatively small EPEC-KORREF corpus (45,000 words). Next, a cross-lingual coreference resolution system is designed. With this approach, the system learns from a bigger English corpus, using cross-lingual embeddings, to perform the coreference resolution for Basque. The cross-lingual system obtains slightly better results (40.93 F1 CoNLL) than the monolingual system (39.12 F1 CoNLL),without using any Basque language corpus to train it.
Egileak (ixakideak): 
Egileak: 
Gorka Urbizu, Ander Soraluze, Olatz Arregi
Fitxategi publikoak: 
Urtea: 
2019
Artikuluaren erreferentzia: 
Proceedings of the 2nd Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2019), co-located with NAACL 2019
ISBN edo ISSN (aldizkari, kongresu, liburu edo liburu atalak): 
978-1-948087-97-1

Argitalpen mota:

Argitalpen mota fina (argitalpen_sailkapen_ohia):