Paper

Neurona-sareetan oinarritutako euskararako korreferentzia-ebazpena

Lan honek euskararako korreferentzia-ebazpenean egindako lanari jarraipena ematea du helburu, korreferentzia-ebazpenerako neurona-sareetan oinarritutako sistema bat eraikiz. Horretarako polonierarako eraikitako sistema
bat hartu da abiapuntutzat, eta euskarara egokitu. EPEC-KORREF corpusetik abiatuta, aipamen-bikoteak
eta hauen ezaugarriak erauzi dira eta neurona-sarea entrenatu da aipamen-bikoteak korreferenteak ote diren
erabakitzeko. Jarraian, neurona-sarearen iragarpenetatik korreferentzia-klusterrak sortu eta ebaluatu egin dira.

Adapting NMT to caption translation in Wikimedia Commons for low-resource languages

This paper presents a successful domain adaptation of a general neural machine
translation (NMT) system using a bilingual corpus created with captions for images in Wiki-
media Commons for the Spanish-Basque and English-Irish pairs.
Keywords: Machine Translation, Low-resource languages, Bilingual corpora, Language
resources from Wikipedia

Interpretable Deep Learning to Map Diagnostic Texts to ICD10 Codes

Background
Automatic extraction of morbid disease or conditions contained in Death Certificates is a critical process, useful for billing, epidemiological studies and comparison across countries. The fact that these clinical documents are written in regular natural language makes the automatic coding process difficult because, often, spontaneous terms diverge strongly from standard reference terminology such as the International Classification of Diseases (ICD).

Literal occurrences of Multiword Expressions: rare birds that cause a stir

Multiword expressions can have both idiomatic and literal occurrences. For instance pulling strings can be understood either as making use of one’s influence, or literally. Distinguishing these two cases has been addressed in linguistics and psycholinguistics studies, and is also considered one of the major challenges in MWE processing. We suggest that literal occurrences should be considered in both semantic and syntactic terms, which motivates their study in a treebank.

Unitate Fraseologikoen agerpen literalak, urre baina urri

Unitate fraseologiko asko idiomatikoki eta literalki uler daitezke. Esate baterako, ziria sartzeak bi esanahi
izan ditzake testuinguruaren arabera: norbaiti iruzur egitea edo nonbait ziri bat sartzea literalki. Lan honetan,
corpusetan oinarritutako azterketa eleaniztun baten berri emango dugu, eta erakutsiko dugu, batetik, halako
hitz-konbinazioak oso gutxitan erabiltzen direla literalki praktikan, eta bestetik, idiomatiko-literal bereizketa

Pages

Subscribe to RSS - Paper