Morfologia

Mofologia Konputazionala Euskaraz, 35 urte

Artikulu honetan morfologia konputazionalaren garapena azaltzen da, Ixa taldeak euskararako egindako aplikazioa azpimarratuz. Bilakaera historikoa jaso nahi izan da, teknologiaren bilakaera eta aplikazioen bilakaera uztartuz, beti euskararen gainean egindakoa adibidetzat hartuta.

Distância diacrónica automática entre variantes diatópicas do português e do espanhol

O objetivo deste trabalho é aplicar uma metodo- logia baseada na perplexidade, para calcular automa- ticamente a distância interlinguística entre diferentes períodos históricos de variantes diatópicas de idiomas.

Measuring Language Distance of Isolated European Languages

Phylogenetics is a sub-field of historical linguistics whose aim is to classify a group of languages by considering their distances within a rooted tree that stands for their historical evolution. A few European languages do not belong to the Indo-European family or are otherwise isolated in the European rooted tree. Although it is not possible to establish phylogenetic links using basic strategies, it is possible to calculate the distances between these isolated languages and the rest using simple corpus-based techniques and natural language processing methods. The objective of this

A Methodology to Measure the Diachronic Language Distance between Three Languages Based on Perplexity

The aim of this paper is to apply a corpus-based methodology, based on the measure of perplexity, to automatically calculate the cross-lingual language distance between historical periods of three languages. The three historical corpora have been constructed and collected with the closest spelling to the original on a balanced basis of fiction and non-fiction.

Measuring diachronic language distance using perplexity. Application to English, Portuguese and Spanish.

The objective of this work is to set a corpus-driven methodology to quantify automatically diachronic language distance between chronological periods of several languages. We apply a perplexity-based measure to written text representing different historical periods of three languages: European English, European Portuguese and European Spanish. For this purpose, we have built historical corpora for each period, which have been compiled from different open corpus sources containing texts as close as possible to its original spelling. The results of our experiments show that a diachronic

Orriak

RSS - Morfologia-rako harpidetza egin