Informazioaren Erauzketa eta Berreskurapena

CLARIAH-EUS-gArA

CLARIAH-EUS-gArA proiektuak Giza eta Gizarte Zientzien (GGZ) eta Adimen Artifizialaren (AA) arteko bi norabideko zubia eraiki nahi du. Horretarako elkarrizketa bidezko laguntzaile adimentsu bat garatuko du GGZetan ikertzeko. Kazetaritzaren domeinuan landuko dugu Latxa euskarazko Hizkuntza Eredu Handienaren (HEH) gaineko elkarrizketa-sistema RAG sistema batekin lagunduta.

Curriculum Learning for large language models in low-resource languages

Large language models (LLMs) are at the core of the current AI revolution, and have laid the groundwork for tremendous advancements in Natural Language Processing. Building LLMs require huge amounts of data, which is not available for low resource languages. As a result, LLMs shine in high-resource languages like English, but lag behind in many others, especially in those where training resources are scarce, including many regional languages in Europe. The data scarcity problem is usually alleviated by augmenting the training corpora in the target

Orriak

RSS - Informazioaren Erauzketa eta Berreskurapena-rako harpidetza egin