Introduction to Language Technology Applications

The course will be offered online, including both the theory sessions and the practical labs. We will try to offer an engaging course in both the theoretical and the hands-on practical sessions.

Language Technology is increasingly present in many of the applications we use in our everyday activities (Google Home, Amazon Alexa, Siri, Google Translate, grammar checkers, Google search engine, ChatGPT...), and the demand for experts who can develop applications based on Language Technology is ever growing in both industry and academia. This course will introduce the most commonly used techniques to build applications based on Language Technology. Attendees will learn how to apply techniques such as document classification and sequence labeling, as well as vector-based word representations (embeddings) and pretrained language models, to core applications such as Opinion Mining, Named Entity Recognition, Fake News Detection, Question Answering and Text Generation.

The course will have a practical focus (laboratories and practical tasks), in which attendees learn to use readily available LT toolkits (Spacy, Flair, HuggingFace Transformers) based on machine and deep learning in a multilingual and multi-domain setting. The aim is to allow attendees to acquire the required autonomy to solve practical problems by applying and developing Language Technology applications. The course will be taught in English.

The course is part of the NLP master's program hosted by the Ixa NLP research group at the HiTZ research center of the University of the Basque Country (UPV/EHU).

Student profile

This course is targeted at graduate students and professionals from a range of disciplines (linguistics, journalism, computer science, sociology, etc.) who need an applied introduction to Language Technology. This involves identifying the required linguistic resources, appropriate tools/libraries and techniques, with the aim of acquiring the autonomy required to solve practical problems by applying and developing applications based on Language Technology in different and creative ways.

For the practical content (coding exercises), some experience in Python programming is recommended. Previous attendance at the Deep Learning for Natural Language Processing course might be useful, although it is not required.

Introduction to Applications of Language Technology

Natural Language Processing
Cross-lingual Information Extraction
LABORATORY: Stance detection with logistic regression
. Features
. Static Word Embeddings
Introduction to Flair
Introduction to Spacy

Text Classification

Fake News, Stance and Propaganda
Detection
. Fake News
. Hyperpartisanism
. Hate speech
Inference
. Fact-checking
. Stance
. Argumentation
LABORATORY: Stance Detection
. Training with Flair and Spacy

Sequence Labelling

Named Entity Recognition
. Contextual Word Representations
. Datasets
. Evaluation
Morphology
. Contextual and neural lemmatization
. Evaluation and application to high-inflected languages
LABORATORY: Train language independent neural sequence taggers with Flair and Transformers
. Named Entity Recognition
. Contextual lemmatization.

Opinion Mining

Fine-grained Sentiment Analysis
Aspect-based Sentiment Analysis
Multidomain and multilingual issues
LABORATORY:
Sentiment Analysis
. Text Classification
Opinion Targets and Aspects
. Sequence Labelling with Transformers

Question Answering

Redefining NLP tasks as QA
Pre-trained language models, Transformers
Multilingual transfer learning
Last words
LABORATORY: Build and train a Question Answering system with encoders such as BERT.

Text Generation

Tasks based on Text Generation
Pre-trained language models, encoder-decoder (T5) and decoder (GPT) models
Multilingual transfer learning
Last words
LABORATORY: Automatic Argument generation using mT5.

Instructors

Rodrigo Agerri

Permanent researcher, member of Ixa and HiTZ

Irune Zubiaga

FPI researcher, member of Ixa and HiTZ

Practical details

General information

The classes will be held online. The practical labs will also be online.

Part of the Language Analysis and Processing master's program.

5 theoretical sessions with corresponding programming labs (20 hours).

October 13th to 17th 2025, every afternoon from 15:00 to 19:00 CET.

Course language: English.

Capacity: 60 attendees (first come, first served).

Cost: 270 euros + 4 euros insurance (270 euros for UPV/EHU members or if you also apply to the other Specialization courses).

Registration

Prerequisites: Basic Python programming experience. Previous attendance at the Deep Learning for Natural Language Processing course held the previous week is not a requirement, but it will help students to better understand the underlying algorithms of Language Technology applications. Bring your own laptop (no need to install anything). Although not strictly necessary, we recommend subscribing to Google Colab Pro for more GPU allowance.

Previous editions

February 2024, UPV/EHU Donostia-San Sebastian. Winter edition, 5 weeks, 22.5 hours, labs with Spacy and Transformers.
July 2023, UPV/EHU Donostia-San Sebastian. Summer version, 20 hours, labs with Flair, Spacy and Transformers
February 2023, UPV/EHU Donostia-San Sebastian. Winter version, 5 weeks, 22.5 hours, labs with Flair, Spacy and Transformers
July 2022, UPV/EHU Donostia-San Sebastian. Summer version, 5 days, 20 hours, labs with Scikit-learn, Flair, Spacy and Transformers

Winter 2022, UPV/EHU Donostia-San Sebastian. Winter version, 9 days, 22.5 hours, labs with Scikit-learn, Flair and Spacy

July 2021, UPV/EHU Donostia-San Sebastian. Summer version, 5 days, 20 hours, labs with Scikit-learn, Flair and Spacy


Introduction to Language Technology Applications (8th edition)

20-hour, 5-afternoon summer course

October 13th to 17th 2025, Donostia-San Sebastian

Online, including tutored labs

See also our sister "Specialization courses by HiTZ Chair of Artificial Intelligence and Language Technology"