Course syllabus for

Language Technology
Språkteknologi

EDAN20F, 7.5 credits

Valid from: Autumn 2017
Decided by: Professor Thomas Johansson
Date of decision: 2017-03-22

General information

Division: Computer Science (LTH)
Course type: Joint course, advanced level and doctoral level
The course is also given at advanced level with course code: EDAN20
Teaching language: English

Aim

Over the past 15 years, language technology has matured considerably, driven by the massive increase in textual and spoken data and the need to process them automatically. Although few systems are entirely dedicated to language processing, scores of applications are now to some extent "language-enabled" and embed language processing techniques; examples include spelling and grammar checkers, information retrieval and extraction, and spoken dialogue systems. As a result, the field has become a new requirement for computer science engineers. The course introduces the theories used in language technology. It attempts to cover the whole field, from character encoding and statistical language models, through syntax and parsing, to semantics and conversational agents. It focuses on proven techniques as well as significant industrial and laboratory applications.

Learning outcomes

Knowledge and understanding

For a passing grade the doctoral student must

Competences and skills

For a passing grade the doctoral student must

Judgement and approach

For a passing grade the doctoral student must

Course contents

An overview of language technology: disciplines, applications, and examples.
Corpus and word processing: regular expressions, automata, an introduction to Perl, concordances, tokenization, counting words, collocations.
Morphology and part-of-speech tagging: word morphology, transducers, part-of-speech tagging.
Phrase-structure grammars: constituents, trees, DCG rules, unification.
Partial parsing: multiword detection, noun group and verb group extraction, information extraction, evaluation.
Syntax: formalisms, constituency and dependency, functions, parsing, statistical parsing, dependency parsing.
Semantics: formal semantics, lambda-calculus, lexical semantics, predicate-argument structures, frame semantics, semantic parsing.
Discourse and dialogue: reference and coreference, discourse and rhetoric, discourse relations, parsing discourse relations, dialogue automata, speech acts, multimodality.

Course literature

Pierre Nugues. Language Processing with Perl and Prolog: Theories, Implementation, and Application. 2014. ISBN 9783642414640.

Types of instruction

Types of instruction: Lectures, laboratory exercises

Examination

Examination format: Written exam
Grading scale: Fail, pass
Examiner: Professor Pierre Nugues

Admission details

Prerequisites: EDAA01 Programming - Second Course
Minimum number of participants: 1

Course occasion information

Start date: 2021-08-30
End date: 2021-10-31
Course pace: Full time

Contact information and other details

Course coordinator: Pierre Nugues <pierre.nugues@cs.lth.se>
Web page: http://cs.lth.se/edan20

