- Docente: Fabio Tamburini
- Credits: 6
- SSD: L-LIN/01
- Language: Italian
- Teaching Mode: Traditional lectures
- Campus: Bologna
- Corso: First cycle degree programme (L) in Arts (cod. 0958)
Learning outcomes
The course will provide the knowledge related to the basic processes and methodologies on automatic text processing and on corpus building.
Course contents
Corpora
- What is a corpus, how to use it and the kind of information it
provides.
- Parameters for corpus design. Representativeness.
- Syntagmatic and paradigmatic analysis.
- Concordances, collocations and lexical association indexes.
- Annotations
- Electronic texts, coding, mark-up format and conversion methods.
- How to collect electronic texts.
- Corpus access and text retrieval.
- Case study: the corpora CORIS/CODIS, BoLC e DiaCORIS
- Web as corpus.
- Laboratory: building and using a tagged corpus.
Readings/Bibliography
- McEnery T., Wilson A. (1996). Corpus Linguistics.
- Lenci, A., Montemagni, S. and Pirrelli, V. (2005). Testo e
computer. Carocci.
- Rossini Favretti, R. (2009), Un'introduzione alla linguistica
applicata. Patron.
- Slides and papers will be provided during the lessons.
Teaching methods
Face-to-face classes for 30 hours.
Assessment methods
Oral colloquium.
It is compulsory to register for the exam using the online procedure.
Teaching tools
The course web site is the central point for any kind of information about the course. It contains the handouts and the readings discussed during the lessons as well as a rich software repository useful for laboratory practice.
A CD-ROM has been prepared for the students containing a complete computing environment to practice with the procedures proposed during the course. This tool will be used also in the laboratory sessions.
Links to further information
http://corpora.dslo.unibo.it/LingAppl/
Office hours
See the website of Fabio Tamburini