B1872 - Text Encoding and Semantic Representation (1) (LM)

Academic Year 2023/2024

  • Teaching Mode: Traditional lectures
  • Campus: Bologna
  • Corso: Second cycle degree programme (LM) in Digital Humanities and Digital Knowledge (cod. 9224)

Learning outcomes

The course provides a theoretical and practical overview of digital methods for the encoding, analysis, and semanticrepresentationof texts (literary and not). At the end of the course students will be able to create, manipulate, query, organise and disseminate electronic documents in digital text collections.

Course contents

The course offers a gentle introduction to problems, approaches, and technologies for encoding, representing, organizing, and manipulating knowledge. At the end of the course students are aware of theoretical and practical aspects relevant to text encoding (XML/TEI); have acquired all the required skills to produce a machine-readable text (XML and HTML); can annotate and extract structured data from encoded texts; can manipulate text and semi-structured data via programming languages (e.g. XSLT, Python); can transform data into several formats and representations (e.g. HTML, RDF, JSON), and can produce Linked Open Data.

Readings/Bibliography

Lecture notes will be freely available from a dedicated GitHub repository before the course starts (please check this page before the beginning of the course for further information).

Slides and any additional material will be made available a few days before each lecture in the same repository.

Suggested readings are provided during classes.

Teaching methods

Lectures and hands-on classes.

Assessment methods

The exam consists in a project presentation. The project is shared with the second module of this course and the exam covers requirements of both modules. Specs of the other module (in pills) are:

  1. Gather information on a collection of at least ten items related to a topic

  2. Create an ontology to describe objects metadata and produce the documentation of the design phases (theoretical model, conceptual model, ontology, etc)

  3. Create a RDF dataset organised according to the ontology you designed

  4. Develop a website presenting the ontology design project and the dataset

Please, check the web page of the other module for a complete description of the project.

This module builds on top of the aforementioned program and adds the following requirements:

One of the collected items must a text, for which you must provide

  1. A XML/TEI document (a sample if too long) (included in the website)

  2. A XML to HTML transformation (python and/or XSLT) and a HTML document (included in the website)

  3. A XML/TEI to RDF transformation (python) and a RDF dataset (included in the website)

Metadata of all items must be transformed to RDF with python (according to your ontology)

  1. A .* to RDF transformation (python) and a RDF dataset for each item (included in the website)

 

Teaching tools

Classes are recorded and streamed at this link Teams.

Recordings are available at this link.

Office hours

See the website of Marilena Daquino