96803 - WEB AND SOCIAL MINING

Academic Year 2025/2026

  • Teaching Mode: In-person learning (entirely or partially)
  • Campus: Bologna
  • Degree programme: Second cycle degree programme (LM) in Statistics, Economics and Business (code 8876)

Learning outcomes

At the end of the course, the student is able to choose and use recent methods for web and social mining. In particular, the student is able to extract knowledge from the web and social media by applying machine learning techniques to analyze associations and to carry out clickstream analysis, sentiment analysis, text mining and network analysis. The student is able to:

  • use methods for extracting knowledge from the web;
  • use recent data mining software to solve practical web mining problems.

The student also has the experience needed to carry out independent study and research.

Course contents

  1. Aims and steps of web mining
  2. Data extraction
    • web scraping
      • structure of web pages, the HTML language
      • procedures and functions for web scraping
    • use of application programming interfaces (APIs)
      • data extraction from social media using APIs
  3. Text mining
    • data pre-processing
      • data cleaning
      • tokenization and part of speech tagging
      • text vectorization
    • exploratory data analysis of text data
    • topic modelling
    • sentiment analysis
  4. Analysis of Social Networks
    • network theory
    • centrality measures
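
As a rough illustration of two of the topics listed above (text vectorization and centrality measures), the sketch below builds a small document-term matrix and computes normalized degree centrality for an undirected network. The course itself uses R; this is a Python sketch using only the standard library, and the function names, sample sentences and edge list are hypothetical and not taken from the course material.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def bag_of_words(documents):
    """Return (vocabulary, matrix); matrix[i][j] counts term j in document i."""
    token_lists = [tokenize(doc) for doc in documents]
    vocabulary = sorted({tok for tokens in token_lists for tok in tokens})
    matrix = []
    for tokens in token_lists:
        counts = Counter(tokens)
        matrix.append([counts[term] for term in vocabulary])
    return vocabulary, matrix


def degree_centrality(edges):
    """Normalized degree centrality for an undirected graph.

    Assumes the edge list contains no self-loops.
    """
    neighbours = {}
    for a, b in edges:
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)
    n = len(neighbours)
    if n < 2:
        return {node: 0.0 for node in neighbours}
    # Divide each degree by the maximum possible degree, n - 1.
    return {node: len(adj) / (n - 1) for node, adj in neighbours.items()}


# Hypothetical sample data, for illustration only.
docs = ["Great course, great labs", "The labs were hard"]
vocab, dtm = bag_of_words(docs)

edges = [("ann", "bob"), ("ann", "carl"), ("bob", "carl"), ("carl", "dana")]
centrality = degree_centrality(edges)
```

Here `dtm` holds one row of term counts per document, in the column order given by `vocab`, and `centrality` maps each node to the share of other nodes it is directly connected to.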

Readings/Bibliography

Slides and R scripts of the lessons will be made available on virtuale.unibo.it

Teaching methods

Lectures and laboratory exercises using R software

Given the type of activity and the teaching methods used, all students must complete modules 1 and 2 of the training course on workplace safety, delivered in e-learning mode, before attending this training activity: [https://site.unibo.it/tutela-promozione-salute-sicurezza/it/corsi-di-formazione/formazione-obbligatoria-su-sicurezza-e-salute-per-svolgimento-di-tirocinio-tesi-laboratorio]

Assessment methods

The examination aims to ascertain the student's knowledge of the tools presented in the lectures and the ability to use them.
Learning is assessed through the presentation of a project that draws on the various topics covered in class.

The elements that determine the evaluation of the project are:

  • a clear definition of the research question and specific objectives;
  • a detailed description of the process followed in the development of the work;
  • a presentation of the methods used to achieve each objective;
  • consistency between the results obtained and the objectives set;
  • clarity and effectiveness in the final presentation of the results;
  • originality, critical thinking, and personal insight.

Office hours

See the website of Fabrizio Alboni