40720 - Data Mining

Academic Year 2014/2015

  • Teacher: Ida D'Attoma
  • Credits: 6
  • SSD: SECS-S/03
  • Language: English
  • Teaching Mode: In-person learning (entirely or partially)
  • Campus: Forli
  • Course: Second cycle degree programme (LM) in Economics and Business Administration (cod. 8858)

Learning outcomes


This course introduces the main statistical data mining methods for extracting useful information from large databases and for supporting the business intelligence process with exploratory and predictive analysis.

Expected learning outcomes: at the end of the course the student is able to select the most appropriate methodology for a given decision problem, to analyse quantitatively the relationships between business phenomena, and to critically interpret empirical results.

Course contents


  1. Introduction to data mining.

  2. Organization of data: data objects and attribute types, data matrices and their transformations.

  3. Data Preprocessing and Exploratory Analysis: data cleaning, bivariate exploratory analysis of qualitative and quantitative data.

  4. Data reduction methods: Principal Component Analysis.

  5. Measures of Distance.

  6. Hierarchical Cluster Analysis.

  7. Predictive Models: logistic regression (introduction), decision trees (CART and CHAID methodologies).

Readings/Bibliography

  • Tufféry, S. Data Mining and Statistics for Decision Making. 2011. John Wiley & Sons.

  • Giudici, P., Figini, S. Applied Data Mining. 2009. John Wiley & Sons.

Teaching methods

The module consists of theoretical sessions on methods and practical tutorials devoted to applications on real economic data, using SAS statistical software.

Assessment methods

Written exam consisting of a multiple-choice section and a section requiring the production and interpretation of statistical outputs. The multiple-choice section tests the student's knowledge of the theoretical topics. The second section tests the ability to produce and interpret statistical outputs and to translate them into applied conclusions.

Teaching tools

SAS software demonstrations on data analysis will be provided. Notes can be downloaded from the lecturer's web page.

Links to further information

http://www.unibo.it/docenti/Ida.dattoma2

Office hours

See the website of Ida D'Attoma