90909 - Workshop 2 (WS4)

Academic Year 2021/2022

Learning outcomes

Workshops are designed to provide students with transversal skills that can prove useful in their future careers. The objective of the workshop is to help students to practice skills through application of information technology, data analysis, decision-making techniques (e.g. simulation) in complex organizations.

Course contents

WS4: BIG DATA TECHNIQUES WITH R - part I

This workshop presents the Data Mining workflow and it focuses on the machine learning techniques for classification, that is a widely used tool exploited in many applications (such as Text Mining).

Topics will be introduced theoretically but also verified in R-based softwares during the laboratory hours.

More in details, the course contents are:

  • Introduction to Data Mining;
  • Introduction to programming in R;
  • Presentation of classification techniques for Big Data;
  • Implementation of R scripts to classify input data;
  • Presentation of some case studies and applications.

Part II is in WS7 (not mandatory).

Readings/Bibliography

Slides by the teacher

ROBERT, I., et al. "R in action: data analysis and graphics with R". 2011.

Teaching methods

Lectures and Laboratory lessons (in presence, hopefully)

Assessment methods

Evaluation of a final written report.

Teaching tools

Slides and script files by the teacher

Office hours

See the website of Elena Morotti