87195 - LAB OF BIG DATA ARCHITECTURES M

Course Unit Page

Academic Year 2019/2020

Learning outcomes

The Lab of Big Data Architectures extends and complements what students learned in the course “Statistics and Architectures for Big Data Processing” with more in-depth, practical knowledge of big data technologies and architectures. Students will learn how to design a big data system, the key concepts and differentiators behind state-of-the-art technologies and architectures, and how to use them effectively. The course is organized as a series of practical exercises with interactive explanations, in which students learn by solving practical problems and working through examples.

Course contents

Configuring a Python environment

Connecting to a remote big data cluster

Creating a big data pipeline

Working with large datasets: from pandas DataFrames to Spark DataFrames

Machine learning on large-scale time-series datasets

Teaching methods

The class consists of a set of practical tutorials and assignments, carried out on the student's own laptop and on a remote big data cluster hosted by CINECA, the Italian supercomputing centre.

Assessment methods

Assessment is based on the students' reports.

Office hours

See the website of Andrea Bartolini