95662 - INTRODUCTION TO MACHINE LEARNING

Anno Accademico 2021/2022

  • Docente: Giovanni Della Lunga
  • Crediti formativi: 3
  • SSD: SECS-S/06
  • Lingua di insegnamento: Inglese
  • Modalità didattica: Convenzionale - Lezioni in presenza
  • Campus: Bologna
  • Corso: Laurea Magistrale in Quantitative finance (cod. 8854)

Conoscenze e abilità da conseguire

The main goal of the course is to present the first elements of Machine Learning accompanied by a brief reference to the most important elements of numerical analysis used in this field. We present also the Python ecosystem for machine learning and the functionality it provides with NumPy, Matplotlib and Pandas, scikit-learn. A general discussion of Supervised and Unsupervised is introduced. After discussing the idea of clustering, the student should learn that this class of algorithms explore input data without being given an explicit output variable. Students should also clearly understand when to use it. After that, we continue with the definition of Supervised Learning describing from a very general point of view how this class of algorithms works and when we should use it. Students should learn how to implement some of the most simple and standard methods for modelling relationship between independent input variables and dependent output variables. As regards decision trees and how they can be used for prediction, the student should learn what are potential advantages of this technique over linear or logistic regression and how to use it in classification problems. A simple introduction to bayesan learning is presented. Finally we explain how different machine learning algorithms can be combined to produce composite predictions. An important example of this is a random forest which is a procedure for generating many different decision trees and combining the results. Students should be familiar with the following concepts: Vector Spaces, Eigenfunctions and Eigenvectors, Operator and Matrix Calculus, Calculus of Extrema, the concept of Gradient, Condition for local and global minima, Conditional Probability, Bayes Rule. A basic experience with Python programming is required.

Contenuti

Lesson 1 - Introduction

In this lesson the student should learn the first elements of the subject accompanied by a brief reference to the most important elements of numerical analysis used in the field of machine learning. We present also the Python ecosystem for machine learning and learn about Python and it’s rising use for machine learning, SciPy and the functionality it provides with NumPy, Matplotlib and Pandas, scikit-learn that provides all of the machine learning algorithms. The student also learn how to install the Python ecosystem for machine learning on your workstation.

Topics:

  • What is Machine Learning
  • Supervised and Unsupervised Learning
  • Mathematics for Machine Learning (just a quick reminder)
  • Python Ecosystem for ML

Lesson 2 - Unsupervised Learning

In this class we present a simple example of an Unsupervised Learning Algorithm. After discussing the idea of clustering, the student should learn that this class of algorithms explore input data without being given an explicit output variable. The studend shoud also clearly understand when to use it that means when we don't know how to classify the data and we need an algorithm to find patterns.

Topics:

  • What is Unsupervised Learning
  • When to use it and how it work
  • k-Means clustering
  • Portfolio clustering with K-means

Lesson 3 - Supervised Learning

We begin with the definition of Supervised Learning, after that we describe from a very general point of view how this class of algorithms works and when we should use it. The student also lears how to implement two of the most simple and standard methods for modelling the past relationship between independent input variables and dependent output variables. Finally we present a technique that is tipically used for clustering but can be transformed to perform regression.

Topics:

  • What is Supervised Learning
  • When to use it and how it work
  • Linear Regression
  • Logistic Regression
  • SVM (Support Vector Machines)

Lesson 4 - Supervised Learning

In this lesson, we continue our discussion of supervised learning by considering how decision trees can be used for prediction. The student should learn what are potential advantages of this technique over linear or logistic regression and how to use it in classification problems. A simple introduction to bayesan learning is presented along with a simple application to credit feature selection. After that we explain how different machine learning algorithms can be combined to produce composite predictions. An important example of this is a random forest which is a procedure for generating many different decision treees and combining the results.

Topics:

  • Decision Tree
  • Naive Bayes
  • Random Forest

Lesson 5 - Introduction to Deep Learning

In this lesson the student learns what an Artificial Neural Network is. We then move on to provide some applications and esplain some extensions of the basic idea.

Topics:

  • What is a Neural Network
  • Single Layer NN
  • Gradient Descent Algorithm

Testi/Bibliografia

    • John C. Hull, Machine Learning in Business, An Introduction to the World of Data Science, Amazon (2019)
    • Paul Wilmott, Machine Learning, An Applied Mathematics Introduction, Panda Ohana Publishing (2019)
    • Sebastian Raschka and Vahid Mirjalili, Python Machine Learning, Packt (2019)
    • Francois Chollet, Deep Learning Python, Manning (2018)

Metodi didattici

These introductory lessons assume a basic level of statistical and mathematical knowledge. No previous knowledge of Machine Learning is assumed but a basic knowledge of Python language is mandatory.

Lessons are based on slides and Jupyter Notebook, delivered online in advance.

Modalità di verifica e valutazione dell'apprendimento

The final exam consists of a small project chosen by the student from a series of proposals related to the other topics covered during the course. The test must be requested from the teacher well in advance of the exam date during which the work will be analyzed and evaluated. During the exam, the student will have to present the developed project and discuss its main aspects.

Strumenti a supporto della didattica

  • Slides (power point/pdf)
  • Selected literature
  • Jupyter Notebook
  • Python Code Snippet

Orario di ricevimento

Consulta il sito web di Giovanni Della Lunga

SDGs

Istruzione di qualità Parità di genere Imprese innovazione e infrastrutture

L'insegnamento contribuisce al perseguimento degli Obiettivi di Sviluppo Sostenibile dell'Agenda 2030 dell'ONU.