99513 - ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR CLIMATE SCIENCE

Academic Year 2022/2023

  • Modules: Gastone Castellani (Module 1); Daniel Remondini (Module 2); Daniel Remondini (Module 3)
  • Teaching Mode: Traditional lectures (Module 1); Traditional lectures (Module 2); Traditional lectures (Module 3)
  • Campus: Bologna
  • Degree programme: Second cycle degree programme (LM) in Science of Climate (cod. 5895)

    Also valid for Second cycle degree programme (LM) in Physics (cod. 9245)

Learning outcomes

The student will learn the basics of artificial intelligence and machine learning (clustering, classification, regression, neural networks and Bayesian methods) and their application to climate science, with particular attention to the analysis of time series and space-resolved data. The student will learn how to implement these methods in open-source environments (e.g. Python and R).

Course contents

24 hours (3 CFU) Castellani

Introduction: Probability Theory, Probability Densities, Expectations and Covariances, Bayesian Probabilities, Bayesian Curve Fitting, Model Selection, the Curse of Dimensionality, Statistical Inference and Decision, Loss Functions for Regression.

Probability Distributions: Binary and Multinomial Variables, Beta, Dirichlet and Gaussian Distributions, Gaussian Mixtures, the Exponential Family, Maximum Likelihood and Sufficient Statistics, Conjugate Priors, Noninformative Priors, Nonparametric Methods, Inference and Association Tests.

Linear Models for Regression and Classification: Linear Basis Function Models, Bias-Variance Decomposition, Bayesian Linear Regression, Bayesian Model Comparison, Discriminant Functions, Probabilistic Generative and Discriminative Models.

Neural Networks: Learning Rules, Hebbian, BCM and Hopfield Models, Feed-forward Network Functions, Network Training, Error Backpropagation, Regularization.

Introduction to Kernel methods and Graphical models

24 hours (3 CFU) Remondini

Feature extraction & optimal projection:

Principal Component Analysis, Singular Value Decomposition, Factor Analysis, Multi-Dimensional Scaling, ISOMAP, Relation to cost function maximization and eigenvalue problem (Rayleigh quotient), Practical aspects: data regularization and normalization.
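As an illustration of the relation between PCA, the Rayleigh quotient and the eigenvalue problem listed above, the following is a minimal sketch, not course material. It assumes NumPy and a small synthetic data set (both chosen here for illustration). It checks that the leading eigenvector of the sample covariance matrix C maximizes the Rayleigh quotient w'Cw / w'w, and that the singular value decomposition of the centred data gives the same direction and variance.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data (an assumption for illustration): 200 samples, 3 correlated features.
mixing = np.array([[2.0, 0.5, 0.0],
                   [0.0, 1.0, 0.3],
                   [0.0, 0.0, 0.2]])
X = rng.normal(size=(200, 3)) @ mixing

# Centre the data and build the sample covariance matrix.
Xc = X - X.mean(axis=0)
n_samples = Xc.shape[0]
C = Xc.T @ Xc / (n_samples - 1)


def rayleigh_quotient(C, w):
    """Variance of the data projected onto direction w, normalised by |w|^2."""
    return float(w @ C @ w) / float(w @ w)


# Eigenvalue problem C w = lambda w; eigh returns eigenvalues in ascending order.
eigvals, eigvecs = np.linalg.eigh(C)
lambda_top = eigvals[-1]
w_top = eigvecs[:, -1]

# The same principal direction from the SVD of the centred data matrix.
U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
svd_variances = s**2 / (n_samples - 1)

# No random direction should beat the leading eigenvector.
random_dirs = rng.normal(size=(1000, 3))
max_random = max(rayleigh_quotient(C, w) for w in random_dirs)

print("largest eigenvalue:           ", lambda_top)
print("Rayleigh quotient at w_top:   ", rayleigh_quotient(C, w_top))
print("best of 1000 random directions:", max_random)
print("variance from leading singular value:", svd_variances[0])
```

The eigenvector and the first right singular vector may differ by an overall sign, which does not change the projection variance.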

Supervised classification:

Support Vector Machine, Discriminant Analysis, Random Forest.

Unsupervised clustering:

Hierarchical, k-means, spectral clustering techniques.

Introduction to Deep Learning:

Error minimization: the backpropagation algorithm, Feedforward Neural Networks, Convolutional Neural Networks.

Robust analysis:

Bias-variance dilemma, Best practices, Cross-validation procedures.

Suggested books:

Readings/Bibliography

Bishop - Pattern Recognition and Machine Learning

Tibshirani Tusher - Methods of statistical learning

Teaching methods

Slides and blackboard

Assessment methods

Project, literature study and questions.

Teaching tools

Programming environment and server connection.

Office hours

See the website of Gastone Castellani

See the website of Daniel Remondini

SDGs

Good health and well-being; Quality education; Gender equality; Industry, innovation and infrastructure

This teaching activity contributes to the achievement of the Sustainable Development Goals of the UN 2030 Agenda.