99486 - Simulation Methods and Machine Learning in Medicinal Chemistry

Academic Year 2023/2024

Learning outcomes

At the end of the course, the student knows the basic aspects of consolidated and emerging computational approaches and methodologies in the pharmaceutical chemistry field. The student is able to address the many problems that characterize the phases of drug discovery and development through the choice of the most appropriate computational tool.

Course contents

  • Introduction to the course.
    Distinction between simulation methods and Machine Learning in the field of medicinal chemistry. Distinction between supervised and unsupervised Machine Learning.
  • Review of concepts learned in previous courses.
    Discrete and continuous variables. Empirical distributions. Univariate and multivariate statistical analysis. Operations on vectors and matrices. Eigenvectors and eigenvalues of a matrix. Python exercises.

Machine Learning methods:

  • The concept of "molecular featurization".
    Molecular fingerprints. Topological descriptors. Linear notations.
  • Simple and multiple linear regression.
    Loss functions. The training process. The method of least squares. Numerical methods. Evaluation of the performance of a regression model.
  • Feature selection and feature extraction.
    Univariate linear filters. Wrapper methods. Regularization methods. Principal Component Analysis. Python exercises.
  • Neural networks.
    The Multi-layer Perceptron. Training and optimization. Regularization and hyperparameter tuning. Introduction to generative methods. Autoencoders and variations. Prediction of pharmaceutically relevant molecular properties using neural networks.

Methods based on molecular simulations:

  • Introduction to simulation methods.
    The drug action process. Drug-target interaction: thermodynamics and kinetics aspects. Review of intermolecular interactions, enthalpic and entropic contributions.
  • Introduction to statistical mechanics.
    Microscopic definition of entropy. Microstates and macrostates. Boltzmann probability distribution. Helmholtz free energy.
  • Molecular mechanics.
    Force fields. Minimization algorithms.
  • Sampling methods.
    Introduction to the Metropolis Monte Carlo method. Introduction to the Molecular Dynamics method. Integration of the equations of motion. Brief overview of temperature control algorithms (thermostats). Boundary conditions. Trajectory analysis. Prediction of experimental observables of pharmaceutical interest using Molecular Dynamics simulations: binding free energy and association/dissociation rate constants.

Readings/Bibliography

Scientific articles and reviews suggested by the teacher.

Teaching methods

Frontal lectures and Pyhton exercises.

Assessment methods

Students will be required to present and discuss orally a written short summary (3-5 pages) of a research paper retrieved from the literature reporting a work dealing with an argument inherent to the course contents.

Teaching tools

Slides, scientific publications and other teaching material made available through the Virtual Learning Environment platform.

Office hours

See the website of Matteo Masetti