Emidio Capriotti

Associate Professor

Department of Pharmacy and Biotechnology

Academic discipline: BIOS-07/A Biochemistry

Director of Second Cycle Degree in Bioinformatics

Research

Keywords: Analysis and interpretation of cancer genome. Genome interpretation and prediction of disease-related protein mutations. Machine learning approaches in molecular biology. Protein-protein interactions. RNA structure comparison and prediction. Protein folding kinetics. Prediction of protein stability changes upon mutation. Protein structure prediction by threading methods and building by homology.

Mutation and Disease

Single Nucleotide Polymorphisms (SNVs) are an important source of human genome variability. The non-synonymous SNVs occurring in coding regions resulting in single amino acid polymorphisms (SAPs) may affect protein function and lead to pathology. We are interested to study the relationship between mutation and disease to develop machine learning methods for the prediction of disease-related SAPs. The input features of our methods are sequence, evolutive and functional information. We have implemented PhD-SNP , a simple method based on protein sequence and profile data. Recently, with SNPs&GO we improved the accuracy of the detection of disease-related SAPs including protein functional information. All the predictors has been tested using a cross-validation procedure on a set of annotated SAPs selected from SwissVar database.
In the 2009 the Marie-Curie IOF project Mut2Dis (PIOF-GA-2009-237225) has been granted by the European Union with ~217K Euro to develop new machine learning based approaches based on protein structure information to predict the impact of SAPs.
In 2017 we received funding from the Italian Ministry of Research and University (PRIN-201744NR8S). The main aim of our project consists in filling the gap between thermodynamic data and disease-related information on protein variants. We propose to integrate theoretical/computational approaches with experimental validations to assess the impact of amino acid variations on protein structure, function and protein-protein binding affinity.

Protein Folding Stability and Kinetics

Protein folding is a complex process that is responsible for the formation of the protein tree-dimensional structure. In our work we study the protein folding focusing on two main aspects: the protein stability and the folding kinetics. We are interested to predict the effect of single point protein mutation on protein stability. To measure the effect of the mutation, we use the variation of the free energy change (ddG) upon mutation that can be calculated as the difference of the free energy variation (dG) for the mutant and the wild-type proteins. If we consider the unfolding free energy we have that more stable mutations correspond to positive ddG values and less stable mutants to negative ddG. In the 2004 we developed I-Mutant a neural network binary classification method to predict if a mutation increase or decrease the protein stability using protein structure. In the 2005 the second version of the program I-Mutant2.0 has been implemented to predict ddG value using only sequence information. All this methods has been tested using a cross-validation procedure on a set of ddG values extracted from Protherm database. In 2022 we developed DDGun an untrained method for predicting the the ddG values using sequence and structure information.
We are also interested in the understanding of protein folding kinetics. In general proteins can have two-state kinetics when they folds directly to the native state from the unfolded state or multi-state kinetics when the protein folds through at least one intermediate state. The folding rate describes how fast the folding process is and is related to the activation energy of the process. In the 2007, K-Fold tools has been developed to predict the folding kinetics and rate of a given protein using structural information.

Protein and RNA Structure Comparison and Prediction

The structure of biologically relevant macromolecules such as protein and RNA is important to understand their function. According to this hypothesis the structure is important to infer the function. Our group is developing methods for structural alignment to extract common feature between different molecules and use them to predict unknown structure. In the 2008 SARA algorithm has been implemented to align RNA tree-dimensional structures. A new pipeline using SARA program has been used to assign RNA function using structure similarity.
In collaboration with Marc Marti Renom at the CNAG, Barcelona (Spain) we are implementing a statistical potential to score RNA structures and to select near native ones. We are also interested to develop new methods for RNA structure prediction.

Latest news

Receiving hours

Published on: March 19 2019

View all RSS

Emidio Capriotti

Research

Latest news

Restricted area