- Docente: Emidio Capriotti
- Credits: 10
- SSD: BIO/10
- Language: English
- Moduli: Rita Casadio (Modulo 1) Emidio Capriotti (Modulo 2) Pier Luigi Martelli (Modulo 3)
- Teaching Mode: Traditional lectures (Modulo 1) Traditional lectures (Modulo 2) Traditional lectures (Modulo 3)
- Campus: Bologna
- Corso: Second cycle degree programme (LM) in Bioinformatics (cod. 8020)
-
from Nov 22, 2024 to Jan 13, 2025
-
from Nov 05, 2024 to Nov 21, 2024
Learning outcomes
At the end of the course, the student has the basic knowledge for developing and using tools for sequence and structure analysis of biomolecules and more generally for annotation problems in the genomic era. In particular, the student will be able to: discuss the theoretical basics of some machine learning tools (Neural Networks, Hidden Markov Models); selecting programs for problem solving; writing programs.
Course contents
Laboratory of Bioinformatics I (first semester, 5CFU)
Theory and application on :
- The role of Bioinformatics
- Archives and Next Generation Sequencing experiments
- The problem of sequence annotation
- Protein sequence, structure and function
- Protein structure comparison: generating rules for sequence comparison
- Local and global alignment methods; data base search with BLAST
- Extreme value statistics
- The protein universe and UniProtKB
- Evolution did it: what can we learn from a pairwise structure comparison over the entire PDB
- Theoretical foundation of building by homology
- From sequence to structure and function
- When a protein is a protein
Best practice on:
- Handling of the different alignment methods
- Modeller and statistical validation of computed 3D models
- Comparison with SwissModeller
Laboratory of Bioinformatics I (second semester, 5CFU)
Theory and application on:
- Protein geometrical features
- Protein 3D, secondary and covalent structure
- Protein Domains: SCOP and CATH
- The notion of functional domains/Go terms
- Functional domains and evolution
- Protein families
- Biosequence analysis: a historical perspective
- Mapping structures into sequences and back
- Propensity scales and propensity plots
- The concept of averaging over a sliding window
- Conditional probability and secondary structure prediction
- Basics of feed-forward neural networks.
- Training, testing and applications of NN
- Critical evaluation of machine learning methods: HMM vs NN
- Protein prediction under 30% sequence identity
Best practice on:
- How to model a protein domain with a HMM
- Best practice of hmmer and statistical validation of a computed protein domain
- Comparison with PFAM
Readings/Bibliography
Online, selected articles and reviews in cloud sharing.
Teaching methods
Lectures, exercises and project development.
Assessment methods
Students will be evaluated both with written tests and a final oral exam. Both methods assess the learning outcome of the course and aim at verifying what the student has acquired in terms of Bioinformatics skills during the theoretical and practical parts of the program developed over the two semesters.
Since the course includes two semesters, a written test at the end of each semester will evaluate whether the student is idoneous to attend the final oral section. Students who do not attend the "in itinere" tests, will be requested to have a final general test before the final oral section.
Before attending the final session of the exam, the student has to provide two written reports on the two tutored practical sections, executed in class. During the final oral assessment, the student is expected to answer questions on all the topics discussed during the course.
Teaching tools
Online, Public Databases, PubMed, and materials (pdf of the lectures and selected articles) in cloud sharing.
Office hours
See the website of Emidio Capriotti
See the website of Rita Casadio
See the website of Pier Luigi Martelli