B3155 - PROJECT WORK IN MACHINE LEARNING AND DATA MINING

Academic Year 2023/2024

  • Teaching Mode: Traditional lectures
  • Campus: Bologna
  • Corso: Second cycle degree programme (LM) in Artificial Intelligence (cod. 9063)

Learning outcomes

At the end of the course, the student is able to apply the knowledge acquired in Machine Learning in order to carry out autonomously a project focusing on a topic agreed upon with the teacher.

Course contents

The professor assigns a dataset and a mining task to be developed on it, or the students proposes a dataset that can be accepted by the professor. For the assignment of the dataset do the following:

  • visit the Kaggle [https://www.kaggle.com/] website; choose three competitions of your interest, either active or closed; the dataset should be fairly complex, in other words, avoid very small datasets with a few hundred elements and a handful of columns
  • send and email to the teacher with the subject "[Project work] Kaggle competitions choice" and containing the link to the three competition of your interest
  • wait for the answer and, when the competition is assigned, do as specified on the "Assessment methods section below"

As an alternative, if you are interested to a specific project related to machine learning, prepare a short description of your idea and ask for an appointment.

Readings/Bibliography

Documentation provided by the teacher or publicly available.

Teaching methods

The teacher assigns a project , to be developed individually or by a very small group; tutoring will be available on demand.

Assessment methods

This activity is mainly oriented to develop abilities based on the knowledge acquired in the related course.

After the assignment of the task by the teacher:

  1. create a github folder named “surname-projecttitle” and share it with the teacher;
  2. create a readme.md describing the project, with objectives, results, pertinent references, and anything else that may be useful for understanding
  3. create a sub-folder where software and data will be delivered, or including permanent links to the data if the data is too large
    1. the folder must contain a readme.md with the installation and startup notes, and the prerequisites for operation
    2. the solution should produced as a python notebook with an adequate amount of comments and program outputs, in order to constitute an adequate report of the project development (export the notebook also as a pdf in landscape format); if the software is plain code, instead of a notebook, it is necessary to add a report, written in latex, in landscape format, deliver latex and pdf.
    3. the software must be commented in English, the names of the variables must be meaningful and inspired by English, filenames must start with the project title, possibly a shortcut
  4. prepare a powerpoint or latex beamer presentation of 15-20 slides, deliver:
    1. the .pptx or .tex with any complementary files
    2. the pdf
  5. agree with the teacher a date for the presentation and possible demonstration of the software
  6. Unless otherwise indicated at the time of assignment, the indicative delivery date should be within three months after assignment
    1. at any time it is possible to consult the teacher by e-mail or by requesting an appointment
    2. delays are not a problem, please, at the expiry of the three months, notify via e-mail the need to continue

Teaching tools

  • Python Notebooks
  • GitHub

Office hours

See the website of Claudio Sartori

SDGs

Quality education Industry, innovation and infrastructure

This teaching activity contributes to the achievement of the Sustainable Development Goals of the UN 2030 Agenda.