- Docente: Luigi Di Stefano
- Credits: 8
- SSD: ING-INF/05
- Language: English
- Teaching Mode: Traditional lectures
- Campus: Bologna
-
Corso:
Second cycle degree programme (LM) in
Computer Engineering (cod. 0937)
Also valid for Second cycle degree programme (LM) in Electronic Engineering (cod. 0934)
Second cycle degree programme (LM) in Automation Engineering (cod. 8891)
Second cycle degree programme (LM) in Artificial Intelligence (cod. 9063)
Learning outcomes
At the end of the course the students know the basic principles of computer vision and image processing algorithms. Thereby, they are able to understand and apply a variety of algorithms and operators aimed at either extracting relevant semantic information from digital images or improving image quality. They also understand the diverse challenges and design choices characterizing the main applications and acquire familiarity with software tools widely adopted in these scenarios.
Course contents
- Introduction – Basic definitions related to image processing and computer vision. An overview across major application domains.
- Image Formation and Acquisition – Geometry of image formation. Pinhole camera and perspective projection. Geometry of stereopsis. Using lenses. Field of view and depth of field. Projective coordinates and perspective projection matrix. Camera calibration: intrinsic and extrinsic parameters, lens distortion. Camera calibration based on planar targets and homography estimation (Zhang's algorithm). Image rectification and stereo calibration. Basic notions on image sensing, sampling and quantization.
- Intensity Transformations - Image Histogram, Linear and non-linear contrast stretching. Histogram equalization.
- Spatial Filtering – Linear Shift-invariant operators. Convolution and correlation. Mean and Gaussian filtering. Median Filtering. Bilateral filtering. Non-local means.
- Image Segmentation – Binarization by global
thresholding. Automatic threshold estimation. Spatially adaptive
binarization. Colour-based segmentation.
- Binary Morphology – Dilation and erosion. Opening and closing- Hit-and-Miss.
- Blob Analysis – Distances on the image plane and connectivity. Labeling of connected components. Basic descriptors: area, perimeter, compactness, circularity. Orientation and bounding-box, form factor and related descriptors. Euler number. Image moments, invariant moments.
- Edge Detection - Image gradient. Smooth derivatives: Prewitt, Sobel, Frei-Chen. Non-maxima suppression. Laplacian of Gaussion. Canny edge detector.
- Local Invariant Features – Detectors and descriptors. Harris Corners. Scale invariant features. SIFT features. Efficient feature matching by kd-trees.
- Object Detection – Pattern matching by SSD,
SAD, NCC and ZNCC. Fast pattern matching. Shape-based mathing.
Hough Transform for analystic shapes, Generalized Hough
Transform. Object detection by local invariant features:
Hough-based voting, least-squares similarity
estimation.
Theory is complemented by assisted hands-on lab sessions based on Python and the OpenCV library. Lab sessions cover selected topics such as intensity transformations, spatial filtering, camera calibration, motion estimation and local invariant features.
Readings/Bibliography
- Gonzales R., Woods R. : “Digital Image Processing”, Third Edition, Pearson Prentice-Hall, 2002.
- D.A. Forsyth, J. Ponce: “Computer Vision A
Modern Approach”. Pearson Prentice-Hall, 2003.
Teaching methods
Theory taught in lectures is complemented by assisted hands-on lab sessions covering selected topics. Students are provided with the software tools, image/video archives and support that enable practical implementation and testing of most of the topics discussed in classes, so as to deepen significantly their understanding of the course subject matter.
Assessment methods
Students are required to carry out and present a software project related to solving a real-world image processing or computer vision problem. Such a project can be either chosen among a list provided by the teacher through the course web-site or proposed by the student.
The, the exam is oral and comprises both project discussion as
well as assessment of theoretical knowledge.
Teaching tools
Available on the course website are:
- All slides related to lectures and lab sessions.
- A Software development framework based on OpenCV which allows students to practically implement the algorithms and methods taught in lectures.
- Images and videos allowing students to easily test their implementations.
- Links toimage processing and computer vision resources
freely available on the web (such as software tools and image/video
archives).
Links to further information
https://iol.unibo.it/course/view.php?id=33243
Office hours
See the website of Luigi Di Stefano
SDGs
This teaching activity contributes to the achievement of the Sustainable Development Goals of the UN 2030 Agenda.