
BE4M33DZO - Digital Image 2024/2025

Course topics

This course presents an overview of basic methods for digital image processing. Students apply what they learn in six implementation assignments, which help them understand the concepts and use them to solve practical problems.

Prerequisites

Students are expected to know calculus, linear algebra, statistics, and probability to the extent taught at CTU in Prague, FEE. Basic programming skills are also expected, especially in MATLAB.

Lectures

Lecturers:

  • Ondřej Drbohlav (CTU in Prague, FEE, Dept. of Cybernetics, room KN:E-215)
  • Daniel Sýkora (CTU in Prague, FEE, Dept. of Computer Graphics and Interaction, room KN:E-422)

Place and Time: Wednesday 9:15am-10:45am, Lecture Hall KN:E-301

Extent: 2 hours weekly

Week | Date | Topics | Teaching Materials
1st | 25.09.2024 | Pointwise (Monadic) Operations - digital image, histogram, change of brightness and contrast, thresholding, negative, quantization, gamma correction, histogram equalization and mapping | slides / video: CZ1, CZ2, EN1, EN2
2nd | 02.10.2024 | Fourier Transform - Fourier transform in 1D and 2D, basis functions, time and frequency domain, amplitude and phase, discrete Fourier transform, fast Fourier transform, sampling theorem | slides / video: CZ1, CZ2, EN1, EN2
3rd | 09.10.2024 | Convolution - convolution in 1D and 2D, fast computation using separable kernels, mip-mapping, integral image and Fourier transform | slides / video: CZ1, CZ2, EN1, EN2 (1, 2)
4th | 16.10.2024 | Linear Filtering - applications of 1D and 2D convolution: noise suppression, edge detection, blur, sharpening, Wiener filter, anti-aliasing, depth of field, motion blur, light scattering, glossy reflections, soft shadows, hybrid image | slides / video: CZ1, CZ2, EN1, EN2 (1)
5th | 23.10.2024 | Non-linear Filtering - bilateral filter, fast computation using rectangular kernels, piecewise linear approximation, and bilateral grid, applications: noise suppression maintaining sharp edges, digital makeup, abstraction, tone mapping, appearance modification, detail fusion, simulation of HDR image, material/texture change, glossiness removal, noise suppression using flash, depth map sharpening | slides / video: CZ1, CZ2, EN1, EN2 (1, 2, 3, 4, 5, 6, 7, 8, 9)
6th | 30.10.2024 | Image Editing - linear and multi-band image stitching, Laplace pyramid, gradient domain editing and its applications: stitching, cloning, smoothing, fusion, context highlighting, color to gray conversion, diffusion curves | slides / video: CZ1, CZ2, EN1, EN2 (1, 2, 3)
7th | 06.11.2024 | Image Deformation - translation, rotation, scaling, skew, affine and projective transformations, backward mapping, bilinear interpolation, moving least squares, as-rigid-as-possible and as-similar-as-possible image deformation | slides / video: CZ1, CZ2, EN1, EN2 (1)
8th | 13.11.2024 | Image Registration 1 - image similarity metrics, estimation of translation, block-matching, fast computation using early termination, multi-resolution processing, winner-take-all strategy, phase correlation, gradient descent, texture synthesis, hole filling | slides / video: CZ1, CZ2, EN1, EN2 (1, 2, 3)
9th | 20.11.2024 | Image Registration 2 - estimation of rotation and scaling, log-polar and Fourier-Mellin transformation, generalized gradient descent (affine and projective transformation), as-similar-as-possible image registration | slides / video: CZ1, CZ2, EN1, EN2 (1, 2, 3, 4)
10th | 27.11.2024 | Image Registration 3 - keypoint extraction, Harris detector, SIFT keys and their correspondence, estimating transformation parameters using SVD and RANSAC, elimination of outliers, applications (panorama stitching, object detection, background reconstruction, augmented reality, 3D reconstruction) | slides / video: CZ, EN1, EN2 (1, 2, 3)
11th | 04.12.2024 | Revision - suggest a topic for discussion | slides / video: CZ1, CZ2, EN1, EN2 (1, 2, 3)
12th | 11.12.2024 | Image Segmentation - an overview of interactive image segmentation algorithms, modifications and extensions of the original optimization problem: color model, imprecise background and foreground specification, multiple segments, colorization, and applications: alpha-channel estimation, recoloring, interactive tone mapping, coloring of hand drawings | slides1, slides2 / video: CZ1, CZ2, EN1, EN2 (1, 2, 3)
13th | 18.12.2024 | Daniel Sýkora's lecture: Towards High-quality Replication of Visual Appearance | (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21)
14th | 08.01.2025 | Recapitulation |

Labs

Lab Assistants:

  • Jan Čech (CTU in Prague, FEE, Department of Cybernetics, Room: KN:G-2)
  • Radoslav Škoviera (CTU in Prague, CIIRC, Room: JP:B-611a)
  • Daniel Sýkora (CTU in Prague, FEE, Department of Computer Graphics and Interaction, Room: KN:E-422)
  • Ondřej Drbohlav (CTU in Prague, FEE, Department of Cybernetics, Room: KN:E-215)

Place and Time: Tuesday 2:30pm-4:00pm, Lab Room KN:E-230

Extent: 2 hours weekly

Week | Date | Topics | Lab Assistant
1st | 24.09.2024 | Introduction to MATLAB | Jan Čech, Radoslav Škoviera, Daniel Sýkora, Ondřej Drbohlav
2nd | 01.10.2024 | Monadic Operations 1 - brightness and contrast changes, gamma correction, histogram equalization and mapping | Ondřej Drbohlav
3rd | 08.10.2024 | Monadic Operations 2 - submission of the 1st assignment (10 points) | Ondřej Drbohlav
4th | 15.10.2024 | Fourier Transform 1 - 2D FFT, displaying spectrum, basis functions, shift, convolution and sampling theorems | Jan Čech
5th | 22.10.2024 | Fourier Transform 2 - submission of the 2nd assignment (10 points) | Jan Čech
6th | 29.10.2024 | Dean's Day |
7th | 05.11.2024 | Linear and Non-linear Filtering 1 - convolution, Fourier transform, separable kernel, blurring, edge detection, deconvolution, bilateral filter | Radoslav Škoviera
8th | 12.11.2024 | Linear and Non-linear Filtering 2 - submission of the 3rd assignment (10 points) | Radoslav Škoviera
9th | 19.11.2024 | Image Editing 1 - gradient domain approach, solving the Poisson equation using a system of linear equations and Fourier transform, image cloning, gradient mixing | Daniel Sýkora
10th | 26.11.2024 | Image Editing 2 - submission of the 4th assignment (10 points) | Daniel Sýkora
11th | 03.12.2024 | Image Registration 1 - estimation of translation, scale and rotation using Fourier-Mellin transform (phase correlation, log-polar transformation) | Radoslav Škoviera
12th | 10.12.2024 | Image Registration 2 - submission of the 5th assignment (10 points) | Radoslav Škoviera
13th | 17.12.2024 | Image Segmentation 1 - painting hand-drawn images using the max-flow algorithm | Daniel Sýkora
14th | 07.01.2025 | Image Segmentation 2 - submission of the 6th assignment (10 points) | Ondřej Drbohlav

In total, up to 60 points can be obtained from the assignments. To pass the labs, students must obtain at least 30 points.

Exam

The exam consists of two parts: a written test and a short oral part. The written part consists of a number of questions and problems covering all the course topics:

  • Monadic operations (simple modifications of pixel intensity, histogram equalization and mapping)
  • Fourier transform (formulation, properties, sampling theorem)
  • Convolution (formulation, properties, speed-up methods, applications)
  • Image deblurring (formulation, solution using Wiener filter)
  • Bilateral filter (formulation, speed-up methods, applications)
  • Image editing in the gradient domain (formulation, solution, applications)
  • Image deformation (1, 2, 3, 4-point)
  • Image registration (translation, solution methods)
  • Image registration (translation, rotation, and scaling using Fourier-Mellin)
  • Image segmentation (formulation, solution using maximum flow, applications)
  • Image segmentation (formulation, solution using random walker, applications)

The oral part is short and serves to go over the test and clarify anything as needed. A total of 40 points can be obtained for the written test. To pass the exam, a minimum of 20 points is required.

Evaluation

The final grade is determined by the sum of the points obtained from the labs (maximum 60) and from the exam (maximum 40):

Grade | Point range | Description
A | 90 or more | excellent
B | 80 to 89 | very good
C | 70 to 79 | good
D | 60 to 69 | satisfactory
E | 50 to 59 | passable
F | less than 50 | failed
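
As an illustration of the arithmetic, the grading rules can be sketched in a few lines of Python. The function name final_grade is hypothetical, and the sketch assumes that missing either minimum (30 lab points or 20 exam points) results in an F regardless of the sum; the course rules above do not state this explicitly, so treat it as an assumption rather than official policy.

```python
def final_grade(lab_points: int, exam_points: int) -> str:
    """Map lab and exam points to a letter grade using the grade table."""
    if not 0 <= lab_points <= 60 or not 0 <= exam_points <= 40:
        raise ValueError("lab points must be 0-60 and exam points 0-40")

    # Assumption (not stated in the course rules): failing either
    # minimum means the course is not passed, whatever the sum is.
    if lab_points < 30 or exam_points < 20:
        return "F"

    total = lab_points + exam_points
    # Lower bounds of the grade ranges, checked from best to worst.
    for threshold, grade in ((90, "A"), (80, "B"), (70, "C"), (60, "D"), (50, "E")):
        if total >= threshold:
            return grade
    return "F"
```

For example, 45 lab points and 30 exam points sum to 75, which falls in the 70 to 79 range (grade C).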

Literature

  • He J., Kim C.-S., Kuo C.-C. J.: Interactive Segmentation Techniques: Algorithms and Performance Evaluation, Springer, 2014.
  • Radke R. J.: Computer Vision for Visual Effects, Cambridge University Press, 2012.
  • Goshtasby A. A.: Image Registration: Principles, Tools and Methods, Springer, 2012.
  • Paris S., Kornprobst P., Tumblin J., Durand F.: Bilateral Filtering: Theory and Applications, Now Publishers, 2009.
  • Gonzalez R. C., Woods R. E.: Digital Image Processing (3rd Edition), Prentice Hall, 2008.
  • Šonka M., Hlaváč V., Boyle R.: Image Processing, Analysis and Machine Vision (3rd Edition), Thomson Learning, 2007.
  • Pratt W.: Digital Image Processing (3rd Edition), John Wiley, 2004.
courses/be4m33dzo/start.txt · Last modified: 2025/01/15 10:54 by sykorad