====== Computer Vision Methods ====== [[https://fel.cvut.cz/cz/education/rozvrhy-ng.B212/public/html/predmety/46/84/p4684506.html|Schedule on FEL (CZ course)]] [[https://fel.cvut.cz/cz/education/rozvrhy-ng.B212/public/html/predmety/46/85/p4685206.html|Schedule on FEL (EN course)]]\\ [[https://cw.felk.cvut.cz/upload/|Upload system]] [[https://cw.felk.cvut.cz/forum/forum-1777.html|Discussion forum]] [[https://cw.fel.cvut.cz/wiki/courses/mpv/labs/start|Labs]] ===== Course Description ===== This course focuses on the following computer vision problems: finding correspondences between images using image features and their robust invariant descriptors, image retrieval, object detection and recognition, and visual tracking. ===== Pre-requisites ===== The course has no formal pre-requisits. However, certain skills and knowledge are assumed, and it is the responsibility of the student to get to the required level. The assignments are implemented in the Python, numpy, [[https://pytorch.org| pytorch]] computing environment, mostly in form of [[https://jupyter.org|jupyter notebooks]], and familiarity with it will help. The programing assignments, involving either implementing, modifying or testing computer vision methods, are a substantial part of the [[https://cw.fel.cvut.cz/wiki/courses/mpv/labs/start|labs]]. Knowledge of the basics of digital image processing as convolution, filtration, intensity transformations, image function interpolations and basic geometric transformations of the image (see the first lab) is assumed. Knowledge of linear algebra and probability theory is needed to understand the presented computer vision methods. ===== Lectures: Monday 9:15 - 10:45, KN:E-107 ===== Lecturers: JM Jiří Matas, JC Jan Čech, DM Dmytro Mishkin, GT Giorgos Tolias, OD Ondřej Drbohlav, MS Milan Šulc Note: some of the lectures may change, but the 2021 recordings mostly provide a good idea about the content.\\ Lectures will be streamed on YouTube, link: https://www.youtube.com/playlist?list=PLQL6z4JeTTQnv27IWAY6NLafP6xiflmHe, and the recordings will be available in a playlist at https://www.youtube.com/playlist?list=PLQL6z4JeTTQl_HfTuIkuCltZ97inYDQT5 For online feedback, connect via zoom link: https://feectu.zoom.us/j/97922104602\\ according the schedule, as listed below (from 9:15) ^Week ^ Date ^ Lecturer ^ Slides ^ Topic ^ | 1| 14.2. |JC|{{deep_learning_MPV_2022.pdf|Deep learning}}\\ [[https://cmp.felk.cvut.cz/~cechj/teaching/MPV/mpv_01_zoom_2021.02.15.mp4|recording 2021]] [[https://www.youtube.com/watch?v=0deyr1TGSmY&list=PLQL6z4JeTTQl_HfTuIkuCltZ97inYDQT5&index=1|recording 2022]] |A shallow introduction into the deep machine learning. Convolutional Neural Networks. Principles, layers, architectures for image recognition. | | 2| 21.2. |JC|{{deep_learning_2_mpv_2022.pdf|Deep learning II}}\\ [[https://cmp.felk.cvut.cz/~cechj/teaching/MPV/mpv_02_zoom_2020.04.06.mp4|recording 2020]] [[https://www.youtube.com/watch?v=C5yQoXf_ILM&list=PLQL6z4JeTTQl_HfTuIkuCltZ97inYDQT5&index=2|recording 2022]] | Deep architectures object detection and semantic segmentation. Further insights into the deep nets. Generative models (GANs). | | 3| 28.2.|JM, DM| [[http://cmp.felk.cvut.cz/~mishkdmy/MPV2022/MPV2022_local_features_day1.pdf| Correspondence 1st lecture slides]], \\ [[https://drive.google.com/file/d/1MD6oDfQKXkwUu9_8gblDwQ2M8PDsYpYW/view?usp=sharing|recording 2021]], [[https://www.youtube.com/watch?v=DtHjVb9MhPs&list=PLQL6z4JeTTQl_HfTuIkuCltZ97inYDQT5&index=4|recording 2022]] |Correspondences and wide baseline stereo. Motivation and applications. Interest point and distinguished regions detection: Harris operator (corner detection)| | 4| 7.3.|DM| [[http://cmp.felk.cvut.cz/~mishkdmy/MPV2022/MPV2021_local_features_day2.pdf| Correspondence 2nd lecture slides]], [[https://drive.google.com/file/d/1Xk50BJeC8fywLiOHxNqOTMw4y2pxIrAg/view?usp=sharing|recording 2021]] [[https://www.youtube.com/watch?v=ipG5Z6EchVw&list=PLQL6z4JeTTQl_HfTuIkuCltZ97inYDQT5&index=4|recording 2022]] | Laplace operator and its approximation by difference of Gaussians, Hessian detector, affine covariant version, Maximally Stable Extremal Regions (MSER). Descriptors of measurement regions: SIFT (scale invariant feature transform), RootSIFT, shape context. LBP (local binary patterns), Matching. | | 5| 14.3.|DM| [[http://cmp.felk.cvut.cz/~mishkdmy/MPV2022/MPV2021_local_features_day3.pdf| Correspondence 3rd lecture slides]], [[https://drive.google.com/file/d/10qGr4Eek6W3w7trElCg45brC2JCtFLqW/view?usp=sharing|recording 2021]] | Deep learned features: HardNet, R2D2, SuperPoint, AffNet. | | 6| 21.3.|DM, JM| {{ :courses:mpv:2021_ransac.pdf |RANSAC}} [[https://drive.google.com/file/d/1QwY98vUtvSNrkb7LsaDQ9jCQXJJgoa2a/view?usp=sharing|recording 2021]] | RANSAC. | | 7| 28.3.| MS| [[https://docs.google.com/presentation/d/1WbyD84HN7SECf2r36rhDw-6A2kJBI5GEYtPYqgBWmiU |slides]] | Computer Vision Applications: From Species Recognition to Business Documents.| | 8| 4.4.| GT| [[https://drive.google.com/file/d/1maTdz1eJ3emYa2v2JR9hnUT5g4rjsY3P/view?usp=sharing | Retrieval-part1]] [[ https://www.youtube.com/watch?v=e97dqBlbx0c&list=PLQL6z4JeTTQl_HfTuIkuCltZ97inYDQT5&index=8 | recoding2022 ]] | Retrieval: task formulation, evaluation metrics, Bag-of-Words, VlAD, ASMK, spatial verification| | 9| 11.4.| GT| [[https://drive.google.com/file/d/1maTdz1eJ3emYa2v2JR9hnUT5g4rjsY3P/view?usp=sharing | Retrieval-part2]] - [[https://drive.google.com/file/d/1lMM-9fyzDcFPjClBCXG31H4bbsFKXssU/view?usp=sharing | Deep-retrieval-part1]] [[ https://www.youtube.com/watch?v=Mbn0VzDBYgI&list=PLQL6z4JeTTQl_HfTuIkuCltZ97inYDQT5&index=9 | recording 2022]] | Retrieval: query expansion, special retrieval objectives: zoom in/out, details. Deep retrieval: FCN representation, global pooling methods, DELF | | 10| 18.4.|^ Easter Monday | | 11| 25.4.| GT| [[https://drive.google.com/file/d/1lMM-9fyzDcFPjClBCXG31H4bbsFKXssU/view?usp=sharing | Deep-retrieval-part2]] -[[https://www.youtube.com/watch?v=ZnHTmtK6tpg&list=PLQL6z4JeTTQl_HfTuIkuCltZ97inYDQT5&index=10 | recording 2022]] | Deep retrieval: loss function, training labels, other tasks, descriptor whitening | | 12| 2.5. | JM| {{ :courses:mpv:matas-2018.04-klt-only.pdf |KLT}}, {{2019_mpv_mean_shift_2.pdf|Mean Shift}}[[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-04-26%2010.56.37%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording 2021]]| Tracking I. Introduction. Kanade-Lucas-Tomasi tracker. Mean Shift | | 13| 9.5.| JM| {{kcf_lecture2016.pdf|KCF Tracking}} {{TLD.pdf|TLD}}, {{Tracking_by_Segmentation.pdf|Tracking_by_Segmentation}}[[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-05-03%2011.11.59%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording 2021]]| Tracking II. KCF Kernel Correlation Filter. Long-term Tracking, TLD: Tracking-Learning-Detection, Tracking by Segmentation. Introduction to [[https://cw.fel.cvut.cz/b212/courses/mpv/labs/4b_tracking/start| KCF lab task]].| | 14| 16.5.| JM |[[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021.05.10_viola_jones.pdf|Viola-Jones face detector]], [[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021.05.10-waldboost-vision-apps.pdf| Waldboost]],{{2016.05_hough-transform.pdf|Hough Transform}}\\ [[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-05-10%2011.05.25%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording 2021]]| Object detection by sliding window and sequential decision making (Method of Viola and Jones, Waldboost. Hough transform and /** Update of course slide material */ ===== Evaluation ===== Work during the semester 50%, written part of the exam 40%, oral part of the exam 10%. For this semester, the "normalization factor" for your points gained during the semester is 68. That means, points which contribute to your exam, are ''(your total number of points from semester including bonus points)/68.0 * 50''. ===== Exam ===== Examples of exam [[courses:mpv:labs:exam_questions|questions]]. There will be 4-5 similar questions at the written part of the exam. The oral part of the questions takes place after the written part and will focused on discussion of your answers. ''[New]''Please find the assignment of students to exam time slots {{ :courses:mpv:mpv_exam2022.pdf |here}}. The order in the lists is the order in which students will be examined. ===== Literature ===== Lecture slides constitute the main source of study literature in this course. ===== Further Info ===== Further information is available in next sections of this page. We would appreciate your feedback on the contents and organization on the discussion [[https://cw.felk.cvut.cz/forum/forum-1639.html|forum]] of the course. ----- \\ Good luck to all participants of the course. | Lecturers |||| | [[http://cmp.felk.cvut.cz/~matas|{{http://cmp.felk.cvut.cz/~matas/images/jm/jm_ct2008.11-3.jpg?120}}]] | [[http://cmp.felk.cvut.cz/~cechj|{{http://cmp.felk.cvut.cz/~cechj/JanCech.jpeg?120}}]] | [[https://cmp.felk.cvut.cz/~toliageo/index.html|{{./gtolias.jpeg?120}}]] | [[http://dmytro.ai/|{{./dmytro.jpeg?120}}]] | [[http://cmp.felk.cvut.cz/~drbohlav/|{{http://cmp.felk.cvut.cz/~drbohlav/ondrej_drbohlav.jpg?120}}]]| | Jiří Matas | Jan Čech | Giorgos Tolias | Dmytro Mishkin | Ondřej Drbohlav | Consultations are possible upon request.