====== Computer Vision Methods ====== [[https://fel.cvut.cz/cz/education/rozvrhy-ng.B202/public/html/predmety/46/84/p4684506.html|Schedule on FEL (CZ course)]] [[https://fel.cvut.cz/cz/education/rozvrhy-ng.B202/public/html/predmety/46/85/p4685206.html|Schedule on FEL (EN course)]]\\ [[https://cw.felk.cvut.cz/upload/|Upload system]] [[https://cw.felk.cvut.cz/forum/forum-1702.html|Discussion forum]] [[https://cw.fel.cvut.cz/wiki/courses/mpv/labs/start|Labs]] ===== Course Description ===== This course focuses on the following computer vision problems: finding correspondences between images using image features and their robust invariant descriptors, image retrieval, object detection and recognition, and visual tracking. ===== Pre-requisites ===== The course has no formal pre-requisits. However, certain skills and knowledge are assumed, and it is the responsibility of the student to get to the required level. The assignments are implemented in the Python and numpy computing environment, and familiarity with it will help. The programing assignments, involving either implementing, modifying or testing computer vision methods, are a substantial part of the [[https://cw.fel.cvut.cz/wiki/courses/mpv/labs/start|labs]]. Knowledge of the basics of digital image processing as convolution, filtration, intensity transformations, image function interpolations and basic geometric transformations of the image (see the first lab) is assumed. Knowledge of linear algebra and probability theory is needed to understand the presented computer vision methods. ===== Lectures: Monday 11:00-12:30, KN:E-126 ===== Lecturers: JM Jiří Matas, JC Jan Čech, DM Dmytro Mishkin, GT Giorgos Tolias, OD Ondřej Drbohlav,MS Milan Šulc Lectures will be given via zoom, link: https://feectu.zoom.us/j/93818244152,\\ according the schedule, as listed below (from 11:00, zoom meeting opened at 10:50 ) ^Week ^ Date ^ Lecturer ^ Slides ^ Topic ^ | 1| 15.2. |JC|{{deep_learning_MPV_2021.pdf|Deep learning}} [[https://cmp.felk.cvut.cz/~cechj/teaching/MPV/mpv_01_zoom_2021.02.15.mp4|recording]] |A shallow introduction into the deep machine learning. Convolutional Neural Networks. Principles, layers, architectures for image recognition. | | 2| 22.2. |JC|{{deep_learning_2_mpv_2021.pdf|Deep learning II}} [[https://cmp.felk.cvut.cz/~cechj/teaching/MPV/mpv_02_zoom_2020.04.06.mp4|recording]]* | Deep architectures object detection and semantic segmentation. Further insights into the deep nets. Generative models (GANs). | | 3| 1.3.|JM, DM| [[https://drive.google.com/file/d/1PlfsNtcj0tFB4I3Hct0VRNtCXzQa1vrT/view?usp=sharing| Correspondence 1st lecture slides*]], [[https://drive.google.com/file/d/1MD6oDfQKXkwUu9_8gblDwQ2M8PDsYpYW/view?usp=sharing|recording]] |Correspondences and wide baseline stereo. Motivation and applications. Interest point and distinguished regions detection: Harris operator (corner detection)| | 4| 8.3.|DM| [[https://drive.google.com/file/d/15XRfA_Jrs3rOh9LusczI8E_AL2pplgoP/view?usp=sharing| Correspondence 2nd lecture slides*]], [[https://drive.google.com/file/d/1Xk50BJeC8fywLiOHxNqOTMw4y2pxIrAg/view?usp=sharing|recording]] | Laplace operator and its approximation by difference of Gaussians, Hessian detector, affine covariant version, Maximally Stable Extremal Regions (MSER). | | 5| 15.3.|DM| [[https://drive.google.com/file/d/1o1YwDI61SJYCoEn2t5DwH3C7pP6KvBNF/view?usp=sharing| Correspondence 3rd lecture slides*]], [[https://drive.google.com/file/d/10qGr4Eek6W3w7trElCg45brC2JCtFLqW/view?usp=sharing|recording]] | Descriptors of measurement regions: SIFT (scale invariant feature transform), RootSIFT, shape context. LBP (local binary patterns), Matching. Deep learned features (HardNet). | | 6| 22.3.|DM, JM| {{ :courses:mpv:2021_ransac.pdf |RANSAC-part1}} [[https://drive.google.com/file/d/1QwY98vUtvSNrkb7LsaDQ9jCQXJJgoa2a/view?usp=sharing|recording]] | Deep learned features (R2D2, SuperPoint): finish. Start of RANSAC. | | 7| 29.3.|JM, GT| {{:courses:mpv:2021_ransac.pdf | RANSAC-part2}} [[ https://cmp.felk.cvut.cz/~toliageo/ext/teach/mpv/202/mpv_29_march_2021_RANSAC_part2.mp4 | recoding ]] [[https://drive.google.com/file/d/1VoGZdrL-Ql_OKNWF3Y-uF3yya09553mH/view?usp=sharing | Retrieval-part1]] [[ https://cmp.felk.cvut.cz/~toliageo/ext/teach/mpv/202/mpv_29_march_2021_retrieval_part1.mp4 | recoding ]] | RANSAC: part 2. Image retrieval: task formulation, evaluation metrics, Bag-of-Words| | 8| 5.4.| ^ Easter Monday | | | 9| 12.4.|GT| [[https://drive.google.com/file/d/1VoGZdrL-Ql_OKNWF3Y-uF3yya09553mH/view?usp=sharing | Retrieval-part2]] [[ https://drive.google.com/file/d/1gULNMLBYIf8YUnup8uKbF5TjZwKc3rab/view?usp=sharing | recording]] | Representation and matching models, spatial verification, query expansion, other tasks | | 10| 19.4.|GT| [[https://drive.google.com/file/d/18ySFSiVdU93pNldaeLrATFmLkyn1sH5k/view?usp=sharing | Deep retrieval]] [[https://drive.google.com/file/d/1VRk9KVzSB9lnlXqVjqio91eO_xphukbH/view?usp=sharing | recording ]] | Transfer learning, architectures for global and local descriptors, pairwise loss, descriptor whitening | | 11| 26.4.|JM| {{ :courses:mpv:matas-2018.04-klt-only.pdf |KLT}}, {{2019_mpv_mean_shift_2.pdf|Mean Shift}}[[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-04-26%2010.56.37%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording]]| Tracking I. Introduction. Kanade-Lucas-Tomasi tracker. Mean Shift | | 12| 3.5.|JM| {{kcf_lecture2016.pdf|KCF Tracking}} {{TLD.pdf|TLD}}, {{Tracking_by_Segmentation.pdf|Tracking_by_Segmentation}}[[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-05-03%2011.11.59%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording]]| Tracking II. KCF Kernel Correlation Filter. Long-term Tracking, TLD: Tracking-Learning-Detection, Tracking by Segmentation| | 13| 10.5.| JM |[[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021.05.10_viola_jones.pdf|Viola-Jones face detector]], [[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021.05.10-waldboost-vision-apps.pdf| Waldboost]],{{2016.05_hough-transform.pdf|Hough Transform}} [[https://cmp.felk.cvut.cz/~matas/teaching/mpv/2021-05-10%2011.05.25%20MPV%20-%20Computer%20Vision%20Methods%20-%20Lecture%2093818244152/zoom_0.mp4|recording]]| Object detection by sliding window and sequential decision making (Method of Viola and Jones, Waldboost. Hough transform and its comparison with RANSAC(Random Sample Consensus).| | 14| 17.5.| MS| {{mpv__fine_grained_recognition_of_plants_and_fungi_from_images.pdf|Recognition of Plants and Fungi}} [[https://cw.fel.cvut.cz/wiki/_media/courses/mpv/2021-05-17-sulc-plantsandfungi.mp4|recording]] | Case study: Plant and Fungi recognition using Deep Nets | * Update of course slide material * I forgot to start recording of the second lecture, I am sorry. The provided recording is from the last year. Note that the slides has changed slightly. Semantic segmentation architectures and deep fakes were added this year. Minor errors were fixed in the slides. ===== Evaluation ===== Work during the semester 50%, written part of the exam 40%, oral part of the exam 10% ===== Exam ===== Examples of exam [[courses:mpv:labs:exam_questions|questions]]. There will be 4-5 similar questions at the written part of the exam. The oral part of the questions takes place after the written part and will focused on discussion of your answers. ===== Literature ===== Lecture slides constitute the main source of study literature in this course. ===== Further Info ===== Further information is available in next sections of this page. We would appreciate your feedback on the contents and organization on the discussion [[https://cw.felk.cvut.cz/forum/forum-1639.html|forum]] of the course. ----- \\ Good luck to all participants of the course. | Lecturers |||| | [[http://cmp.felk.cvut.cz/~matas|{{http://cmp.felk.cvut.cz/~matas/images/jm/jm_ct2008.11-3.jpg?120}}]] | [[http://cmp.felk.cvut.cz/~cechj|{{http://cmp.felk.cvut.cz/~cechj/JanCech.jpeg?120}}]] | [[https://cmp.felk.cvut.cz/~toliageo/index.html|{{./gtolias.jpeg?120}}]] | [[http://dmytro.ai/|{{./dmytro.jpeg?120}}]] | [[http://cmp.felk.cvut.cz/~drbohlav/|{{http://cmp.felk.cvut.cz/~drbohlav/ondrej_drbohlav.jpg?120}}]]| | Jiří Matas | Jan Čech | Giorgos Tolias | Dmytro Mishkin | Ondřej Drbohlav | Consultations are possible upon request.