B0M33BDT – Technologie pro velká data

Classes will be held in a full-time manner, or until further notice. In the case of online teaching, teaching will be done through faculty MS Teams. We use the group B0M33BDT+BE0M33BDT.

Flash news:


It is necessary to have a basic knowledge of following technologies:

  • Linux (basic commands)
  • SQL (create table, SELECT, agg SELECT, JOIN)
  • Python
    • typy list, tuple, dict, set
    • string manipulation
    • flow control (if, while, for)
    • function definition (def), lambda function
  • basic regular expressions

We recommend bringing your own laptop that can connect to the internet. It is necessary to have applications for SCP connection (e.g. WinSCP) and for SSH connection (e.g. PuTTY). A smart text editor for writing Python and SQL scripts (Notepad++, PSPad, etc.) is also useful.


Classes are always held on Wednesdays. Classes were planned to be held in the building on Charles Square. For the duration of distance learning, links to the online classes will be listed with the respective week.

  • odd week (L):
    • lecture 9:15–10:45, room KN:E-126
  • even week, option A (S-A):
    • lecture 9:15–10:45, room KN:E-126
    • practice 2 parallels 11:00–12:30 a 12:45–14:15, room KN:E-307

The first week of the semester is odd, see below. In the case of full-time classes, the only limitation is the capacity of the room for the exercises (22 seats, others must sit next to, on a chair without a table or squeeze in with someone).

Sylabus and schedule


If someone does not want to be listed in the table, contact us and we anonymize your row.

Name, surname Homework number Midterm test (20) Bonus practice Final test Practice
Atlasová Ester 2
Brož Jakub 4 17,5
Čáp Jan 4 14
Goldasová Silvia 4 18
Havelka Zdeněk 3 12,6
Hošťálek Jan 1 11,5
Jaroš Martin 4 15,5
Jína Filip 1
Kasl Tomáš 3 12
Komín Ondřej 2 13
Kopecký Filip 2
Kovář Adam 4 16
Kula David 1 13,3
Linder Pavel 3 13,3
Lukeš Michal 4 18
Mísař Matěj 1
Mráz Martin 3
Myšička Pavel 3
Novák Adam 1 17,8
Novotný Pavel 2 17,5
Petrželková Nela 3 14
Pokorný Šimon 2 13
Pospíšil Jiří 2
Prokš Dominik 1
Šefčík Šimon 2 16
Ševic Jan 2 14
Štěpánek Filip 4
Švagr Michal 4
Volprecht Vojtěch 1 11,3
Zahradník Petr 4 10,5
Žižka Ivan 1 17
Žuffa Marek 4 17
Auvinet Colin 3 17,6
Duran Terrero Andres 3 14
Fu Yongpan 3 18
Royer Paul 1 11,5
Sanz Puertas Vicente 2

Classification requirements (credit, examination)

How to get credits

  • Obtaining at least 30 points out of 60 possible for the continuous test, homework and practical test.
  • A maximum of 20 points can be earned on the mid-semester midterm test.
  • homework maximum of 20, optionally successful completion of AZ-900 certification within the duration of the course
  • max. 20 points for the theory and practice test at the end of the semester
  • The content of the practical test includes tasks on data processing using Hadoop and its technologies. Everyone works independently, but it is allowed to use your own and internet materials: notes, sample codes from lectures and exercises, Google, Stack Overflow, etc.
  • The practical test can be repeated once in a make-up period by agreement with the instructor. The result of the first test is cancelled and the result of the second test is valid, even if it is worse than the first test.


Check following pdf with tasks and check the list where you can see your assignment. Homeworks - PDF


It has a written part for 20 points and an oral part for 20 points. Both are compulsory and may lead to the need to retake the exam.

Termíny zkoušek budou vypsány v KOSu:

  • x.x. -



Hadoop: The Definitive Guide, 4th Edition, by Tom White

courses/b0m33bdt/start.txt · Last modified: 2022/11/22 23:01 by susicmar