Classes will be held in person until further notice. If teaching moves online, it will take place in the faculty MS Teams, where we use the group B0M33BDT+BE0M33BDT.

A basic knowledge of the following technologies is required:

  • Linux (basic commands)
  • SQL (CREATE TABLE, SELECT, aggregate SELECT, JOIN)
  • Python
    • types: list, tuple, dict, set
    • string manipulation
    • flow control (if, while, for)
    • function definition (def), lambda function
  • basic regular expressions

We recommend bringing your own laptop that can connect to the internet. You will need an SCP client (e.g. WinSCP) and an SSH client (e.g. PuTTY). A capable text editor for writing Python and SQL scripts (Notepad++, PSPad, etc.) is also useful.


Classes are always held on Wednesdays. Classes were planned to be held in the building on Charles Square. For the duration of distance learning, links to the online classes will be listed with the respective week.

  • odd week (L):
    • lecture 9:15–10:45, room KN:E-126
  • even week, option A (S-A):
    • lecture 9:15–10:45, room KN:E-126
    • practice, 2 parallel sessions: 11:00–12:30 and 12:45–14:15, room KN:E-307

The first week of the semester is odd, see below. For in-person classes, the only limitation is the capacity of the exercise room (22 seats; anyone beyond that must sit alongside on a chair without a table or share a table with someone).

Syllabus and schedule


If you do not want to be listed in the table, contact us and we will anonymize your row.

Classification requirements (credit, examination)

How to get credits

  • Obtain at least 30 of the 60 possible points from the mid-semester test, homework and practical test.
  • Mid-semester test: max. 20 points.
  • Homework: max. 20 points; optionally, successful completion of the AZ-900 certification during the course.
  • Theory and practice test at the end of the semester: max. 20 points.
  • The practical test consists of data-processing tasks using Hadoop and its related technologies. Everyone works independently, but you may use your own materials and the internet: notes, sample code from lectures and exercises, Google, Stack Overflow, etc.
  • The practical test can be repeated once in a make-up term by agreement with the instructor. The result of the first attempt is then cancelled and the result of the second attempt counts, even if it is worse.


The exam has a written part worth 20 points and an oral part worth 20 points. Both parts are compulsory, and either may lead to the need to retake the exam.

Hadoop: The Definitive Guide, 4th Edition, by Tom White

