B0M33BDT – Technologie pro velká data
Important links
Prerequisities
It is necessary to have a basic knowledge of following technologies:
SQL (create table, SELECT, agg SELECT, JOIN)
Python
typy list, tuple, dict, set
string manipulation
flow control (if, while, for)
function definition (def), lambda function
basic regular expressions
We recommend bringing your own laptop that can connect to the internet. A smart text editor for writing Python and SQL scripts (Notepad++, PSPad, etc.) is also useful.
Schedule
Classes are always held on Wednesdays. Classes were planned to be held in the building on Charles Square. For the duration of distance learning, links to the online classes will be listed with the respective week.
odd week :
lecture 9:15–10:45, room KN:A-310
practice 11:00–12:30 , room KN:E-310
even week:
Sylabus and schedule
-
2. week (1.10.2025): PŘEDNÁŠKA ZRUŠENA
-
4. week (15.10.2025): Parallel data processing theory
5. week (22.10.2025): Alternative data platforms (Snowflake)
6. week (29.10.2025): Alternative data platforms (MS Fabric)
7. week (5.11.2025): Import of data (Kafka)
8. week (12.11.2025): Spark Streaming
9. week (19.11.2025): Advanced Spark practically
10. week (26.11.2025): Cloud introduction
11. week (3.12.2025): Cloud - Azure
12. week (10.12.2025): Databricks Advanced
13. week (17.12.2025): Agentic AI
14. week (7.1.2026): Reserve + Homework consultations
19.12.2025: Potential pre-term exam
Classification requirements (credit, examination)
How to get credits
Obtaining at least 30 points out of 60 possible for the mid-term test, homework and the final test.
mid-semester midterm test - maximum of 20 points can be earned
homework - maximum of 20 points can be earned
final test at the end of the semester - maximum of 20 points can be earned
Homework
Details will be specified during the semester.
How to deliver the result?
Send your homework via e-mail - source code and the output is expected or you can send a link to your repository - source and the output is required there as well.
Homework must be completed and sent at least one week before the exam.
Exams
It has a written part for 20 points and an oral part for 20 points. Both are mandatory and may lead to the need to retake the exam.
Kontakt
Literatura