====== Semestral Work ====== 15.11.2018 - tips for CP1 Remind that the goal of the whole semestral work is to **integrate the schemas** of the datasets you have (integration on temporal/spatial extent is not enough). Now that you understand better your data, if you are still unsure about suitability of the datasets for the semestral work, please let us know, e.g. by email, or come and consult. Correct examples: * see tutorial 1 (Czech social security administration dataset integration). "Typ posudku - Invalidita - typ řízení zjišťovací" in one dataset determines "Počet nově přiznaných důchodů" in another dataset. * demografická statistika Štatistického úradu SR ([[https://slovak.statistics.sk/wps/portal/ext/themes/demography/population/indicators/!ut/p/z1/pZJPj4IwEMU_Uqel0nIsuNQKQVoF3V4MJ0Piv4PZz7-geFmzUxPn1sz7dV6nj3iyI_7c_fSH7tZfzt1xOH_7eG-FkWlKFci0ZWBE5WhhrS5aTrZ_BJX7ArNRtXZLToHPiB_a-bpmKuE6m7tVPrQzJtdlzADoxCMCjNfxxGdaLbgoAWSpZ2DUonGJjSJQ0Z0P-od_SsF7PGIQ4-nr_l4ECG8SFuAb8Z5_ZMDIt2X9aK8czcECFFUhN8ZmYuIRwSfzR4HHv2dL_F2CJSB0hw-F1KOvGJcciun11DxrB73pfwEIUzuL/dz/d5/L2dJQSEvUUt3QS80TmxFL1o2X1E3SThCQjFBMDhCVjIwSTdOUjFLUVFHSTky/|Živonarodení v manželstve podle Veku matky]]) vs. Vybrané demografické údaje (1989-2017) ČSÚ [[https://www.czso.cz/csu/czso/ceska-republika-od-roku-1989-v-cislech-2017-8jcopi31rm#01|Živě narozené děti podle věku matek při porodu]]. In CP2 you would decompose such categories (and find out that the latter is a subset of the former in this case) Incorrect example: * Dataset about number of parking places in Prague vs. Dataset about number of births in Prague - they are only related by the geographical axis (e.g. sharing the geospatial axis - Prague districts), but otherwise they are not connected. ===== Grading ===== The semestral work is graded in three checkpoints: * [[courses:osw:cp0|checkpoint 0]] (max 5 pts) * [[courses:osw:cp1|checkpoint 1]] (max 20 pts) * [[courses:osw:cp2|checkpoint 2]] (max 25 pts) To successfully complete the semestral project, you need to obtain at least 50% grading from **each** checkpoint. For completing a checkpoint you need to - (**by the checkpoint [[courses:osw:seminars|deadline]]**) push the deliverable from each checkpoint to the GIT repo https://gitlab.fel.cvut.cz/B181_B4M33OSW/%username%, resp. https://gitlab.fel.cvut.cz/B181_BE4M33OSW/%username% - (**by the checkpoint [[courses:osw:seminars|deadline]]**) upload a txt file 'cp0.txt', resp. 'cp1.txt', resp. 'cp2.txt' into the [[https://cw.felk.cvut.cz/brute/teacher/course/924|upload system]]. The only content of the file is the hash of the GIT commit you are submitting to the checkpoint - (**at the [[courses:osw:seminars|next lab]]**) defend the checkpoint. If you submit your checkpoint after the deadline, you will be penalized by losing 5 points for each commenced week of delay. This penalization will not be taken into account when deciding on passing/failing a checkpoint but will be used for the final grading. ===== Description ===== The basic goal of the semestral project is to **Create a linked data set together with an associated ontology and integrate it with other data sets as shown in this diagram:** {{:courses:osw:student-project.png?640|}} The topic for this term will be **Our City - the place we live in.**. Expected subtopics are (but not limited to) urban planning (cadastral data, land registry, ...), immovable properties, buildings (constructions, RUIAN, ...), and infrastructure (roads, cycling routes, utilities). ===== OSW Ontology ===== /* The current version of the ontology can be found [[http://onto.fel.cvut.cz/ontologies/city|here]]. {{:courses:osw:transportation.png|}} */ ===== Some Related Data Sources ===== ==== Generic Ontologies ==== * Protégé Ontology Library - http://protegewiki.stanford.edu/wiki/Protege_Ontology_Library * dbpedia.org * Time Ontologies * Time Ontology -- https://www.w3.org/TR/owl-time * Spatial Ontologies * WGS84 Ontology -- https://www.w3.org/2003/01/geo * GeoSparql Ontology -- http://www.opengeospatial.org/standards/geosparql ==== Generic Dataset Sources ==== * DataHub - http://datahub.io * European Data Portal - https://www.europeandataportal.eu/en/homepage * NKOD - https://data.gov.cz/datov%C3%A9-sady * Prague Open Data - http://opendata.praha.eu/ * Brno Open Data - https://kod.brno.cz/ * Ostrava Open Data - https://opendata.ostrava.cz/ * Pilsen Open Data - https://opendata.plzen.eu/ ==== Transportation Related Dataset Sources ==== * Road * Road info -- http://www.dopravniinfo.cz/seznam.aspx (Unexpected Events) * [[http://kbss.felk.cvut.cz/dopravni-info.zip|machine readable]] * Police accident records -- http://pcr.jdvm.cz ===== Sample projects from previous years ===== Archives {{ :courses:osw:osw2017-semestral-work-example-1.zip | semestral-work-example-1.zip}} and {{ :courses:osw:osw2017-semestral-work-example-2.zip | semestral-work-example-2.zip}} are two examples of semestral works from previous years. Note, that they cannot be used as a "template" for your semestral work since the rules from previous years were slightly different.