
Checkpoint 1 (max 10 pts)


Deadline 10. 10. 2021

Goal

The goal of this checkpoint is to choose the research question to be answered, build a custom model for answering it, and find relevant data sources that could help answer it.

Topics

Questions are grouped by topic.

The topic Protected sites offers the following questions:

The topic Animal species is based on the natural taxonomy of the animal kingdom. We offer the following questions:

Deliverable

A PDF of 1-2 pages describing

  1. the research question to answer, with a detailed specification and your motivation for the topic,
  2. a model of classes and properties describing how to find a solution that answers the question. The model can take the form of an E-R diagram or similar (it does not have to be in a formal language),
  3. a list of data sources that you have found relevant for answering the research question (at least three, from different providers).

Details

Research question

Choose a research question from the list provided or come up with your own. Discuss the question with the lecturer. All proposed questions have a solution and require integrating data sets from various sources to answer them. A question created by students must be equally complex to answer, and a solution must exist. Specify the question for a particular area and time period.

Conceptual model

A conceptual model visualizes the knowledge needed to answer the question. The example question from the first tutorial (in which areas is it legal to sleep overnight in nature?) is modeled in the following way:

Check whether national or European legislation addresses the problem, and find the definitions of related terms.

The resulting model shows the relations between inputs and outputs and the way to answer the question. Feel free to use various colours and frame types to express abstract classes, different sources, etc. Use any conceptual modelling language you know to represent the model, e.g. UML or an E-R model.

The tutorial models were created in yEd.

Data sources

Find the specific data sets needed to answer the question. The data may not fit the model perfectly. If so, edit or extend the model and, where necessary, describe the conflicts in the output PDF.

When selecting datasets, think about the value they bring to solving your problem. Even very rich data are of no use to you if they do not contain the one piece of information you need.

It is also recommended to check not only the schema and structure but also the data content. An attribute may be part of the schema and still contain no data.
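One possible way to check the content rather than just the schema is to count how often each attribute is actually filled in. The following is a minimal sketch using only the Python standard library. The column names and sample rows are hypothetical and only stand in for a downloaded CSV dataset, and `empty_share` is an illustrative helper, not part of any course tooling.

```python
import csv
import io


def empty_share(rows, fieldnames):
    """Return, per column, the fraction of rows whose value is missing or blank."""
    counts = {name: 0 for name in fieldnames}
    total = 0
    for row in rows:
        total += 1
        for name in fieldnames:
            value = row.get(name)
            # DictReader yields None for columns missing from a short row.
            if value is None or value.strip() == "":
                counts[name] += 1
    if total == 0:
        # No rows at all: treat every column as entirely empty.
        return {name: 1.0 for name in fieldnames}
    return {name: counts[name] / total for name in fieldnames}


# Hypothetical sample standing in for a real dataset file.
SAMPLE = """site_id,name,area_ha,protection_level
1,Example Site A,120.5,
2,Example Site B,,
3,Example Site C,87.0,
"""

reader = csv.DictReader(io.StringIO(SAMPLE))
shares = empty_share(reader, reader.fieldnames)
for column, share in shares.items():
    print(f"{column}: {share:.0%} empty")
```

In this sample the `protection_level` column exists in the schema but is empty in every row, which is exactly the situation that a schema-only check would miss. For a real file, the `io.StringIO(SAMPLE)` part would be replaced by an opened file object.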

Some data are published only regionally. Try to look for other data providing the same or similar knowledge. You may need to combine several datasets to cover a larger area.