Instructions#

Instructions#

The following presentation provides you more instructions and information to get started with the final work.

Aim of the final assignment#

The final project is a group work where the aim is to apply spatial data science methods to study a selected sustainability challenge, and report your findings in a poster. For the final project, you can select a pre-defined topic, or develop your own question.

  • You should take advantage of your programming and spatial data science skills, version control skills (git + GitHub) and good coding practices (writing readable code) when doing the final assignment.

  • Return your poster as a Powerpoint presentation with a single slide or as a PDF with single page. In the poster session, you will be presenting your poster to the students.

Suggested topics#

Present your idea to the course instructor (at latest) by the end of January before starting the work.

Option 1: Your own project work#

For the final assignment, you can fully decide your own topic. Requirement for the work is that it needs to relate to sustainability and you need to apply spatial data science methods in your work. You can (for example) choose a specific SDG target and a subset of its indicators that you analyze and present in the poster. However, the topic can be something different as well (not directly related to SDGs).

Option 2: Sustainable cities and communities#

In this topic, the aim is to focus on Sustainable Development Goal 11 (Sustainable cities and communities) and create an analysis workflow in which you:

  • fetch the data for given area of interest (choose two areas so that you can compare them),

  • conduct the data analysis that aims to provide information related to the given target/indicator (i.e. what is the current state in the area based on the given indicator)

  • repeat the analysis workflow for another area of interest (e.g. in another city or neighborhood)

  • report your findings with informative maps and graphics as well as in the poster
    • E.g. what does the indicators reveal?

    • How do the areas compare? Are there differences or similarities?

The main idea of the assignment is to calculate a set of metrics / indicators based on openly available data, and to compare the cities/regions based on these measures. This assignment is not accurately defined, as the idea is to allow you to use your own imagination and interest to explore different datasets and conduct analyses that interest to you, still providing useful insights about the given indicator in the areas that you picked.

NOTE: If many groups decide to work on SDG11 using the same set of indicators, we will plan/organize the work in such a way that every group will have a different set of cities that they are working on to avoid duplicate works.

What should be returned?#

As an output of this project, you should i) create a poster that documents your work and results (see separate instructions for the poster). In addition, your group should ii) return the codes that were used to create the results as a Jupyter Notebook (or alternatively as Python script files).

Poster: Return your poster as a Powerpoint presentation with a single slide or as a PDF with single page. In the poster session, you will be presenting your poster to the students. You can add the poster into your Github repository, or send it via email or Slack to Henrikki. Please send a message to Henrikki via Slack, once you are ready with the final work.

Codes: Return your codes (Notebook(s) etc.) via Github as instructed above.

Distribution of work: Because this is a group work, I want you to document and explain how did you distribute the work:

  • Who worked with the codes and analysis?

  • Who went through the literature (for background / contextual information)?

  • Who prepared the poster presentation?

When is the deadline?#

  1. The deadline for a FINALIZED POSTER is Tuesday February 20th, 2024 when you should return your poster presentation to the teacher. This is a hard deadline, so that we have enough time to set up your poster to our poster-session.

  2. The poster session is organized on Thursday 22nd February at 13:00-16:00 at Rakentajanaukio 4, Hall R3. Book this slot to your calendar and ensure that you can come to the campus to present your work.

  3. The deadline for finalizing the Github repository (i.e. returning your codes + documentation ) is on Sunday 25th February.

Code of conduct#

In the final assignment you will work as a team. Inclusion, kindness and respect should be the guiding principles in the way of working in your group. In practice, this means being compassionate to yourself and others, respect different viewpoints, identities and backgrounds, and apologizing when needed. Let’s make this project a shared space for the process of learning & unlearning. Do not hesitate to contact Henrikki in case you have any worries regarding the code of conduct.

Grading#

The grading is based on a typical 0-5 scale. The final assignment is graded based on:

  • Main analysis steps (data fetching, data analysis, visualization)

  • Repeatability (it should be possible to repeat the main analysis steps)

  • Quality of visualizations (maps and graphs)

  • Poster, presentation and overall documentation of the work

Peer assessment of the poster#

You will receive a template for assessing one poster of another group. To help you to assess the poster, you can take advantage of the following assessment matrix that considers different aspects of the work:

Notes#

Notice that there are no specific criteria how you should conduct and do the analyses, as they are up to you to decide and figure out. As said earlier, the main purpose of the final assignment is to demonstrate your analysis and programming skills, so aim to do the work in a way that you feel comfortable with. Remember that getting things done is better than perfect!

As you might have experienced during this course, the programming parts can take time. Hence, it is recommended to distribute the workload in such a way that some member(s) of the group can start preparing the poster already early-on even before you have the results ready. It can take surprisingly long time to design a good poster with nice layout and visual appearance.

Useful documentation#

Check these resources that are most likely very useful when doing the final assignment:

Data sources#

You can use any (spatial) data that you can find, for example from the following sources:

Data sources are not limited to these, hence you can also use other data from any source that you can find (remember to document where the data is coming from!).

Literature + inspiration#

You can use the literature provided during the course as inspiration and as a source for information, but please remember to cite your sources appropriately in your final assignment. Add a reference list to the end of your notebook.

Start your assignment#

See the documentation in “Starting your final assignment on Github” -page

How to work with the code?#

The leading principle in terms of the programming parts of your final assignment is that the workflow should be repeatable and well documented. In other words, anyone who gets a copy of your repository should be able to run your code, and read your code. Document well the crucial parts of your code and especially remember to document the data sources that you use in your work and add links to the sources.

Organize all your material into your groups Final-Assignment repository and store your work into the final_assignment.ipynb file. If you wish, you can write parts of your analysis workflow into separate Python script files (.py) and then apply them inside the Notebook. Anyone who downloads the repository should be able to read your code and documentation and understand what is going on, and run your code in order to reproduce the same results. Please return a clean and coherent notebook (think it as a supportive report/story with code alongside the poster) that only contains necessary code cells to reproduce and report the main findings of your analyses Hence, you should remove all unnecessary code blocks for printing the first rows of the GeoDataFrame etc. Please ensure that everything works before returning your work: Run the whole notebook (like this) and return the work once you do not have any surprising errors and your results look as they should (this is what reproducibility is all about!).

Note: If your code requires some python packages not found in the csc notebooks environment, please mention them also in the report and provide installation instrutions.