Skip to content

Commit

Permalink
apply corrections suggested by colleagues
Browse files Browse the repository at this point in the history
  • Loading branch information
Goli-SF committed Jan 29, 2025
1 parent 57df972 commit 123ca16
Show file tree
Hide file tree
Showing 9 changed files with 52 additions and 40 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ in Research, Data, and Methods. HERMES is funded by the
through grants from the European Union.

# Verion
V 2025.01.31
V2025.01.31

# License
This lesson has a [CC-BY license](LICENSE.md).
9 changes: 5 additions & 4 deletions episodes/01_introduction.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,11 +14,12 @@ exercises: 0
::::::::::::::::::::::::::::::::::::::::::::::::

::::::::::::::::::::::::::::::::::::: objectives
After completing this lesson, learners will be able to ...

- Introduce the use cases of data visualization for the humanities.
- Introduce the concept of statistical inference to humanities researchers.
- Teach humanities researchers to visualize data with python to infer information from it.
- Teach humanities researchers to use data visualization and statistical inference for data storytelling.
- Understand the use cases of data visualization for the humanities.
- Understand the concept of statistical inference to humanities researchers.
- Visualize data with python to infer information from it.
- Use data visualization and statistical inference for data storytelling.

::::::::::::::::::::::::::::::::::::::::::::::::

Expand Down
4 changes: 2 additions & 2 deletions episodes/02_graph_categories.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,8 +14,8 @@ exercises: 0

::::::::::::::::::::::::::::::::::::: objectives

- Discuss the benefits of data visualization in humanities research.
- Explore the most effective graph types for data visualization in the humanities.
- Learn about the benefits of data visualization in humanities research.
- Learn some of the most effective graph types for data visualization in the humanities.

::::::::::::::::::::::::::::::::::::::::::::::::

Expand Down
6 changes: 3 additions & 3 deletions episodes/03_statistical_inference.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,9 +14,9 @@ exercises: 0

::::::::::::::::::::::::::::::::::::: objectives

- Explain the mathematical concept of statistical inference to humanities students and researchers.
- Explain the difference between descriptive and inferential statistics, correlation and causation.
- Explain the meaning of regression.
- Understand the mathematical concept of statistical inference.
- Understand the difference between descriptive and inferential statistics, correlation and causation.
- Understand the meaning of regression.

::::::::::::::::::::::::::::::::::::::::::::::::

Expand Down
13 changes: 5 additions & 8 deletions episodes/04_python_data_vis_for_inference_and_storytelling.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,11 +16,11 @@ exercises: 15

::::::::::::::::::::::::::::::::::::: objectives

- Creating scatter plots, bubble charts and correlograms in Python, using the Seaborn library.
- Implementing data visualization for exploratory analysis of a concrete dataset and telling a story
- Create scatter plots, bubble charts and correlograms in Python, using the Seaborn library.
- Implement data visualization for exploratory analysis of a concrete dataset and tell a story
based on the trends that it reveals.
- Using data visualization to infer information from a concrete dataset.
- Reflecting on the use cases of data visualization in humanities research.
- Use data visualization to infer information from a concrete dataset.
- Reflect on the use cases of data visualization in humanities research.

::::::::::::::::::::::::::::::::::::::::::::::::

Expand Down Expand Up @@ -59,14 +59,11 @@ Let’s answer these questions for our dataset by writing some code.
The dataset we're working with is stored in a CSV (comma-separated values) file on GitHub. Let's load it into
our notebook and store it in a pandas DataFrame named `happy_df`:

<span style="color: red;">The url below should be updated later, when the lesson is pushed to the incubator.</span>

```
import pandas as pd
# path to the dataset:
url= "https://raw.githubusercontent.com/HERMES-DKZ/data_challenges_data_carpentries/main/\
data_carpentries/statistical_inferece_data_visualization/data_statistical_inference_data_visualization/income_happiness_correlation.csv"
url= "https://raw.githubusercontent.com/HERMES-DKZ/stat_inf_data_vis/main/episodes/data/income_happiness_correlation.csv"
# loading the dataset and storing it in a pandas DataFrame:
happy_df= pd.read_csv(url)
Expand Down
11 changes: 6 additions & 5 deletions index.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,8 +2,9 @@
site: sandpaper::sandpaper_site
---

This is a new lesson built with [The Carpentries Workbench][workbench].


[workbench]: https://carpentries.github.io/sandpaper-docs

In this lesson, you'll explore the different types of graphs and
their use cases. You'll then dive into the concept of statistical
inference. Next, you'll get hands-on with Python coding to analyze
the happiness and income dataset provided below. Finally, you'll
use the graphs you've created to make informed estimates about
countries not included in the dataset.
27 changes: 26 additions & 1 deletion learners/reference.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,5 +4,30 @@ title: 'Reference'

## Glossary

This is a placeholder file. Please add content here.
**Graph**
A graph is a visual representation of data. It's like a picture that shows how different pieces of information are
related to each other. You can think of it like a map: just as a map helps you see where places are in relation to
each other, a graph shows how different data points connect and how they are related to each other. Graphs are the
products of data visualization. They can help you understand the data better by seeing trends and relations in it,
introduce it to others and draw conclusions from it.

**Data Visualization**
Data visualization is the broader practice of using graphs, charts, maps, and other visual tools to represent data.
It’s all about turning raw data (numbers, facts, figures) into images that can communicate insights quickly and clearly.
For example, instead of reading through pages of numbers, a well-designed chart can tell you the story behind those
numbers, making it easier to understand patterns, trends, and relationships.

**Data Storytelling*:**
Data storytelling is the art of combining data with a narrative. It's about presenting data not just as isolated
facts, but in a way that tells a compelling story. It’s like writing a story, but instead of using words, you use data.
The goal is to make the data more engaging and understandable for an audience by providing context, explaining trends,
and helping people see the bigger picture. Good data storytelling helps people grasp what the data means and why it
matters.

**Statistics**
Statistics is the science of collecting, analyzing, and interpreting data. It involves methods for understanding and
making sense of data, including calculating averages, percentages, trends, and variations. While data visualization
and storytelling help communicate findings, statistics provides the tools to understand and measure the data itself.
For example, if you want to know how typical or unusual a certain data point is, you would use statistical techniques
to analyze it.

16 changes: 4 additions & 12 deletions learners/setup.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,18 +2,13 @@
title: Setup
---

In this lesson, you'll explore the different types of graphs and
their use cases. You'll then dive into the concept of statistical
inference. Next, you'll get hands-on with Python coding to analyze
the happiness and income dataset provided below. Finally, you'll
use the graphs you've created to make informed estimates about
countries not included in the dataset.

:::::::::::::::: callout
### What background knowledge do you need for this lesson?

1. Basic acquaintance with Python
2. Basic mathematical background
1. Basic acquaintance with Python: you should know how to import Python packages and load data into your code.
You also need basic familiarity with Python syntax.
2. Basic mathematical background: you need a basic understanding of statistics and probabilities.
3. Curiosity to learn more about Python programming, statistics and data storytelling
::::::::::::::::::

Expand All @@ -25,12 +20,9 @@ If you wish to save the the dataset on your computer, go ahead and download the
Otherwise, you can directly load it into your code later using the following link:

```
https://raw.githubusercontent.com/Goli-SF/stat_inf_data_vis/tree/main/episodes/data/income_happiness_correlation.csv
https://raw.githubusercontent.com/HERMES-DKZ/stat_inf_data_vis/main/episodes/data/income_happiness_correlation.csv
```

<span style="color: red;">The above link should be updated later, when the lesson is pushed to the incubator.</span>


## Software Setup

::::::::::::::::::::::::::::::::::::::: discussion
Expand Down
4 changes: 0 additions & 4 deletions links.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,4 @@ Place links that you need to refer to multiple times across pages here. Delete
any links that you are not going to use.
-->

[pandoc]: https://pandoc.org/MANUAL.html
[r-markdown]: https://rmarkdown.rstudio.com/
[rstudio]: https://www.rstudio.com/
[carpentries-workbench]: https://carpentries.github.io/sandpaper-docs/

0 comments on commit 123ca16

Please sign in to comment.