Skip to content

Latest commit

 

History

History

4__Data_Visualization

Industrial Strength Visualization: Data Storytelling Using Python Visualization Libraries

Data visualization provides a pictorial representation of your data. It provides more information on what you think your data contains.

In real-life data, visualization is an important process to data analysis, which involves storytelling and recommendations.

In this practical, we will be looking at real-world data on 2020 World Population by Countries, to gather as much information and write out our findings/insights.

The visualization tools used are Matplotlib and Seaborn

Setup:

  • If you haven't already, clone this repo.
  • We use virtual environments in ti. If you already have a virtual environment created for a previous practical, you can simply activate the environment and install pandas using
pip install matplotlib seaborn

Otherwise, create a new virtual environment then

pip install -r requirements.txt
  • Go through the notebook to try your hands on the codes. 🔨🔨
  • While doing that, you may realise that there are some details that weren't explored and that'll be awesome. So get at them! 🔨🔨

Data

The dataset used in this practice was sourced here. It contains details of each country/dependent territory's world population as of 2020, their yearly growth rate, and other population-related information.

Other Tools & Projects

S/N Tool Awesome projects
1 PowerBI Geospatial Data visualization Using Kepler, Mapbox and PowerBi
2 Folium Complete End to End Geographical Data Visualization Project for Python Beginners
3 Dash Interactive, Web-Based Dashboards in Python
4 Seaborn Mexican Government Report Text Analysis

Project Ideas

  • Go through this Customers' dataset, to gain an understanding of the features it has.
  • Get creative with your data exploration using the visualization tools you are familiar with.