Data Science project that researchs the relationship between economic complexity, income-inequality and recession-recovery performance across OECD member countries
Three datasets will be used in this research:
- The economic-complexity index is published by the "Atlas of Economic Complexity" which is part of the Growth Lab at Harvard University.The index data can be downloaded as a CSV file from the official website: https://atlas.cid.harvard.edu/rankings
- The "average OECD relative poverty rate by country" data is extracted from the OECD's "society at a glance 2019 report". This research will use one CSV file that contains the total percentage of people in relative poverty by country. The CSV file was downloaded from the official website: https://www.oecd-ilibrary.org/sites/8483c82f-en/index.html?itemId=/content/component/8483c82f-en
- Given that the GDP per capita in purchasing power parity (PPP) terms data is provided by both the IMF and the World Bank, this work will consider both sources, compare them and then choose one of them to test the hypotheses.
The World Bank data is included in a CSV file downloaded from the official website: https://api.worldbank.org/v2/en/indicator/NY.GDP.PCAP.PP.CD?downloadformat=csv
The IMF data will be web scraped from the official website using the Python package "Beautiful Soup"
To run this project, you will need the following:
- Jupyter Notebooks
- Python and Pip
pip install -r requirements.txt