A project idea developed during the winter break to use python for scraping environmental data from a website and then create visualizations. The final result is to be displayed on a self-created webpage.
A majority of the information and workflow applied in the project can be found from this article
- Pick a website
- Web scraping
- Store data into two .csv files (can be found in doc)
- Calling Gender API to enrich the data
- Generate basic charts using matplotlib, nltk
- Create mysql database / (Firebox) and store the data
- Python 3 (bs4, requset, matplotlib, nltk, FLask)
- MySQL
- Gender API
Please see the data scrapped stored in the data_title_author.csv and key_words.csv
Along with genders.csv for predicted gender (data enrichment)
Some plots can be found under /plots.