Dockerfile
Makefile
README.md
app.py
requirements.txt

README.md

Web Scraping Wikipedia: getting data from a particular website for further analysis.

This project gets data from Wikipedia using Python, specifically market capitalization information for the top companies in the world. Why Wikipedia? It is a great source of information and it is free. Knowing the market capitalization of companies also helps investors adjust their portfolios.

Installation

Use the package manager pip to install the following packages.

NB. You can install the packages in a virtual environment or globally. We used Python 3.10 for this example.

Virtual Environment

pip install virtualenv

Activate Virtual Environment

source venv/bin/activate

pip install requests pandas lxml matplotlib seaborn
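
To confirm that the packages are visible to the interpreter you are using, a small check like the one below may help. It is only a sketch: the helper name `missing_packages` is hypothetical and not part of this project, and it assumes each package's import name matches its pip name, which holds for the five packages listed above.

```python
from importlib.util import find_spec

# Packages named in the pip command above.
REQUIRED = ["requests", "pandas", "lxml", "matplotlib", "seaborn"]


def missing_packages(names=REQUIRED):
    """Return the names that cannot be found in the current environment."""
    # find_spec returns None for a top-level module that is not installed.
    return [name for name in names if find_spec(name) is None]


# Prints an empty list when every package is available.
print(missing_packages())
```

Run it with the same interpreter you installed into, for example from inside the activated virtual environment.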

Usage

We use a Makefile to run the project. The Makefile contains the following commands:

make install

This command installs the required packages. Activate the virtual environment before running the command:

. .venv/bin/activate

make run

This command runs the project.
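
The contents of app.py are not shown here, so the following is only a rough sketch of the kind of table extraction a script like this performs, not the project's actual code. It deliberately uses only the standard library so it runs without extra packages; with the packages installed above, `pandas.read_html` would typically do the parsing in one call. The URL, the `TableParser` class, and the `fetch_html` and `parse_tables` helpers are all assumptions made for illustration.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen

# Assumed URL: the README does not say which Wikipedia page is scraped.
URL = "https://en.wikipedia.org/wiki/List_of_public_corporations_by_market_capitalization"


class TableParser(HTMLParser):
    """Collect every <table> as a list of rows, each a list of cell strings.

    Limitation of this sketch: nested tables are not handled.
    """

    def __init__(self):
        super().__init__()
        self.tables = []   # finished tables
        self._rows = None  # rows of the table being read
        self._row = None   # cells of the row being read
        self._cell = None  # text fragments of the cell being read

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._rows, self._row, self._cell = [], None, None
        elif tag == "tr" and self._rows is not None:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._rows.append(self._row)
            self._row = None
        elif tag == "table" and self._rows is not None:
            self.tables.append(self._rows)
            self._rows, self._row, self._cell = None, None, None


def parse_tables(html):
    """Return all tables found in an HTML string."""
    parser = TableParser()
    parser.feed(html)
    return parser.tables


def fetch_html(url=URL):
    """Download a page; a descriptive User-Agent is sent with the request."""
    request = Request(url, headers={"User-Agent": "webscrape-readme-example/0.1"})
    with urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")
```

Calling `parse_tables(fetch_html())` returns one list of rows per table on the page. Which table holds the market capitalization figures depends on the page layout at the time of the request, so inspect the header row of each table before relying on an index.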

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

Creative Commons Zero v1.0 Universal
