A Python-based web scraper for Amazon, using the Playwright library.
This repository contains a web scraper for Amazon that allows you to extract product information, such as name, price, and URL, directly from the website. It is built using Python and the Playwright library.
Welcome to the Web Scraper for Amazon! This guide will help you set up your development environment and run the web scraping application.
Before you begin, ensure you have the following installed on your system:
Follow these steps to set up your development environment:
-
Clone the repository:
git clone https://github.com/Luizzavala/web-scraper
-
Navigate to the project directory:
cd web-scraper
-
Add an .env file to your
.gitignore
to manage sensitive data securely. -
Install the required Python packages:
pip install -r requirements.txt
Feature | Status | Details |
---|---|---|
CLI | ✅ Completed | Command-line interface for easy interaction |
Logging | ✅ Completed | Detailed logging system for tracking and debugging |
Scrapers | ||
- Amazon | ✅ Completed | Extracts product information, prices, availability, and reviews |
- MercadoLibre | ✅ Completed | Extracts product information from MercadoLibre |
This project is maintained and developed by the following contributors:
- @Luizzavala - Initial development and project maintenance.
- @DeviantRacoon - Contributing to the project.
See the list of contributors who participated in this project.
Feel free to open issues or submit pull requests for any improvements or feature suggestions!