Skip to content

Latest commit

 

History

History
79 lines (53 loc) · 2.52 KB

README.md

File metadata and controls

79 lines (53 loc) · 2.52 KB

Web Scraper

Status GitHub Commits License


A Python-based web scraper for Amazon, using the Playwright library.

Table of Contents

About

This repository contains a web scraper for Amazon that allows you to extract product information, such as name, price, and URL, directly from the website. It is built using Python and the Playwright library.

Getting Started

Welcome to the Web Scraper for Amazon! This guide will help you set up your development environment and run the web scraping application.

Prerequisites

Before you begin, ensure you have the following installed on your system:

Installation

Follow these steps to set up your development environment:

  1. Clone the repository:

    git clone https://github.com/Luizzavala/web-scraper
  2. Navigate to the project directory:

    cd web-scraper
  3. Add an .env file to your .gitignore to manage sensitive data securely.

  4. Install the required Python packages:

    pip install -r requirements.txt

Features

Feature Status Details
CLI ✅ Completed Command-line interface for easy interaction
Logging ✅ Completed Detailed logging system for tracking and debugging
Scrapers
- Amazon ✅ Completed Extracts product information, prices, availability, and reviews
- MercadoLibre ✅ Completed Extracts product information from MercadoLibre

Authors

This project is maintained and developed by the following contributors:

See the list of contributors who participated in this project.


Feel free to open issues or submit pull requests for any improvements or feature suggestions!