WillCaton2350/Wikipedia-WebCrawler
About
Wikipedia web crawler written in Python and Scrapy. The ETL process involves several steps: Scrapy spiders extract specific data from multiple Wikipedia pages, Scrapy Items organize it into a structured format, and the results are exported as JSON for further analysis and loading into MySQL Workbench. The JSON dataset can also serve as a data source for an API, improving data accessibility.
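As a sketch of the extract step, a Scrapy spider can pair a `scrapy.Item` schema with CSS selectors and follow in-article links to crawl multiple pages. The spider name, start URL, and field names below are illustrative assumptions, not taken from this repository:

```python
import scrapy


class WikipediaItem(scrapy.Item):
    # Hypothetical fields; the real project defines its own Item schema.
    title = scrapy.Field()
    url = scrapy.Field()
    summary = scrapy.Field()


class WikipediaSpider(scrapy.Spider):
    name = "wikipedia"  # assumed spider name
    allowed_domains = ["en.wikipedia.org"]
    start_urls = ["https://en.wikipedia.org/wiki/Web_crawler"]

    def parse(self, response):
        # Populate the Item from the page: heading and first paragraph text.
        item = WikipediaItem()
        item["title"] = response.css("h1#firstHeading ::text").get()
        item["url"] = response.url
        item["summary"] = response.css("div.mw-parser-output > p::text").get()
        yield item

        # Follow in-article /wiki/ links so the crawl spans multiple pages.
        for href in response.css("a[href^='/wiki/']::attr(href)").getall():
            yield response.follow(href, callback=self.parse)
```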
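Saving the scraped Items as JSON can be done with Scrapy's built-in feed exports; the output file name below is an assumption:

```python
# settings.py — feed export configuration (Scrapy 2.1+ FEEDS syntax)
FEEDS = {
    "wikipedia.json": {
        "format": "json",
        "encoding": "utf8",
        "overwrite": True,
    },
}
```

Equivalently, the feed can be written from the command line with `scrapy crawl wikipedia -O wikipedia.json`.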
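For the load step, the JSON feed could then be inserted into MySQL and browsed in MySQL Workbench. A minimal sketch using mysql-connector-python; the table name, schema, and connection credentials are all placeholder assumptions (Workbench's own import wizard is an alternative):

```python
import json

import mysql.connector  # pip install mysql-connector-python

# Placeholder credentials; substitute your own MySQL connection details.
conn = mysql.connector.connect(
    host="localhost", user="root", password="***", database="wikipedia"
)
cur = conn.cursor()
cur.execute(
    """CREATE TABLE IF NOT EXISTS articles (
           id INT AUTO_INCREMENT PRIMARY KEY,
           title VARCHAR(255),
           url VARCHAR(512),
           summary TEXT
       )"""
)

# Scrapy's json feed format is a single JSON array of item objects.
with open("wikipedia.json", encoding="utf8") as f:
    for row in json.load(f):
        cur.execute(
            "INSERT INTO articles (title, url, summary) VALUES (%s, %s, %s)",
            (row.get("title"), row.get("url"), row.get("summary")),
        )

conn.commit()
cur.close()
conn.close()
```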