The Twitter Stream Processing Pipeline is an application that designed to handle, process, and visualize insights about tweets in real-time. The pipeline ingests tweets from a simulated generator, processes and handle the data in a scala producer, and send data in python consumer and renders the results in a user-friendly web application using express & react. It supports searchability across text, time, and location.
- Programming Languages: Python 3.8+, JavaScript, scala
- Docker
docker compose up -d
[Optional: Set up Virtual Environment]
-
For Windows:
python -m venv .venv .venv\Scripts\activate
-
For macOS/Linux:
python3 -m venv .venv source .venv/bin/activate
pip install -r requirements.txt
python elasticsearch-consumer.py
-
For Windows:
copy .env.example .env
-
For macOS/Linux:
cp .env.example .env
npm i
npm run start
-
For Windows:
copy .env.example .env
-
For macOS/Linux:
cp .env.example .env
npm i
npm run start