Skip to content

This project involves creating a real-time stock market data pipeline using Kafka and AWS services.

Notifications You must be signed in to change notification settings

LokeshReddy-18/Stock-Market-AWS-Kafka-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Stock-Market-AWS-Kafka-Project

Screenshot 2024-07-27 at 8 00 12 PM

Overview

A Kafka cluster running on an EC2 instance to stream stock market data. A Python script running on an AWS SageMaker notebook to consume data from Kafka and upload it to an S3 bucket. AWS Glue crawlers to catalog the data in the S3 bucket. AWS Athena to query and analyze the data.

Components

Kafka Cluster: Deployed on AWS EC2 for streaming data.
AWS EC2: Hosts the Kafka server.
SageMaker Notebook: Consumes Kafka messages, processes them, and uploads them to S3.
Amazon S3: Stores the processed data.
AWS Glue: Crawls the data in S3 to create tables.
AWS Athena: Provides interactive querying of the data stored in S3.

Kafka Set Up on EC2

Install Java:

sudo yum install java-11-openjdk

Download and Extract Kafka:

wget https://downloads.apache.org/kafka/3.3.1/kafka_2.12-3.3.1.tgz
tar -xzf kafka_2.12-3.3.1.tgz
cd kafka_2.12-3.3.1

Start Zookeeper Server:

bin/zookeeper-server-start.sh config/zookeeper.properties

Start Kafka Server:

bin/kafka-server-start.sh config/server.properties

Create Kafka Topic:

bin/kafka-topics.sh --create --topic stock_market --bootstrap-server localhost:9092 --replication-factor 1 --partitions 1

Producer and Consumer Codes on Repository

Conclusion

This project sets up a comprehensive data pipeline for real-time stock market data using Kafka, AWS services, and provides mechanisms for data analysis using AWS Glue and Athena. Ensure all components are correctly configured and permissions are set to facilitate smooth data flow and analysis.

About

This project involves creating a real-time stock market data pipeline using Kafka and AWS services.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published