📍 San Jose, CA | LinkedIn | 📧 saivivek.chunduri123@gmail.com
🔍 Data Engineer | Data Analyst | Machine Learning Enthusiast
📊 Passionate about turning data into insights through SQL, Python, Power BI, Tableau & Cloud Technologies
🚀 I love working with Big Data, Real-time Streaming, ETL Pipelines, and Data Science solutions
- Developed a fraud detection system using Kafka, PySpark, and MongoDB, processing over 1M transactions with a 99% anomaly detection rate.
- Designed Kibana and Flask dashboards with a 1-second refresh interval to visualize fraud detection and transaction patterns.
- GitHub: Real-Time Smart Bank Data Streaming
- Built 3 interactive Power BI dashboards using DAX and exploratory data analysis (EDA).
- Conducted churn analysis on 7,043 customers, identifying a 26.54% churn rate to inform actionable retention strategies.
- GitHub: PwC Power BI Job Simulation
- Built Power BI dashboards analyzing 3.14M ratings and 994K votes, uncovering key insights for sales and customer engagement.
- Developed automated Azure Data Factory pipelines to ingest 35.45K products, and designed data models in Databricks & Synapse Analytics to structure 16 years of data.
- GitHub: E-Commerce Data Analysis on Microsoft Azure
- Built a Kickstarter success prediction model using Random Forest and LSTM, achieving 93% accuracy.
- Designed Power BI & Tableau dashboards analyzing 18.7K projects across 21 countries, identifying funding patterns & success drivers.
- GitHub: 🚀 Kickstarter Success Prediction | Kickstarter Platform Analysis
- Built an LSTM-based model to predict carbon emissions and estimated the cost of achieving carbon neutrality for the top 4 polluting countries.
- GitHub: Carbon Emission Prediction
💾 SQL, Python, R, Java, PySpark
📊 Power BI, Tableau, Kibana, EDA
☁️ AWS, Azure, Databricks, Snowflake, MongoDB, PostgreSQL
🚀 Apache Spark, Hadoop, TensorFlow, Scikit-Learn
🔧 Docker, Git, A/B Testing, Agile, CI/CD
🛠 ETL Pipelines, Data Modeling, Cloud Data Warehousing
- 🔍 I enjoy solving complex data challenges – whether it's optimizing SQL queries or debugging ETL pipelines.
- 🎨 I love creating interactive dashboards that tell compelling data stories.
- 🎯 I regularly work on side projects exploring real-world datasets & predictive modeling.
💡 Open to full-time opportunities in Data Engineering, Data Analytics, and Data Science.
🤝 Excited to collaborate on data-driven projects, research, and real-world challenges. Let's connect!