pysaprk

Star

Here are 9 public repositories matching this topic...

LalitSharma7 / F1-Data-Analysis

Star

Project based on application of azure databricks

azure databricks pysaprk pyspark-sql

Updated Mar 7, 2023
Python

Munanga / Pyspark-Analysis

Star

Sample analysis done using pyspark on parking violations issued for fiscal year 2017 using the databricks platform

python csv big-data spark analysis spark-sql pysaprk

Updated Feb 1, 2020
HTML

victorlifan / Sparkify--Pyspark-Big-Data-Project

Star

This project performed data wrangling, analysis, visualization as well as machine learning prediction on a hypothetical music app's user churn with pyspark.

machine-learning spark data-visualization pysaprk

Updated Mar 22, 2022
Jupyter Notebook

miltiadiss / CEID_NE4348-Big-Data-Management-Systems

Star

This project implements a real-time data pipeline with Kafka, Spark, and MongoDB. It generates vehicle data using UXSIM, streams it to a Kafka broker, processes it with Spark, and stores raw and processed data in MongoDB. Queries analyze vehicle counts, speeds, and routes over specified periods.

kafka pymongo pandas pysaprk uxsim

Updated Sep 20, 2024
Python

yhskgo / pyspark_deep_learning

Star

spark pysaprk

Updated Oct 8, 2019
Jupyter Notebook

SA01 / spark-english-api-tutorial

Star

ontains the code and examples for my article on Medium, which introduces the English SDK for Apache Spark, showcasing how to combine the power of Apache Spark with large language models (LLMs)

python big-data spark analytics data-engineering spark-sql pysaprk llm generative-ai

Updated Oct 25, 2024
Python

adharangaonkar / ETL-Pipelines

Star

A repository concentrating on using High end parallel pipelines to perform ETL across various data sources

spark etl postgresql aws-ec2 etl-pipeline redshift-cluster pysaprk

Updated Sep 23, 2021
Jupyter Notebook

johngodoi / learning_pyspark

Star

linkedinlearning pysaprk

Updated Jan 21, 2022
Jupyter Notebook

jpoberhauser / dist_comp_final

Star

NBA shot predictions with PySpark and SparkML

machine-learning pysaprk

Updated Dec 18, 2018
Jupyter Notebook

Improve this page

Add a description, image, and links to the pysaprk topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pysaprk topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pysaprk

Here are 9 public repositories matching this topic...

LalitSharma7 / F1-Data-Analysis

Munanga / Pyspark-Analysis

victorlifan / Sparkify--Pyspark-Big-Data-Project

miltiadiss / CEID_NE4348-Big-Data-Management-Systems

yhskgo / pyspark_deep_learning

SA01 / spark-english-api-tutorial

adharangaonkar / ETL-Pipelines

johngodoi / learning_pyspark

jpoberhauser / dist_comp_final

Improve this page

Add this topic to your repo