Second lab for the Data-Intensive Computing course at KTH, where we use Apache Kafka, Spark, and Cassandra to practice stream processing.
Updated Feb 10, 2021 - Jupyter Notebook
First lab for the Data-Intensive Computing course at KTH, where we are introduced to Apache Spark MLlib, Spark SQL, Hadoop, and HBase.