This repository contains the project for the Big Data course.
Download the dataset:
#!/bin/bash
curl -L -o ~/Downloads/spotify-millions-playlist.zip \
  https://www.kaggle.com/api/v1/datasets/download/adityak80/spotify-millions-playlist
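Once the archive has been downloaded, it still needs to be unpacked. The following is a minimal sketch of that step, not part of the original instructions: the helper name `extract_dataset` and the destination directory are assumptions chosen for illustration, and it presumes the `unzip` utility is installed.

```shell
#!/bin/bash
# Hypothetical helper: unpack the downloaded archive into a target directory.
extract_dataset() {
  local archive="$1"   # path to the zip file downloaded with curl
  local dest="$2"      # destination directory (an assumption, not specified in this README)

  # Fail early if the download did not produce a file.
  if [ ! -f "$archive" ]; then
    echo "Archive not found: $archive" >&2
    return 1
  fi

  mkdir -p "$dest"
  # -o overwrites existing files without prompting; -q keeps the output quiet.
  unzip -o -q "$archive" -d "$dest"
}

# Example call (run manually once the download has finished):
# extract_dataset ~/Downloads/spotify-millions-playlist.zip ~/Downloads/spotify-millions-playlist
```

The early check on the archive path means a failed or interrupted download is reported before any directory is created.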
Additional readmes:
- Initial Setup (instructions to set up the environment)
- AWS CLI cheatsheet (a collection of the most common AWS CLI commands)
- AWS Workflow (a step-by-step guide to setting up the AWS environment and using it to deploy Spark jobs)
- Exam Project (instructions for the project, which is mandatory for the exam)