Hank Green had a video, where he asked his viewers to recommend books, movies, TV shows, and video games, that they thought were calming. I iterated through all the comments, stored them in a txt file, then used GPT 3.5 Turbo to parse the comments into a list of recommendations, grouped by category. I then used GPT 4 Code Interpreter to analyze the recommendations, and create visualizations for each category.
Here's an animated overview of the top recommendations across all categories:
For a more detailed look at each category, check out the static visualizations below:
- Collection: YouTube comments were collected from Hank Green's video asking for calming media recommendations.
- Processing: Comments were processed using GPT-3.5 Turbo to extract and categorize recommendations.
- Analysis: Data was cleaned and analyzed to identify the most frequently mentioned items in each category.
- Visualization: Visualizations were created to showcase the findings.
-
Clone this repository:
git clone https://github.com/yourusername/HankComments.git cd HankComments
-
Install the required dependencies:
pip install -r requirements.txt
-
Create a
.env
file with your API keys (see.env.example
for format):cp .env.example .env
Edit the
.env
file with your API keys.
Run the complete pipeline with a single command: