Skip to content

A full-stack app with JS, HTML, CSS, and Flask where users upload audio files, and AI characters summarize the content.

License

Notifications You must be signed in to change notification settings

HassanZafar-2021/Vocal-Vision

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

51 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Vocal-Vision

Description

This project extends NotebookLM by converting textual queries into engaging video podcasts with talking male and female avatars for input. By integrating advanced AI technologies such as speaker MP3 audio files to submit and video generation, the platform enhances NotebookLM’s capabilities, adding a visual dimension to information sharing. A website hosts this using frontend (JS, CSS, HTML) and backend (MongoDB Atlas, Python Flask).

Table of Contents

Installation

Download Python flask and download mp3 audio files with the support of Google Notebook

Usage

Open localhost python with support of MongoDB

Credits

No collaborators

License

No license

Badges

Input text or audio queries Upload images of speakers Generate personalized video podcasts with AI-generated talking avatars Supports up to 14 languages the agile methodology used to update this project with more languages and longer video duration

Features

Input text or audio queries Upload images of speakers Generate personalized video podcasts with AI-generated talking avatars Supports up to 14 languages the agile methodology used to update this project with more languages and longer video duration Award: This project won me a sponsored track, "Best use of Notebook LM," to win prizes at DivHacks at Columbia University.

How To Contribute

Fork repo and make a pull request with your changes

Tests

Make tests folder and run npm run tests

About

A full-stack app with JS, HTML, CSS, and Flask where users upload audio files, and AI characters summarize the content.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published