Animated story video generation #450

Yousif-GO · 2025-02-11T02:37:17Z

This Code demonstrates how to generate an animated story video by:

Generating a story sequence using structured Google Gemini API (for character consistency).
Generating images for each scene using Google’s Imagen API.
Synthesizing narration audio using Kokoro's KPipeline.
Creating short video clips (image + audio overlay) for each scene.
Combining all clips into one final video.
Cleaning up temporary files after processing.

1739225609_output_video.mp4

…structured Google Gemini API (for character consistency) and Imagen

review-notebook-app · 2025-02-11T02:37:23Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

markmcd · 2025-02-12T09:14:20Z

This is very cool! I love that it's a complex end-to-end example, contains text instructions throughout, and uses Google models and open-source tools exclusively. Plus it's fun!

I'll go over in a bit more detail when I get some time but for now, would you be able to use the google-genai SDK instead of google-generativeai? It'd be great to show this example off, and having the latest SDK makes it a useful demo.

Have you tried doing the audio generation using Gemini 2.0 by any chance? It's only available through the Live API now. But there is also a preview that we can look at getting you onto. This isn't required at all, but something we'll want to do once audio generation becomes GA.

And thank you for following our contrib template! Do you want to add your name on this anywhere? You're welcome to add a byline at the top with a link to a social account or website.

Yousif-GO · 2025-02-12T19:19:23Z

Hi Mark,

Thank you for your feedback. I just created another pull request with the suggested changes. The new code now provides a full demonstration of Gemini text, Live API, and Imagen working together to generate a story video.

Notable changes include:

Removed the use of Kokoro and added Gemini Live instead.
Tweaked various parts of the code to ensure smooth functionality.
Updated to the latest google-genai SDK rather than using google-generativeai.

Also , an access to Native audio output would also be great for additional experimentation.

I'm glad you liked it, and I hope this demo could effectively showcases the capabilities of Gemini and Imagen in a fun and engaging way.

Yousif-GO added 2 commits February 10, 2025 21:25

add a new example which shows how to generate a story sequence using …

dabf3b4

…structured Google Gemini API (for character consistency) and Imagen

add a new example which shows how to generate a story sequence using …

916c80b

…structured Google Gemini API (for character consistency) and Imagen

github-actions bot added status:awaiting review PR awaiting review from a maintainer component:examples Issues/PR referencing examples folder labels Feb 11, 2025

Yousif-GO closed this Feb 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Animated story video generation #450

Animated story video generation #450

Yousif-GO commented Feb 11, 2025

review-notebook-app bot commented Feb 11, 2025

markmcd commented Feb 12, 2025

Yousif-GO commented Feb 12, 2025

Animated story video generation #450

Animated story video generation #450

Conversation

Yousif-GO commented Feb 11, 2025

review-notebook-app bot commented Feb 11, 2025

markmcd commented Feb 12, 2025

Yousif-GO commented Feb 12, 2025