vlm-run/vlmrun-cookbook

VLM Run Cookbook

Website | Platform | Hub | Docs | Blog | Discord


Welcome to the VLM Run Cookbook, a collection of examples and notebooks that demonstrate structured visual understanding with the VLM Run Platform. The repository hosts practical examples and tutorials for extracting structured data from images, videos, and documents using Vision Language Models (VLMs).

💡 Why Use This Cookbook?


  • 📚 Practical Examples: Colab notebooks demonstrating real-world applications of VLM Run.
  • 🔋 Ready-to-Use: Each example comes with complete code and documentation, making it easy to adapt for your use case.
  • 🎯 Domain-Specific: Examples cover various domains from financial documents to TV news analysis.

📖 Cookbook Notebooks


Our collection of Colab notebooks demonstrates various use cases and integrations:

| Name | Type | Colab | Last Updated |
|------|------|-------|--------------|
| API Quickstart | | Colab | 02-08-2025 |
| Schema Showcase | feature | Colab | 02-08-2025 |
| Visual Grounding | feature | Colab | 02-18-2025 |
| Long-form Video Transcription | feature | Colab | 03-13-2025 |
| Video Inference (Fine-Tuning) | feature | Colab | 02-18-2025 |
| US Drivers License | application | Colab | 02-08-2025 |
| Parsing Financial Presentations | application | Colab | 02-04-2025 |
| TV News Analysis | application | Colab | 02-15-2025 |
| Fashion Product Catalog | application | Colab | 02-20-2025 |
| Fashion Images Hybrid Search | application | Colab | 02-21-2025 |
| Generate Custom Schema | feature | Colab | 03-13-2025 |

🔗 Quick Links