Persona 🎭

Real-time AI Language Learning Assistant

Awards 🏆

First Place - UofTHacks 2025

Persona transforms language learning through an immersive AI tutoring experience that adapts to you in real-time. By combining computer vision, neural networks, and 3D animation, Persona creates a natural learning environment that understands and responds to your facial expressions, pronunciation, and learning style.

Features 🌟

Real-time Emotional Understanding: Analyzes facial expressions to gauge engagement and understanding
Precise Pronunciation Feedback: Tracks lip movements for accurate pronunciation guidance
Fluid 3D Animation: Generates natural, lip-synced character animations that respond to your interactions
Adaptive Learning: Personalizes conversations and lessons based on your progress and learning style
Multi-modal Processing: Simultaneously handles video, audio, and text inputs for seamless interaction

Technical Architecture 🔧

Computer Vision Pipeline

Continuous facial analysis using deep learning models
Advanced facial landmark detection
Emotion recognition neural networks
Multi-threaded feature extraction

3D Animation System

Real-time rigging and animation (Mixamo + Blender)
Live lip-sync through Rhubarb phoneme detection
Custom animation blending
Synchronized facial expression mapping

Natural Language Processing

WhisperAPI for speech-to-text
ElevenLabs for dynamic voice generation
Claude-powered conversation engine
Parallel AI model processing

System Requirements 💻

CPU: 4+ cores recommended for parallel processing
GPU: NVIDIA GPU with CUDA support (8GB+ VRAM recommended)
RAM: 16GB minimum
Storage: 5GB for models and basic assets
Webcam: Required for facial analysis
Microphone: Required for speech input

Architecture Overview 🏗️

The system operates through a microservices architecture that coordinates multiple processes:

Video Processing Service
- Handles real-time facial analysis
- Extracts emotional and pronunciation features
Animation Service
- Generates fluid 3D character movements
- Synchronizes lip movements with speech
Conversation Service
- Manages AI dialogue flow
- Processes language learning logic
Integration Layer
- Orchestrates all services
- Maintains real-time performance

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
backend		backend
flask-backend		flask-backend
frontend		frontend
uoft-hacks-12		uoft-hacks-12
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Persona 🎭

Real-time AI Language Learning Assistant

Awards 🏆

Features 🌟

Technical Architecture 🔧

Computer Vision Pipeline

3D Animation System

Natural Language Processing

System Requirements 💻

Architecture Overview 🏗️

About

Releases

Packages

Contributors 2

Languages

lxyhan/Persona-UofT-Hacks-12

Folders and files

Latest commit

History

Repository files navigation

Persona 🎭

Real-time AI Language Learning Assistant

Awards 🏆

Features 🌟

Technical Architecture 🔧

Computer Vision Pipeline

3D Animation System

Natural Language Processing

System Requirements 💻

Architecture Overview 🏗️

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages