Pipecat Cloud Vision Bot

A web-based user interface for interacting with a Pipecat Cloud Vision Bot using React, Next.js, and the Pipecat React SDK.

Project Overview

This application enables a visual conversation with an AI-powered vision bot. The assistant processes and responds to camera input in real time, with Pipecat Cloud by Daily providing the backend services.

Tech Stack

Frontend: Next.js with React and TypeScript
UI Framework: Tailwind CSS, ShadCN components
Camera Integration: WebRTC-based custom camera hook
Backend Service: Pipecat Cloud by Daily
SDK/Packages:
- @pipecat-ai/client-js
- @pipecat-ai/client-react
- @pipecat-ai/daily-transport

Development Plan

Stage 1: Project Setup ✅

Initialize Next.js Project
Install Dependencies
Set up Environment Variables

Stage 2: Core UI Structure (In Progress)

Create Basic Layout
Implement Camera Component
Create Pipecat Client Provider

Stage 3: Pipecat SDK Integration

Establish Connection
Implement Camera Stream Handling
Implement Basic Interaction Controls

Stage 4: Camera Integration Details

Analyze the dailyco-camera-3-18-25 repository
Adapt Camera Component
Implement Rear Camera Functionality

Stage 5: Testing and Refinement

Local Testing
Pipecat Cloud Integration Testing
Iterate and Refine

Current Implementation

The project currently includes:

Basic UI Components:
- Welcome screen
- Camera feed
- Status bar
- Controls
- Caption display
Custom Hooks:
- useCustomCamera: For camera access and management
- useSessionTimer: For tracking session duration
- useAudioLevels: For audio processing
Core Application Flow:
- User welcome and permissions
- Camera setup
- Connection handling
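
As an illustration of the kind of logic a hook like useSessionTimer might wrap, here is a minimal sketch of a formatter that turns elapsed seconds into an `mm:ss` label for the status bar. The function name `formatElapsed` and the `mm:ss` display format are assumptions made for this example, not taken from the project's code.

```typescript
// Hypothetical helper: the real useSessionTimer hook may expose a different shape.
function formatElapsed(totalSeconds: number): string {
  // Clamp negatives and drop fractions so the label never shows odd values.
  const safe = Math.max(0, Math.floor(totalSeconds));
  const minutes = Math.floor(safe / 60);
  const seconds = safe % 60;
  // Zero-pad each part to at least two digits.
  const pad = (n: number): string => (n < 10 ? "0" + n : String(n));
  return pad(minutes) + ":" + pad(seconds);
}
```

Keeping the formatting in a pure function means it can be unit tested without rendering a component or running a real timer.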

TODO Priorities

Integrate Pipecat Client SDK:
- Set up RTVIClient provider context
- Implement connection with Pipecat backend
Connect Camera to Pipecat:
- Ensure the camera component works with Pipecat React SDK
- Ensure rear camera selection functions properly
Implement Real-time Communication:
- Set up message handling for the Vision Bot
- Implement real-time captioning based on bot responses
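
For the rear camera item, one possible approach is sketched below. It assumes the browser's standard `navigator.mediaDevices` API: prefer a video input whose label suggests a rear-facing camera, and otherwise fall back to the `facingMode: "environment"` constraint. The function name, the local types, and the label pattern are hypothetical; device labels vary by browser and are empty until the user grants camera permission, so the fallback matters.

```typescript
// Minimal local shapes so the sketch compiles without DOM typings.
interface CameraDevice {
  deviceId: string;
  kind: string;
  label: string;
}

type VideoConstraint =
  | { deviceId: { exact: string } }
  | { facingMode: { ideal: string } };

// Hypothetical helper, not part of the Pipecat SDK or this repository.
function buildRearCameraConstraint(devices: CameraDevice[]): VideoConstraint {
  // Assumed label pattern: labels are browser-specific and may be empty.
  const rear = devices.find(
    (d) => d.kind === "videoinput" && /back|rear|environment/i.test(d.label)
  );
  if (rear) {
    // A labelled rear camera was found, so request that exact device.
    return { deviceId: { exact: rear.deviceId } };
  }
  // Otherwise let the browser choose, with a preference for the rear camera.
  return { facingMode: { ideal: "environment" } };
}
```

In the browser, the result would be passed as the `video` field of the constraints given to `navigator.mediaDevices.getUserMedia`, with the device list coming from `navigator.mediaDevices.enumerateDevices()`.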

Important Resources

Getting Started

Clone the repository
Install dependencies:
```
npm install
```

Create a .env.local file with the required environment variables:

PIPECAT_API_URL=your_api_url
# Add any other required variables

Run the development server:
```
npm run dev
```
Open http://localhost:3000 in your browser

Configuration

For Pipecat SDK configuration, you'll need to:

Set up a Pipecat Cloud account
Configure your Vision Bot on Pipecat Cloud
Obtain necessary API keys and endpoints
Update the environment variables accordingly
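
Because a missing or placeholder value is easy to overlook, it can help to validate the environment at startup. The following is a minimal sketch that assumes `PIPECAT_API_URL` is the only required variable, as in the `.env.local` example; the function name `readPipecatConfig` and the trailing-slash handling are illustrative choices, not part of the project.

```typescript
type Env = Record<string, string | undefined>;

// Hypothetical helper: fails fast when the variable is unset or still a placeholder.
function readPipecatConfig(env: Env): { apiUrl: string } {
  const raw = (env.PIPECAT_API_URL ?? "").trim();
  if (raw === "" || raw === "your_api_url") {
    throw new Error("PIPECAT_API_URL is not set; add it to .env.local");
  }
  // Strip trailing slashes so paths can be appended consistently (a design choice).
  return { apiUrl: raw.replace(/\/+$/, "") };
}
```

In a Next.js app this could be called with `process.env` from server-side code. Variables without the `NEXT_PUBLIC_` prefix are not exposed to the browser, so a check like this belongs in an API route or other server code.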

Contributing

Please follow the staged development approach outlined in the Development Plan and ensure all changes are properly tested before submitting.
