TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers realtime capabilities to see, hear, and speak, and is fully compatible with popular workflow platforms like Dify and Coze.

TEN-framework/TEN-Agent

TEN Agent banner


✨ TEN Agent + Deepseek

TEN Agent + Deepseek

TEN is a versatile framework, and TEN Agent is compatible with DeepSeek R1. Try realtime conversations with DeepSeek R1!


✨ TEN Agent + ESP32

TEN Agent ESP32 Client

TEN Agent now runs on the Espressif ESP32-S3 Korvo V3 development board, an excellent way to bring realtime communication with LLMs to hardware.


TEN Agent + Dify with RAG + Coze

TEN Agent + Dify Agent with RAG

Dify with RAG

TEN Agent + Dify

TEN Agent + Coze

TEN offers great support for making the realtime interactive experience even better on other LLM platforms as well. Check out the docs for more.


TEN Agent + Gemini Multimodal Live API

Gemini 2.0 Multimodal Live API

Usecases

Try the Google Gemini Multimodal Live API with realtime vision and realtime screenshare detection capabilities. It is a ready-to-use extension, along with powerful tools like Weather Check and Web Search, all integrated into TEN Agent.


TEN Agent + Storyteller + Image Generator

Storyteller + Image Generator

Usecases

Describe a topic and ask TEN Agent to tell you a story while also generating images of the story to provide a more immersive experience for kids.


TEN Agent Usecases

Usecases


Ready-to-use Extensions

Ready-to-use Extensions


TEN Agent Playground in Local Environment

Prerequisites

Category                       Requirements
Keys                           • Agora App ID and App Certificate (free minutes every month)
                               • OpenAI API key
                               • Deepgram ASR (free credits available with signup)
                               • FishAudio TTS (free credits available with signup)
Installation                   • Docker / Docker Compose
                               • Node.js (LTS) v18
Minimum System Requirements    • CPU >= 2 cores
                               • RAM >= 4 GB
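
The installation and CPU requirements above can be checked before starting. The following is a minimal sketch, not part of the project: the function names are illustrative, and it assumes that Node.js releases newer than v18 are acceptable (the table only names v18). It does not check RAM or API keys.

```shell
#!/usr/bin/env bash
# Sketch: check some of the prerequisites listed above.
# All function names here are illustrative, not part of TEN Agent.

# Succeeds when a version string such as "v18.19.0" has a major version >= 18.
# Assumption: newer releases are acceptable; the table only names v18.
node_major_ok() {
  local version="${1#v}"        # drop the leading "v"
  local major="${version%%.*}"  # keep the text before the first dot
  [ "$major" -ge 18 ] 2>/dev/null
}

# Succeeds when the given CPU core count meets the 2-core minimum.
cpu_ok() {
  [ "$1" -ge 2 ] 2>/dev/null
}

# Prints one line per problem found; succeeds only if none were found.
check_prereqs() {
  local status=0
  command -v docker >/dev/null 2>&1 || { echo "missing: docker"; status=1; }
  docker compose version >/dev/null 2>&1 || { echo "missing: docker compose"; status=1; }
  if command -v node >/dev/null 2>&1; then
    node_major_ok "$(node --version)" || { echo "node is older than v18"; status=1; }
  else
    echo "missing: node"; status=1
  fi
  cpu_ok "$(getconf _NPROCESSORS_ONLN)" || { echo "fewer than 2 CPU cores"; status=1; }
  return "$status"
}
```

After sourcing the file, run `check_prereqs`; it prints nothing when every check passes.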

macOS: Docker setting on Apple Silicon

For Apple Silicon Macs, uncheck "Use Rosetta for x86/amd64 emulation" in Docker settings. Note: This may result in slower build times on ARM, but performance will be normal when deployed to x64 servers.

Docker Setting


Next step

1. Create .env file

cp ./.env.example ./.env

2. Set up Agora App ID and App Certificate in .env

AGORA_APP_ID=
AGORA_APP_CERTIFICATE=
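
A blank value in .env is easy to miss. As a rough sketch, the helper below reports which of the two Agora keys still have no value. The function names are hypothetical, and the sketch assumes plain KEY=value lines without quotes and that both values are needed for your Agora project.

```shell
#!/usr/bin/env bash
# Sketch: report Agora keys left blank in an env file.
# Function names are illustrative, not part of TEN Agent.

# Print the value assigned to KEY in FILE (empty if the key is unset or blank).
env_value() {
  local file="$1" key="$2"
  # Use the last matching assignment and strip the "KEY=" prefix.
  grep -E "^${key}=" "$file" | tail -n 1 | cut -d= -f2-
}

# Print each Agora key that has no value; succeed only if none are missing.
check_agora_env() {
  local file="${1:-./.env}" missing=0 key
  for key in AGORA_APP_ID AGORA_APP_CERTIFICATE; do
    if [ -z "$(env_value "$file" "$key")" ]; then
      echo "not set: $key"
      missing=1
    fi
  done
  return "$missing"
}
```

Running `check_agora_env ./.env` before starting the containers prints a `not set:` line for each blank key.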

3. Start agent development containers

docker compose up -d

4. Enter container

docker exec -it ten_agent_dev bash
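
If `docker exec` fails, the container may not be running yet. A small sketch for checking first: `has_container` is a hypothetical helper that looks for an exact name in the output of `docker ps`, using the container name `ten_agent_dev` from the command above.

```shell
#!/usr/bin/env bash
# Sketch: confirm a container is listed before trying to enter it.
# has_container is an illustrative helper, not part of TEN Agent.

# Succeeds when NAME appears as an exact line on stdin.
has_container() {
  grep -qxF -- "$1"
}

# Usage (requires a running Docker daemon):
#   docker ps --format '{{.Names}}' | has_container ten_agent_dev \
#     && docker exec -it ten_agent_dev bash
```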

5. Build agent

task use

6. Start the web server

task run

7. Edit playground settings

Open the playground at localhost:3000 to configure your agent.

  1. Select a graph type (e.g. Voice Agent, Realtime Agent)
  2. Choose a corresponding module
  3. Select an extension and configure its API key settings
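
The web server from step 6 may take a moment to come up. Before opening the playground, a retry loop like the sketch below can confirm that localhost:3000 responds. `retry` is an illustrative helper; the attempt count and delay in the usage line are arbitrary choices.

```shell
#!/usr/bin/env bash
# Sketch: retry a command until it succeeds.
# retry is an illustrative helper, not part of TEN Agent.

# Run a command up to ATTEMPTS times, sleeping DELAY seconds between tries.
# Usage: retry ATTEMPTS DELAY command [args...]
retry() {
  local attempts="$1" delay="$2" i=1
  shift 2
  while [ "$i" -le "$attempts" ]; do
    "$@" && return 0
    i=$((i + 1))
    # Sleep only if another attempt will follow.
    [ "$i" -le "$attempts" ] && sleep "$delay"
  done
  return 1
}

# Usage: poll the playground for up to about a minute.
#   retry 30 2 curl -fsS -o /dev/null http://localhost:3000 \
#     && echo "playground is up"
```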

Module Example

The playground is now set up. This is just the beginning: there are many ways to explore and use TEN Agent. To learn more, please refer to the documentation.


TEN Agent Components

Components Diagram


Stay Tuned

Star our repository to get instant notifications for all new releases!

TEN star us gif


Join Community

  • Discord: Ideal for sharing your applications and engaging with the community.
  • GitHub Discussion: Perfect for providing feedback and asking questions.
  • GitHub Issues: Best for reporting bugs and proposing new features. Refer to our contribution guidelines for more details.
  • X: Great for sharing your agents and interacting with the community.

Star History

Star History Chart


Code Contributors

TEN


Contribution Guidelines

Contributions are welcome! Please read the contribution guidelines first.


License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.
