This project lets you run a NATS service exposing a Micro service that can be used to communicate with LLMs.
Start by pulling the `llama3` model into Ollama:

```sh
ollama pull llama3
```
Start the application:

```sh
nats-ai
```
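For context, the service side is a NATS Micro service listening on the `ai.call` subject. The sketch below shows roughly what such a registration looks like with the `nats.go` micro package; the handler body, service name, and version are illustrative assumptions, not the project's actual code.

```go
package main

import (
	"log"
	"os"
	"os/signal"

	"github.com/nats-io/nats.go"
	"github.com/nats-io/nats.go/micro"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Drain()

	// Register a Micro service on the ai.call subject. The real
	// nats-ai handler streams the LLM output, calling Respond once
	// per generated chunk.
	svc, err := micro.AddService(nc, micro.Config{
		Name:    "ai",    // assumed name
		Version: "0.0.1", // assumed version
		Endpoint: &micro.EndpointConfig{
			Subject: "ai.call",
			Handler: micro.HandlerFunc(func(req micro.Request) {
				// Placeholder chunk with the reply headers described
				// below; a real handler would call the model instead.
				req.Respond([]byte("model output chunk"),
					micro.WithHeaders(micro.Headers{
						"nats-model":     {"llama3"},
						"nats-thread-id": {"1234"},
					}))
			}),
		},
	})
	if err != nil {
		log.Fatal(err)
	}
	defer svc.Stop()

	// Keep serving until interrupted.
	sig := make(chan os.Signal, 1)
	signal.Notify(sig, os.Interrupt)
	<-sig
}
```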
Call the endpoint:

```sh
nats req --replies=0 'ai.call' 'Hi there, Can you generate me some Benthos code?'
```
Many replies will be sent back for a single request, so you need to set the number of replies to 0 and rely on the reply timeout (which defaults to 300ms) to know when to stop listening for replies.
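The same pattern can be followed from code: subscribe to a fresh inbox, publish the request with its reply subject set to that inbox, and read until the timeout elapses. Below is a minimal Go sketch, assuming a local NATS server and reusing the 300ms default as the inter-reply timeout:

```go
package main

import (
	"fmt"
	"log"
	"time"

	"github.com/nats-io/nats.go"
)

func main() {
	nc, err := nats.Connect(nats.DefaultURL)
	if err != nil {
		log.Fatal(err)
	}
	defer nc.Drain()

	// Subscribe to a fresh inbox so we receive every reply, not just
	// the first one as nc.Request would.
	inbox := nats.NewInbox()
	sub, err := nc.SubscribeSync(inbox)
	if err != nil {
		log.Fatal(err)
	}

	msg := nats.NewMsg("ai.call")
	msg.Reply = inbox
	msg.Data = []byte("Hi there, Can you generate me some Benthos code?")
	if err := nc.PublishMsg(msg); err != nil {
		log.Fatal(err)
	}

	// Keep reading until no reply arrives within the timeout,
	// mirroring the CLI's --replies=0 plus reply-timeout behaviour.
	for {
		reply, err := sub.NextMsg(300 * time.Millisecond)
		if err != nil {
			break // timeout: the stream of replies has ended
		}
		fmt.Println(string(reply.Data))
	}
}
```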
Each reply will hold the following headers:

- `nats-model`: the model which was used to generate the reply
- `nats-thread-id`: the thread id to identify the conversation thread
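In code these arrive as ordinary NATS message headers. A small hypothetical helper that reads them off a reply, compatible with the reply loop in the client sketch above:

```go
package main

import (
	"fmt"

	"github.com/nats-io/nats.go"
)

// printReplyMeta reads the per-reply headers set by the service and
// prints them alongside the reply body.
func printReplyMeta(reply *nats.Msg) {
	fmt.Printf("model=%s thread=%s: %s\n",
		reply.Header.Get("nats-model"),
		reply.Header.Get("nats-thread-id"),
		reply.Data)
}
```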
By default the `llama3` model is used, but this can be overridden using request headers:
```sh
nats req --replies=0 --header='model:deepseek-coder:33b' 'ai.call' 'Hi there, Can you generate me some Benthos code?'
```
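From code, the override just means setting the `model` header on the outgoing message. The helper below is hypothetical, illustrating the shape of such a request:

```go
package main

import "github.com/nats-io/nats.go"

// newAICall builds an ai.call request; a non-empty model overrides
// the default llama3 via the "model" request header.
func newAICall(prompt, model, inbox string) *nats.Msg {
	msg := nats.NewMsg("ai.call")
	msg.Reply = inbox
	msg.Data = []byte(prompt)
	if model != "" {
		msg.Header.Set("model", model)
	}
	return msg
}
```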
Since each response returns a `nats-thread-id` header, you can use this to resume the conversation:
```sh
nats req --replies=0 --header='nats-thread-id:1234' 'ai.call' 'Can you explain this as if it was to a 2 year old?'
```
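Programmatically, resuming a thread amounts to copying the `nats-thread-id` header from an earlier reply onto the next request. A hypothetical helper sketching that follow-up turn:

```go
package main

import "github.com/nats-io/nats.go"

// followUp builds the next conversation turn: it copies the
// nats-thread-id header from an earlier reply onto a new request so
// the service resumes the same thread.
func followUp(lastReply *nats.Msg, prompt, inbox string) *nats.Msg {
	msg := nats.NewMsg("ai.call")
	msg.Reply = inbox
	msg.Data = []byte(prompt)
	msg.Header.Set("nats-thread-id", lastReply.Header.Get("nats-thread-id"))
	return msg
}
```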