data:image/s3,"s3://crabby-images/9b100/9b1003513d06e6a9bd4394719c58d45fefc9c625" alt="Screenshot 2024-10-23 at 11 49 51 AM"
Swift framework exploring ergonomic, lightweight multi-agent orchestration. Highly inspired by OpenAI library swarm.
The primary goal is just exploratory and a starting point for Swift engineers to explore Agents in their apps. SwiftSwarm is open source and accepts contributions PR's are more than welcome!
SwiftSwarm focuses on making agent coordination and execution lightweight. Its main use case is to enable seamless transitions between agents in a conversation.
It accomplishes this through two abstractions: AgentRepresentable and ToolResponseHandler. An AgentRepresentable contains an Agent that encompasses instructions and tools. At any point, it can hand off a conversation to another Agent through the ToolResponseHandler.
Same as Swarm OpenAI library, SwiftSwarm Agents are not related to Assistants in the Assistants API. They are named similarly for convenience, but are otherwise completely unrelated. SwiftSwarm is entirely powered by the Chat Completions API and is hence stateless between calls.
To see demonstrations of state management in iOS applications, visit the Examples folder in this project. These examples are demos only and are not intended for production use.
Note: Currently, only streaming is supported. While the project uses the OpenAI API exclusively, Claude support could be added to the roadmap if there is sufficient interest.
- Open your Swift project in Xcode.
- Go to
File
->Add Package Dependency
. - In the search bar, enter this URL.
- Choose the main branch. (see the note below).
- Click
Add Package
.
Note: Xcode has a quirk where it defaults an SPM package's upper limit to 2.0.0. This package is beyond that
limit, so you should not accept the defaults that Xcode proposes. Instead, enter the lower bound of the
release version that you'd like to support, and then
tab out of the input box for Xcode to adjust the upper bound. Alternatively, you may select branch
-> main
to stay on the bleeding edge.
/// A structure representing an agent in the system.
///
/// The `Agent` structure contains the essential properties required to define
/// an agent, including the model it uses, its instructions, and its available tools.
public struct Agent {
public var name: String
public var model: Model
public var instructions: String
public var tools: [ChatCompletionParameters.Tool]
}
SwiftSwarm enables autonomous agents through two main interfaces: AgentRepresentable
and ToolResponseHandler
public protocol AgentRepresentable: CaseIterable, RawRepresentable where RawValue == String {
/// The `Agent` that contains all tools for agent orchestration.
var agent: Agent { get }
/// The base definition for this agent type.
///
/// This property allows each conforming type to provide its base configuration
/// such as model, instructions, and custom tools.
/// This should only be used internally - consumers should always use the `agent`
/// property when making run requests.
var agentDefinition: AgentDefinition { get }
}
public protocol ToolResponseHandler {
/// The type of agent that this handler works with.
///
/// `AgentType` must conform to the `AgentRepresentable` protocol, ensuring that
/// it can be converted to or from an agent.
associatedtype AgentType: AgentRepresentable
/// Attempts to transfer the tool parameters to a matching agent.
///
/// This method checks the provided parameters to find a suitable agent
/// that matches the given tool keys and values, returning the corresponding agent if found.
///
/// - Parameter parameters: A dictionary of parameters that may contain information
/// for selecting an agent.
/// - Returns: An optional `Agent` that matches the parameters or `nil` if no match is found.
func transferToAgent(_ parameters: [String: Any]) -> Agent?
/// Handles the tool response content asynchronously.
///
/// Given a set of parameters, this method processes the content generated by the tools
/// and returns the resulting string asynchronously.
///
/// - Parameter parameters: A dictionary of parameters containing tool inputs.
/// - Returns: A string representing the tool's response content.
/// - Throws: Any errors that may occur during content handling.
func handleToolResponseContent(parameters: [String: Any]) async throws -> String?
}
In this section, we will walk through step by step what to do to use SwiftSwarm
First, define your agents using an enum with a String
raw value that conforms to AgentRepresentable
. Here's an example:
enum Team: String {
case engineer = "Engineer"
case designer = "Designer"
case product = "Product"
}
In this example, we have a Team enum with 3 agents. These agents can transfer the conversation to a new agent when the user's question requires specialized expertise.
Now let's use the AgentRepresentable protocol to define the agents. The final implementation will look like this:
enum Team: String, AgentRepresentable {
case engineer = "Engineer"
case designer = "Designer"
case product = "Product"
var agentDefinition: AgentDefinition {
switch self {
case .engineer:
.init(agent: Agent(
name: self.rawValue,
model: .gpt4o,
instructions: "You are a technical engineer, if user asks about you, you answer with your name \(self.rawValue)",
tools: [])) // <---- you can define specific tools for each agent.
case .designer:
.init(agent: Agent(
name: self.rawValue,
model: .gpt4o,
instructions: "You are a UX/UI designer, if user asks about you, you answer with your name \(self.rawValue)",
tools: [])) // <---- you can define specific tools for each agent.
case .product:
.init(agent: Agent(
name: self.rawValue,
model: .gpt4o,
instructions: "You are a product manager, if user asks about you, you answer with your name \(self.rawValue)",
tools: [])) // <---- you can define specific tools for each agent.
}
}
}
Note: You can define specific tools for each agent.
Now that we have defined our agents, we need to create an object that conforms to ToolResponseHandler. Here's an example:
struct TeamDemoResponseHandler: ToolResponseHandler {
/// 1.
typealias AgentType = Team
/// 2.
func handleToolResponseContent(
parameters: [String: Any])
async throws -> String?
{
/// 3.
}
}
-
The
ToolResponseHandler
associated type must be an enum that conforms to AgentRepresentable. -
The
handleToolResponseContent
function is triggered when a function call associated with your agent tools occurs. This is where you implement custom functionality based on the provided parameters.Note: The
ToolResponseHandler
automatically manages agent switching using thetransferToAgent
function in a protocol extension. -
When a function call is triggered, you can retrieve values using the keys defined in your tool definition's parameter properties. For instructions in how to define a tool got to
SwiftOpenAI
function call documentation.
SwiftSwarm relies on SwiftOpenAI to manage communication with OpenAI APIs. A Swarm object requires two dependencies:
- An instance of
OpenAIService
- An instance of
ToolResponseHandler
The code will look like this:
import SwiftOpenAI
import SwiftSwarm
let apiKey = "MY_API_KEY"
let openAIService = OpenAIServiceFactory.service(apiKey: apiKey)
let toolResponseHandler = TeamDemoResponseHandler()
let swarm = Swarm(client: openAIService, toolResponseHandler: toolResponseHandler)
let userMessage = "I need design input"
let message = ChatCompletionParameters.Message(role: .user, content: .text(userMessage))
var currentAgent = Team.engineer.agent
let streamChunks = await swarm.runStream(agent: currentAgent, messages: [message])
for try await streamChunk in streamChunks {
content += streamChunk.content ?? ""
if let response = streamChunk.response {
currentAgent = response.agent <--- Transfer to designer here.
}
}
print("Switched to: \(currentAgent.name)")
print("Content: \(content)")
This will print:
Switched to: Designer
Content: As a UX/UI designer, I'm here to help you with your design needs. What specific aspect of design do you need input on? Whether it's user interface design, user experience strategy, or even color schemes and typography, feel free to let me know!
Note: SwiftSwarm is stateless. For examples of how to manage agent conversations with state, please visit the examples folder in this repository.
This project uses SwiftOpenAI as a dependency, therefore an OpenAI API Key is needed. Go to SwiftOpenAI documentation for more information.