Operatio

Operatio is a tool for manipulating language models using task vectors. It's designed to help me learned about the underlying of the task arithmetic and about the mojo lang, it's also a tool to explore large language models through weight manipulation. I might try to implement the papers in the pdf/.

What are Task Vectors?

Task vectors are the difference between the weights of a base model and a fine-tuned model. They capture the "knowledge" or "skills" that a model has acquired for a specific task during fine-tuning. By manipulating these task vectors, we can potentially transfer (or suppress) skills between models or create models with combined capabilities.

What Operatio Does

Operatio provides a suite of operations for working with language models and task vectors:

Load Models: Efficiently load and save model weights.
Extract Difference: Compute the task vector by finding the difference between a base model and a fine-tuned model.
Transform Model: Apply a task vector to a base model, potentially transferring skills or knowledge.
Full Pipeline: Perform all of the above operations in sequence.

Installation

I used magic from modular so I recommand using it too. After having clone this github, simply use magic install then magic shell and you can use the following commands.

Usage

Operatio has a command-line tool with several operations:

mojo run main.mojo <operation> <base_model> [<ft_model>] [--output_dir <dir>] [--scaling_coef <coef>]

Operations:

load: Load and save model weights
extract: Compute the task vector
transform: Apply a task vector to a model
full: Run the complete pipeline

Arguments:

<operation>: The operation to perform (load, extract, transform, or full)
<base_model>: Path or identifier for the base model
<ft_model>: Path or identifier for the fine-tuned model (required for extract and full operations)
--output_dir: Directory to save output files (default: "models")
--scaling_coef: Scaling coefficient for the task vector (default: 0.5)

Examples:

Load models:

mojo run main.mojo load path/to/base/model path/to/finetuned/model

Extract task vector:

mojo run main.mojo extract path/to/base/model path/to/finetuned/model

Transform a model:

mojo run main.mojo transform path/to/base/model --scaling_coef 0.7

Run full pipeline:

mojo run main.mojo full path/to/base/model path/to/finetuned/model --scaling_coef 0.7

License

This project is licensed under the Apache License, Version 2.0. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
img		img
operatio		operatio
pdf		pdf
pyoperatio		pyoperatio
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
magic.lock		magic.lock
main.mojo		main.mojo
mojoproject.toml		mojoproject.toml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Operatio

What are Task Vectors?

What Operatio Does

Installation

Usage

Operations:

Arguments:

Examples:

License

About

Releases

Packages

Languages

License

teilomillet/operatio

Folders and files

Latest commit

History

Repository files navigation

Operatio

What are Task Vectors?

What Operatio Does

Installation

Usage

Operations:

Arguments:

Examples:

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages