
    Repositories list

    • A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
      Python · Apache License 2.0 · Updated Mar 4, 2025
    • gpustack
      Manage GPU clusters for running LLMs
      Python · Apache License 2.0 · Updated Mar 4, 2025
    • ollama
      Get up and running with Llama 3, Mistral, Gemma, and other large language models.
      Go · MIT License · Updated Feb 27, 2025
    • llama-box
      LLM inference server implementation based on llama.cpp.
      C++ · MIT License · Updated Feb 16, 2025
    • Stable Diffusion and Flux in pure C/C++
      C++ · MIT License · Updated Feb 14, 2025
    • llama.cpp
      LLM inference in C/C++
      C++ · MIT License · Updated Feb 13, 2025
    • exo
      Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
      Python · GNU General Public License v3.0 · Updated Nov 28, 2024
    • Python bindings for llama.cpp
      Python · MIT License · Updated Nov 26, 2024
    • fastfetch
      Like neofetch, but much faster because it is written mostly in C.
      C · MIT License · Updated Nov 19, 2024
    • vllm
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python · Apache License 2.0 · Updated Oct 16, 2024
    • k8sgpt
      Giving Kubernetes Superpowers to everyone
      Go · Apache License 2.0 · Updated Sep 24, 2024
    • Automatic SRE Superpowers within your Kubernetes cluster
      Go · Apache License 2.0 · Updated Jul 31, 2024
    • llm.c
      LLM training in simple, raw C/CUDA
      Cuda · MIT License · Updated Jul 22, 2024
    • A proxy that allows you to host Ollama images in your local environment
      Go · MIT License · Updated Jul 2, 2024
    • LLM Benchmark for Throughput via Ollama (Local LLMs)
      Python · MIT License · Updated Jun 11, 2024
    • makllama
      MaK (Mac + Kubernetes) llama: running and orchestrating large language models (LLMs) on Kubernetes with macOS nodes.
      Go · Apache License 2.0 · Updated May 22, 2024
    • An open and reliable container runtime
      Go · Apache License 2.0 · Updated May 22, 2024
    • .github
      Updated May 21, 2024
    • cri
      Go · Updated May 21, 2024