Skip to content
View lixd's full-sized avatar

Block or report lixd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

AI

27 repositories

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,233 2,714 Updated Mar 3, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 8,827 1,530 Updated Mar 1, 2025

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 9,585 1,125 Updated Mar 3, 2025

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

C++ 6,672 426 Updated Mar 3, 2025

LLM Frontend for Power Users.

JavaScript 11,931 2,871 Updated Mar 3, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 39,993 5,985 Updated Mar 3, 2025

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 14,807 1,716 Updated Mar 2, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

18,628 1,794 Updated Sep 19, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,758 6,057 Updated Mar 3, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,584 1,765 Updated Feb 26, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 42,785 5,224 Updated Mar 3, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 47,351 5,030 Updated Jan 22, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,632 10,698 Updated Mar 3, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,722 1,889 Updated Apr 30, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,803 513 Updated Mar 3, 2025

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 33,852 5,756 Updated Nov 29, 2024

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 37,432 3,627 Updated Feb 27, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 12,702 1,303 Updated Feb 24, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 18,963 2,021 Updated Oct 15, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 76,842 11,191 Updated Mar 3, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Python 1,527 195 Updated Mar 3, 2025

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 20,968 2,574 Updated Mar 3, 2025

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 11,298 999 Updated Mar 3, 2025

The ultimate LLM/AI application development framework in Golang.

Go 1,647 106 Updated Feb 28, 2025

Machine Learning Toolkit for Kubernetes

TypeScript 14,715 2,467 Updated Feb 20, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 12,116 789 Updated Mar 3, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 80,347 9,623 Updated Mar 2, 2025