Skip to content
View grimoire's full-sized avatar
🙀
meooow!
🙀
meooow!

Organizations

@open-mmlab

Block or report grimoire

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Datacenter Scale Distributed Inference Serving Framework

Rust 3,193 214 Updated Mar 26, 2025

NVIDIA Inference Xfer Library (NIXL)

C++ 197 16 Updated Mar 26, 2025

DeeperGEMM: crazy optimized version

Cuda 61 Updated Mar 16, 2025

🥗 All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching.

TypeScript 12,429 765 Updated Apr 7, 2024

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,663 282 Updated Mar 10, 2025

A compendium of absurd "open-source" licenses.

1,585 62 Updated Jan 3, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,073 528 Updated Mar 25, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,299 676 Updated Mar 25, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,375 810 Updated Mar 1, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

6,934 228 Updated Mar 4, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 13,103 891 Updated Mar 25, 2025

s1: Simple test-time scaling

Python 6,061 708 Updated Mar 6, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,355 1,438 Updated Mar 10, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,865 2,208 Updated Feb 1, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,828 482 Updated Feb 7, 2025

The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention

Python 2,405 177 Updated Mar 18, 2025

Written in C++ and using SDL, The Powder Toy is a desktop version of the classic 'falling sand' physics sandbox, it simulates air pressure and velocity as well as heat.

C++ 4,749 796 Updated Mar 19, 2025

【您配吗】配你吗

JavaScript 1,734 84 Updated Aug 11, 2024

More relighting!

Python 7,764 476 Updated Feb 20, 2025

Open-source framework to automate DevOps and ITOps using your preferred LLM.

Python 1,304 88 Updated Mar 25, 2025

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,639 426 Updated Mar 18, 2025

Applied AI experiments and examples for PyTorch

Python 250 24 Updated Mar 21, 2025

小猿口算_已达到0.00s

Python 1,363 176 Updated Nov 5, 2024

用于小猿口算的基于Python的自动答题工具

Python 637 75 Updated Oct 11, 2024

A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.

HTML 785 180 Updated Mar 20, 2025

MLIR For Beginners tutorial

C++ 931 83 Updated Feb 7, 2025

Fast low-bit matmul kernels in Triton

Python 271 21 Updated Mar 25, 2025

A repository of Maker Skill Trees and templates to make your own.

Jinja 3,082 147 Updated Mar 24, 2025

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

JavaScript 6,241 629 Updated Jan 8, 2025
Next
Showing results