Skip to content
View charlesCXK's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@HRNet @Atten4Vis

Block or report charlesCXK

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 90 5 Updated Feb 21, 2025

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 2,906 179 Updated Mar 9, 2025

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 951 60 Updated Feb 25, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,445 1,656 Updated Feb 26, 2025

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,228 59 Updated Nov 22, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,626 2,181 Updated Feb 1, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 1,971 287 Updated Feb 28, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 706 36 Updated Feb 24, 2025

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

Python 369 17 Updated Jan 14, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,822 502 Updated Sep 25, 2024

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,180 216 Updated Mar 9, 2025

Schedule-Free Optimization in PyTorch

Python 2,108 71 Updated Feb 28, 2025

Annotated version of the Mamba paper

Jupyter Notebook 474 18 Updated Feb 27, 2024

VideoSys: An easy and efficient system for video generation

Python 1,939 131 Updated Mar 9, 2025

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,113 241 Updated Mar 6, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 33,958 2,451 Updated Mar 9, 2025

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,467 234 Updated Jun 14, 2024

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,823 129 Updated Aug 20, 2024

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,073 273 Updated Jan 10, 2025

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,247 46 Updated Dec 11, 2024

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,718 134 Updated Feb 19, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,057 4,648 Updated Mar 1, 2025

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 964 44 Updated Aug 12, 2024

Recent LLM-based CV and related works. Welcome to comment/contribute!

857 38 Updated Mar 8, 2025

Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"

Python 46 3 Updated Nov 16, 2023

✨✨Latest Advances on Multimodal Large Language Models

14,167 913 Updated Mar 5, 2025

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

10,338 501 Updated May 5, 2024

[ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS

Python 110 3 Updated Mar 18, 2024

[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design

Python 195 4 Updated Nov 14, 2023
Next
Showing results