charlesCXK

🎯

Focusing

Xiaokang Chen charlesCXK

🎯

Focusing

Researcher at DeepSeek AI. <-- Ph.D. student at Peking University

435 followers · 62 following

DeepSeek AI, Peking University
Beijing
charlesCXK.github.io

Achievements

x3 x2

Achievements

x3 x2

Highlights

Organizations

Stars

UmiMarch / OpenVideo

Python 90 5 Updated Feb 21, 2025

tensorzero / tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

Rust 2,906 179 Updated Mar 9, 2025

ZHO-ZHO-ZHO / ComfyUI-DeepSeek-JanusPro

Python 95 8 Updated Feb 21, 2025

magic-research / Sa2VA

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 951 60 Updated Feb 25, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,445 1,656 Updated Feb 26, 2025

apple / ml-aim

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,228 59 Updated Nov 22, 2024

deepseek-ai / Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,626 2,181 Updated Feb 1, 2025

open-compass / VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 1,971 287 Updated Feb 28, 2025

bytedance / 1d-tokenizer

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 706 36 Updated Feb 24, 2025

thunlp / LLaVA-UHD

LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer

Python 369 17 Updated Jan 14, 2025

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,822 502 Updated Sep 25, 2024

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,180 216 Updated Mar 9, 2025

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,108 71 Updated Feb 28, 2025

srush / annotated-mamba

Annotated version of the Mamba paper

Jupyter Notebook 474 18 Updated Feb 27, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,939 131 Updated Mar 9, 2025

showlab / Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, and various other applications.

4,113 241 Updated Mar 6, 2025

unslothai / unsloth

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 33,958 2,451 Updated Mar 9, 2025

luosiallen / latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,467 234 Updated Jun 14, 2024

3DTopia / LGM

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,823 129 Updated Aug 20, 2024

ali-vilab / VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Python 3,073 273 Updated Jan 10, 2025

lxtGH / OMG-Seg

OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,247 46 Updated Dec 11, 2024

Yuliang-Liu / Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,718 134 Updated Feb 19, 2025

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,057 4,648 Updated Mar 1, 2025

chongzhou96 / EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Jupyter Notebook 964 44 Updated Aug 12, 2024

DirtyHarryLYL / LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

857 38 Updated Mar 8, 2025

naver-ai / dual-teacher

Official code for the NeurIPS 2023 paper "Switching Temporary Teachers for Semi-Supervised Semantic Segmentation"

Python 46 3 Updated Nov 16, 2023

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

14,167 913 Updated Mar 5, 2025

ZachGoldberg / Startup-CTO-Handbook

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

10,338 501 Updated May 5, 2024

lxtGH / Tube-Link

[ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS

Python 110 3 Updated Mar 18, 2024

impiga / Plain-DETR

[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design

Python 195 4 Updated Nov 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiaokang Chen charlesCXK

Achievements

Achievements

Highlights

Organizations

Block or report charlesCXK

Stars

UmiMarch / OpenVideo

tensorzero / tensorzero

ZHO-ZHO-ZHO / ComfyUI-DeepSeek-JanusPro

magic-research / Sa2VA

deepseek-ai / DeepSeek-VL2

apple / ml-aim

deepseek-ai / Janus

open-compass / VLMEvalKit

bytedance / 1d-tokenizer

thunlp / LLaVA-UHD

deepseek-ai / DeepSeek-V2

EvolvingLMMs-Lab / lmms-eval

facebookresearch / schedule_free

srush / annotated-mamba

NUS-HPC-AI-Lab / VideoSys

showlab / Awesome-Video-Diffusion

unslothai / unsloth

luosiallen / latent-consistency-model

3DTopia / LGM

ali-vilab / VGen

lxtGH / OMG-Seg

Yuliang-Liu / Monkey

lm-sys / FastChat

chongzhou96 / EdgeSAM

DirtyHarryLYL / LLM-in-Vision

naver-ai / dual-teacher

BradyFU / Awesome-Multimodal-Large-Language-Models

ZachGoldberg / Startup-CTO-Handbook

lxtGH / Tube-Link

impiga / Plain-DETR