Skip to content
View staytactical's full-sized avatar

Organizations

@inu-appcenter

Block or report staytactical

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 4 1 Updated Jan 30, 2025

Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"

Python 273 24 Updated Mar 4, 2025

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Python 190 26 Updated Sep 20, 2024

Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same-tokenizer and cross-tokenizer LLM distillation.

Python 42 5 Updated Nov 6, 2024

Awesome LLM pruning papers all-in-one repository with integrating all useful resources and insights.

73 4 Updated Dec 7, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 962 112 Updated Oct 7, 2024

GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models

Python 19 3 Updated Jul 12, 2023

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

886 52 Updated Feb 27, 2025

Survey on LLM Agents (Published on CoLing 2025)

116 4 Updated Mar 5, 2025

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 251 19 Updated Oct 8, 2024

Fully open reproduction of DeepSeek-R1

Python 22,313 1,999 Updated Mar 7, 2025

SciQAG is a novel framework for automatically generating high-quality science question-answer pairs from a large corpus of scientific literature using large language models.

Jupyter Notebook 16 2 Updated Jul 22, 2024

Chat GPT autoblogger for Wordpress websites

Python 78 21 Updated Sep 16, 2023

한국어 ColBERT 개발

Python 7 Updated Apr 11, 2024

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

466 31 Updated Jun 25, 2024

[ACL 2023] PromptRank: Unsupervised Keyphrase Extraction Using Prompt

Python 51 3 Updated May 16, 2023

🔬 A curated list of awesome LLMs & deep learning strategies & tools in financial market.

3,831 456 Updated Dec 20, 2024

Awesome LLM Papers and repos on very comprehensive topics.

208 22 Updated Aug 22, 2024

Awesome list of Korean Large Language Models.

460 31 Updated Oct 31, 2023

list of papers, code, and other resources

986 160 Updated Jul 13, 2024

[ACL 2021] Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"

Python 75 14 Updated Aug 27, 2021

Easy and Efficient Quantization for Transformers

C++ 192 15 Updated Feb 7, 2025
Jupyter Notebook 744 165 Updated Aug 26, 2024

Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.

Shell 144 29 Updated Jul 3, 2020

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,439 1,044 Updated Mar 7, 2025

Code for "WR-One2Set: Towards Well-Calibrated Keyphrase Generation" (EMNLP 2022)

8 3 Updated Dec 25, 2022

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,253 4,282 Updated Mar 6, 2025

Data for ArXiv 2023 paper "Is ChatGPT A Good Keyphrase Generator? A Preliminary Study".

6 Updated Jan 8, 2024

Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023

Python 11 Updated Dec 18, 2023
Next
Showing results