GitHub - subhendukhatuya/LLMLingua: To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Setup

pip install -r requirements.txt

To compress lmsys-data and push to hf-hub (already pushed)

cd lmsys_data
python create_data.py

To compress arxiv-data and push to hf-hub (not pushed, also please change HF profile to the organization instead of mitramango)

cd arxiv_data
python create_arxiv_data.py --action push_hf

To test compression results on arxiv-data (5 examples) and save csv

cd arxiv_data
python create_arxiv_data.py --action save_csv

To use LLMLingua2 for compression (Python Code)

import sys
import os

sys.path.append(os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
from llmlingua import PromptCompressor

llm_lingua2 = PromptCompressor(
    model_name="microsoft/llmlingua-2-xlm-roberta-large-meetingbank",
    use_llmlingua2=True,
) # for LLMLingua2-large

llm_lingua2_small = PromptCompressor(
    model_name="microsoft/llmlingua-2-bert-base-multilingual-cased-meetingbank",
    use_llmlingua2=True,  # Whether to use llmlingua-2
) # for LLMLingua2-small

prompt = "Input your prompt here"

compress_results = llm_lingua2.compress_prompt(prompt)

Name		Name	Last commit message	Last commit date
Latest commit History 114 Commits
.github		.github
arxiv_data		arxiv_data
examples		examples
experiments/llmlingua2		experiments/llmlingua2
images		images
llmlingua		llmlingua
lmsys_data		lmsys_data
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
DOCUMENT.md		DOCUMENT.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
Transparency_FAQ.md		Transparency_FAQ.md
compare_llmlingua_1v2.csv		compare_llmlingua_1v2.csv
prompt_hardest.txt		prompt_hardest.txt
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py
test_setup.py		test_setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Setup

To compress lmsys-data and push to hf-hub (already pushed)

To compress arxiv-data and push to hf-hub (not pushed, also please change HF profile to the organization instead of mitramango)

To test compression results on arxiv-data (5 examples) and save csv

To use LLMLingua2 for compression (Python Code)

About

Releases

Packages

Languages

License

subhendukhatuya/LLMLingua

Folders and files

Latest commit

History

Repository files navigation

Setup

To compress lmsys-data and push to hf-hub (already pushed)

To compress arxiv-data and push to hf-hub (not pushed, also please change HF profile to the organization instead of mitramango)

To test compression results on arxiv-data (5 examples) and save csv

To use LLMLingua2 for compression (Python Code)

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages