EliteKV

【paper】EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection

🚧 Work in Progress 🚧 This repository is under active development. Feedback is welcome! More updates are coming soon, so stay tuned!

Dataset

RefinedWeb

Model

LLaMA2-7B and LLaMA2-13B

Start

Installation

conda create --name EliteKV python=3.10 -y
conda activate EliteKV
git clone --depth 1 https://github.com/CiaranZhou/EliteKV.git
cd EliteKV
pip install -r requirements.txt

Quickstart

RoPElite

bash RoPElite/cal_then_rank.sh

Dimension Allocation

python dimension_allocation/allocation_ppl.py \
    --model_path path/to/your/model \
    --data_path path/to/your/data \
    --file_path RoPElite/rank/RoPElite_1.pkl \
    --start 1 \
    --end 32 \
    --eval_iters 32

You can visualize the results with the notebook dimension_allocation/draw_fig.ipynb.
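The layout of the ranking file passed via --file_path is not documented here. Below is a minimal sketch for inspecting it before running the allocation script, assuming only that it is a standard Python pickle. The helper name describe_pickle is hypothetical and not part of this repository.

```python
import pickle
from pathlib import Path


def describe_pickle(path):
    """Load a pickle file and return a short summary of its top-level object."""
    with Path(path).open("rb") as f:
        obj = pickle.load(f)  # only unpickle files you generated or trust
    summary = {"type": type(obj).__name__}
    # Report a length for sized containers (list, dict, tuple, ...).
    if hasattr(obj, "__len__"):
        summary["len"] = len(obj)
    # For dicts, list the first few keys to reveal the structure.
    if isinstance(obj, dict):
        summary["keys"] = list(obj)[:5]
    return summary


if __name__ == "__main__":
    # Path taken from the command above; skipped silently if it does not exist yet.
    target = Path("RoPElite/rank/RoPElite_1.pkl")
    if target.exists():
        print(describe_pickle(target))
```

If the file stores framework objects such as tensors, the corresponding package must be installed in the active environment for unpickling to succeed.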

Model Conversion

python convert/convert.py \
    --model_path path/to/model \
    --pe_mode EliteKV \
    --half_of_rope_dim 12 \
    --kv_dim 2048 \
    --save_dir convert/model
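To get a feel for what --half_of_rope_dim and --kv_dim mean for memory, here is a rough back-of-the-envelope sketch. It assumes (this is an interpretation, not something stated in this README) that the converted model caches, per token and per layer, 2 * half_of_rope_dim rotated key dimensions for each attention head plus one shared low-rank latent of size kv_dim, and compares that with a standard multi-head KV cache. The function name kv_cache_ratio is hypothetical.

```python
def kv_cache_ratio(hidden_size, num_heads, half_of_rope_dim, kv_dim):
    """Estimated per-token, per-layer cache size relative to a standard KV cache.

    ASSUMPTION: the compressed cache holds 2 * half_of_rope_dim rotated key
    dimensions per head plus one shared latent vector of size kv_dim.
    """
    baseline = 2 * hidden_size  # full keys + full values (multi-head attention)
    compressed = 2 * half_of_rope_dim * num_heads + kv_dim
    return compressed / baseline


# LLaMA2-7B: hidden size 4096, 32 attention heads.
ratio = kv_cache_ratio(hidden_size=4096, num_heads=32,
                       half_of_rope_dim=12, kv_dim=2048)
print(f"Estimated KV cache ratio: {ratio:.2%}")
```

Under these assumptions, the settings in the command above would keep roughly a third of the original cache. Check the paper for the exact accounting.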

Acknowledgements

Some code in this project is adapted from transformers. We train models using Llama-Factory, an easy and efficient LLM training framework. We evaluate our method using lm-evaluation-harness.

Citation

@misc{zhou2025elitekvscalablekvcache,
      title={EliteKV: Scalable KV Cache Compression via RoPE Frequency Selection and Joint Low-Rank Projection}, 
      author={Yuhao Zhou and Sirui Song and Boyang Liu and Zhiheng Xi and Senjie Jin and Xiaoran Fan and Zhihao Zhang and Wei Li and Xuanjing Huang},
      year={2025},
      eprint={2503.01586},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2503.01586}, 
}
