Instruction-Tuning

This repository is the implementation of HW3 for CSIE Applied Deep Learning in 2024 Fall.

Setting the Environment

To set the environment for inferencing, run this command:

pip install -r requirements.txt

To set the environment for Training, run this command:

pip install -r requirements_qlora.txt

Download LoRA checkpoint

To download the checkpoint of LoRA, run the command:

bash ./download.sh

Reproducing

To reproduce the result, run the command:

bash ./run.sh <pretrain model path> <lora checkpoint path> <input data path> <output file path>

Note: I use zake7749/gemma-2-2b-it-chinese-kyara-dpo for fine-tuning.

For example:

python inference.py --model_path zake7749/gemma-2-2b-it-chinese-kyara-dpo \
                --adapt_checkpoint_path adapter_checkpoint/ \
                --input_path data/private_test.json \
                --output_path prediction.json

Training

To fine-tune the pre-trained model, run the command:

python qlora.py --dataset data/train.json \
				--model_name_or_path zake7749/gemma-2-2b-it-chinese-kyara-dpo \
				--learning_rate 0.0008 \
				--per_device_eval_batch_size 8\
 				--per_device_train_batch_size 8\
 				--max_steps 5000\

You can adjust and add any parameters as long as qlora.py accepts it.

Operating Environment

The training was conducted on Colab Pro, with A100 and 80GB RAM. Check train_colab.ipynb for more information about the implementaion on Google Colab.

Please note that the default version of many packages on Colab is not compatible with the reruirements of qlora.py. If any problem is encountered, utilize the commented "pip install" commands.

Reference

Qlora : https://github.com/artidoro/qlora
zake7749/Gemma-2 :https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
.gitignore		.gitignore
README.md		README.md
download.sh		download.sh
eval.sh		eval.sh
inference.py		inference.py
ppl.py		ppl.py
prediction.json		prediction.json
preprocess.py		preprocess.py
qlora.py		qlora.py
report.pdf		report.pdf
requirements.txt		requirements.txt
requirements_qlora.txt		requirements_qlora.txt
run.sh		run.sh
train.sh		train.sh
train_colab.ipynb		train_colab.ipynb
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instruction-Tuning

Setting the Environment

Download LoRA checkpoint

Reproducing

Training

Operating Environment

Reference

About

Releases

Packages

Languages

kogby/Instruction-Tuning

Folders and files

Latest commit

History

Repository files navigation

Instruction-Tuning

Setting the Environment

Download LoRA checkpoint

Reproducing

Training

Operating Environment

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages