Chinmaya-Kausik/RLHF-comparison

A comparison of various RLHF methods for instruction-tuning LLMs, built on top of Hugging Face TRL.

You can find the project website here.