
Add Cloud RMs #173

Open
natolambert opened this issue Sep 6, 2024 · 4 comments
Labels
New Model Add a new model to the leaderboard/codebase

Comments

@natolambert
Collaborator

See @zankner's repo https://github.com/zankner/CLoud — RMs that think out loud!

@natolambert natolambert added the New Model Add a new model to the leaderboard/codebase label Sep 6, 2024
@scottsuk0306

Hi @natolambert, I recently trained a generative RM (prometheus-eval/prometheus-RM-Llama-8B-v1.0) based on the CLoud codebase, and I think inference using Hugging Face transformers could be integrated easily into the existing reward-bench code. Can I try working on this?
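(For context: CLoud-style generative RMs emit a natural-language critique followed by a numeric score, so integration mostly needs a way to pull that score out of generated text. A minimal sketch of such a parser — the `Score:` output format here is an illustrative assumption, not necessarily CLoud's or Prometheus's exact convention:)

```python
import re

def extract_score(generation: str, default: float = 0.0) -> float:
    """Pull the final numeric score from a generative RM's output.

    Assumes the model ends its critique with something like
    '... Score: 4'. The exact format varies by model, so this
    pattern is a guess to be adapted per model, not a fixed API.
    """
    matches = re.findall(r"[Ss]core:?\s*([0-9]+(?:\.[0-9]+)?)", generation)
    # Take the last match so numbers inside the critique text don't win.
    return float(matches[-1]) if matches else default
```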

@natolambert
Collaborator Author

natolambert commented Dec 2, 2024

@scottsuk0306 yes please! I've been curious for a bit and we're building new datasets now

FYI -- my issue was the tokenization timing was specific and would require a bunch of handling or a refactor. Lmk if you figure out something clever.

@zankner

zankner commented Dec 2, 2024

Hi, sorry — I realized I never followed up on this. @scottsuk0306 @natolambert we support HF inference in the CLoud repo (hf-inference), which I can also add. However, HF generate is extremely slow. More recently we added vLLM support, but I'm not sure whether that would be easy to support in your repo; let me know if you have any thoughts.

@natolambert
Collaborator Author

We could try running CLoud RM with the "generative" pipeline, which is a bit different.
Otherwise, I don't think we need vLLM unless we can figure out how to wrap it in the same abstraction — which actually may be possible, because we just do some general model loading.

Either way, curious where we end up.

3 participants