Top-K Sampling Support #59

mikemykhaylov · 2024-12-15T19:30:07Z

Hello,

I find that for some models, like Mistral Nemo, it is very beneficial to restrict the number of considered completions to improve the coherence of the output. However, compared to llama.cpp, MLX does not seem to support top K sampling in LM Studio. It does look like the underlying library supports that, so implementing that would be much appreciated.

llama.cpp models support Top-K

MLX models do not support Top-K

neilmehta24 · 2024-12-16T17:07:23Z

Thanks for bringing this to our attention. The MLX core library does indeed support a top-k matrix operation, but the MLX LLM library does not support top-k sampling. Here are the supported generation/sampling options as of today https://github.com/ml-explore/mlx-examples/blob/dfa4dd6/llms/mlx_lm/utils.py#L200-L215 . Please track this issue ml-explore/mlx-examples#1167 for adding support in mlx_lm

mikemykhaylov · 2025-01-21T18:07:43Z

Looks like the upstream PR got merged, any chance we could have the sampler in the MLX engine?

neilmehta24 · 2025-01-21T18:32:47Z

Looks like the upstream PR got merged, any chance we could have the sampler in the MLX engine?

Shouldn't be difficult to add to our app now, I should be able to get this done soon.

neilmehta24 · 2025-01-27T15:49:25Z

@mikemykhaylov FYI this is slightly delayed since we need to update the LM Studio UI to inform users about the limitation described here ml-explore/mlx-examples#1219

neilmehta24 mentioned this issue Dec 16, 2024

[Feature Request] Top-K sampling support in mlx_lm.utils.generate_step ml-explore/mlx-examples#1167

Closed

neilmehta24 added the enhancement New feature or request label Dec 16, 2024

mikemykhaylov mentioned this issue Jan 21, 2025

Add top-k sampling support #80

Merged

neilmehta24 closed this as completed in #80 Jan 21, 2025

neilmehta24 mentioned this issue Jan 21, 2025

[MLX] Add top-k sampling support lmstudio-ai/lmstudio-js#183

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Top-K Sampling Support #59

Top-K Sampling Support #59

mikemykhaylov commented Dec 15, 2024

neilmehta24 commented Dec 16, 2024

mikemykhaylov commented Jan 21, 2025 •

edited

Loading

neilmehta24 commented Jan 21, 2025

neilmehta24 commented Jan 27, 2025

Top-K Sampling Support #59

Top-K Sampling Support #59

Comments

mikemykhaylov commented Dec 15, 2024

neilmehta24 commented Dec 16, 2024

mikemykhaylov commented Jan 21, 2025 • edited Loading

neilmehta24 commented Jan 21, 2025

neilmehta24 commented Jan 27, 2025

mikemykhaylov commented Jan 21, 2025 •

edited

Loading