Is there a way to change the generation behavior to something other than Streaming?
mlx_lm has two methods: the normal "generate" method and the streaming "stream_generate" method.
I want to output structured data, so streaming generation is not ideal for my use case.
If there is a way, please let me know.
Thank you.
The didGenerate callback is indeed how you observe (and control) the stream as it is being generated. If you only want the final output, you can ignore the intermediate tokens and consume the returned result, which gives you the full output text or tokens.
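
To illustrate the pattern, here is a minimal sketch in Python. It does not use the real library API: `generate`, `did_generate`, and `fake_model_output` are hypothetical stand-ins, with `did_generate` playing the role of the didGenerate callback. It only shows that a callback-driven generator can still hand back the complete output, which you can then parse as structured data.

```python
import json
from typing import Callable, Iterable, List


def generate(tokens: Iterable[str],
             did_generate: Callable[[List[str]], bool]) -> str:
    """Hypothetical stand-in for a streaming generator.

    Calls did_generate after every token (the callback may return False
    to stop early) and returns the full text once generation ends.
    """
    produced: List[str] = []
    for token in tokens:
        produced.append(token)
        if not did_generate(produced):
            break
    return "".join(produced)


# Assumed token stream; a real model would produce these one at a time.
fake_model_output = ['{"name"', ': ', '"mlx"', '}']

# Ignore the intermediate tokens and use only the final result.
result = generate(fake_model_output, lambda produced: True)
parsed = json.loads(result)

# The callback can still be used to cut generation short.
truncated = generate(fake_model_output, lambda produced: len(produced) < 2)
```

With this shape, the streaming callback becomes optional bookkeeping: the structured data is parsed once from the complete result rather than from partial chunks.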