Reasoning off feature?

#12

by hell0ks - opened 28 days ago

28 days ago

Hello,

I discovered there is "reasoning_effort" feature in chat_template.jinja. When set it to "low" or "minimal", it is designed to turn off reasoning by adding end tokens.

However it doesn't seem to consistent. Sometimes it emit reasoning behavior even with reasoning_effort = low.

I'd like to know if it is "designed feature" or some kind of leftover.

Thanks.

SSON9

upstage org about 19 hours ago

Model responses may fluctuate from time to time (which is not intended).
We recommend running inference with vLLM and logits processors.
Please refer to the vLLM section in the README.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment