Think Just Enough: Sequence-Level Entropy as a Confidence Signal for LLM Reasoning
Paper • 2510.08146 • Published • 1
None defined yet.
ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?
METIS: Mentoring Engine for Thoughtful Inquiry & Solutions