-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 504 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 15 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
Universal Language Model Fine-tuning for Text Classification
Paper • 1801.06146 • Published • 8
Lukas Hein
LukasHein
·
AI & ML interests
Deep Learning in general
Organizations
None yet
Papers
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 504 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 15 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 63 -
Universal Language Model Fine-tuning for Text Classification
Paper • 1801.06146 • Published • 8
BERT Models
models
0
None public yet
datasets
0
None public yet