Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 6 days ago • 82
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models Paper • 2512.24618 • Published 6 days ago • 114
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents Paper • 2512.22322 • Published 10 days ago • 37
LTD-Bench: Evaluating Large Language Models by Letting Them Draw Paper • 2511.02347 • Published Nov 4, 2025 • 8 • 1
Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning Paper • 2509.22601 • Published Sep 26, 2025 • 29
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise Paper • 2312.12436 • Published Dec 19, 2023 • 15
Masked Autoencoders are Efficient Class Incremental Learners Paper • 2308.12510 • Published Aug 24, 2023
Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion Paper • 2009.05757 • Published Sep 12, 2020
Woodpecker: Hallucination Correction for Multimodal Large Language Models Paper • 2310.16045 • Published Oct 24, 2023 • 17
MMICT: Boosting Multi-Modal Fine-Tuning with In-Context Examples Paper • 2312.06363 • Published Dec 11, 2023 • 1
MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models Paper • 2306.13394 • Published Jun 23, 2023
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis Paper • 2405.21075 • Published May 31, 2024 • 26
CAPro: Webly Supervised Learning with Cross-Modality Aligned Prototypes Paper • 2310.09761 • Published Oct 15, 2023