pipizhao's picture

pipizhao

pipizhao

·

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

pipizhao/Pandalyst_13B_V1.0

updated a dataset 7 days ago

pipizhao/SkillRouter-Eval-Core

published a dataset 7 days ago

pipizhao/SkillRouter-Eval-Core

View all activity

Organizations

upvoted a paper 24 days ago

Rubrics to Tokens: Bridging Response-level Rubrics and Token-level Rewards in Instruction Following Tasks

Paper • 2604.02795 • Published 28 days ago • 4

upvoted a paper 28 days ago

ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents

Paper • 2604.01664 • Published 29 days ago • 8

upvoted 2 papers about 1 month ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 50

SkillRouter: Retrieve-and-Rerank Skill Selection for LLM Agents at Scale

Paper • 2603.22455 • Published Mar 23 • 2