ZHANG Jipeng

OldFriends

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

upvoted a paper 2 months ago

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

upvoted a paper 5 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27 • 96

upvoted a paper 2 months ago

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Paper • 2510.04996 • Published Oct 6 • 15

upvoted a paper 5 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30 • 89

upvoted a paper 6 months ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

Paper • 2506.18945 • Published Jun 23 • 40

upvoted a paper 7 months ago

Fractured Chain-of-Thought Reasoning

Paper • 2505.12992 • Published May 19 • 23

upvoted a paper 8 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 93

upvoted 2 papers 9 months ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 56

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 83

upvoted 4 collections 10 months ago

upvoted a collection 11 months ago

UI Agent

Collection

a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robotics • 435 items • Updated 4 days ago • 65

upvoted 2 papers 11 months ago

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Paper • 2411.18203 • Published Nov 27, 2024 • 41

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published Dec 27, 2024 • 87

liked 2 datasets about 1 year ago

QuixiAI/Code-290k-ShareGPT-Vicuna

Viewer • Updated Feb 12, 2024 • 289k • 135 • 17

Sterzhang/PVIT-3M

Viewer • Updated Nov 2, 2024 • 3M • 3.94k • 18

upvoted a collection about 1 year ago

MIT Talk 31/10 Papers

Collection

14 items • Updated Oct 28, 2024 • 32

updated a model about 1 year ago

OldFriends/llava-critic-7b-hf

Image-to-Text • 8B • Updated Oct 30, 2024 • 4

upvoted a collection about 1 year ago

LLaVA-Critic

Collection

as a general evaluator for assessing model performance • 6 items • Updated Oct 6, 2024 • 10

ZHANG Jipeng

AI & ML interests

Recent Activity

Organizations

OldFriends's activity