7581 6

GuoLiangTang

Tommy930

https://github.com/TommyTang930

AI & ML interests

LLM，NLP，ML

Recent Activity

upvoted a paper 1 day ago

Emergent Misalignment Can Be Induced by Sycophancy and Reversed via Alignment Gating

upvoted a paper 1 day ago

ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

upvoted a paper 1 day ago

Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

View all activity

Organizations

None yet

upvoted 18 papers 1 day ago

Emergent Misalignment Can Be Induced by Sycophancy and Reversed via Alignment Gating

Paper • 2606.09068 • Published 4 days ago • 4

ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations

Paper • 2606.11188 • Published 3 days ago • 23

Struct-Searcher: Agentic Structural Thinking Advances Multimodal Deep Information Seeking

Paper • 2606.07689 • Published 7 days ago • 5

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 7 days ago • 52

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 4 days ago • 28

SCAIL-2: Unifying Controlled Character Animation with End-to-end In-Context Conditioning

Paper • 2606.10804 • Published 3 days ago • 35

MemDreamer: Decoupling Perception and Reasoning for Long Video Understanding via Hierarchical Graph Memory and Agentic Retrieval Mechanism

Paper • 2606.07512 • Published 7 days ago • 36

ABot-Earth 0.5: Generative 3D Earth Model

Paper • 2606.09967 • Published 4 days ago • 204

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

Paper • 2606.10917 • Published 2 days ago • 73

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 4 days ago • 49

Online Skill Learning for Web Agents via State-Grounded Dynamic Retrieval

Paper • 2606.04391 • Published 9 days ago • 10

Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory

Paper • 2606.09365 • Published 3 days ago • 2

AsyncWebRL: Efficient Multi-Step RL for Visual Web Agents

Paper • 2606.05597 • Published 8 days ago • 4

DuMate-DeepResearch: An Auditable Multi-Agent System with Recursive Search and Rubric-Grounded Reasoning

Paper • 2606.07299 • Published 7 days ago • 6

Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill

Paper • 2606.03980 • Published 10 days ago • 13

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

Paper • 2606.06087 • Published 8 days ago • 59

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Paper • 2606.07297 • Published 7 days ago • 108

Agents' Last Exam

Paper • 2606.05405 • Published 9 days ago • 254

upvoted 2 papers 3 days ago

OASIS: From Simulation Data Collection to Real-World Humanoid Loco-Manipulation

Paper • 2606.08548 • Published 5 days ago • 2

Echo-Memory: A Controlled Study of Memory in Action World Models

Paper • 2606.09803 • Published 4 days ago • 31

GuoLiangTang

AI & ML interests

Recent Activity

Organizations

Tommy930's activity