Kairun Wen

kairunwen

https://kairunwen.github.io/

AI & ML interests

Computer Vision, Machine Learning

Recent Activity

updated a dataset about 1 month ago

kairunwen/d4

upvoted a paper 5 days ago

RynnWorld-4D: 4D Embodied World Models for Robotic Manipulation

Paper • 2607.06559 • Published 8 days ago • 92

upvoted a paper about 1 month ago

InterleaveThinker: Reinforcing Agentic Interleaved Generation

Paper • 2606.13679 • Published Jun 11 • 83

upvoted 2 papers about 2 months ago

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

Paper • 2605.21487 • Published May 20 • 23

PhysBrain 1.0 Technical Report

Paper • 2605.15298 • Published May 14 • 145

upvoted 2 papers 2 months ago

Qwen-Image-2.0 Technical Report

Paper • 2605.10730 • Published May 11 • 117

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published Apr 30 • 43

upvoted a paper 4 months ago

Thinking in Dynamics: How Multimodal Large Language Models Perceive, Track, and Reason Dynamics in Physical 4D World

Paper • 2603.12746 • Published Mar 13 • 1

upvoted 2 papers 5 months ago

SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation

Paper • 2602.02402 • Published Feb 2 • 33

HY3D-Bench: Generation of 3D Assets

Paper • 2602.03907 • Published Feb 3 • 24

upvoted a paper 6 months ago

InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

Paper • 2601.02281 • Published Jan 5 • 33

upvoted 4 papers 7 months ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

Paper • 2512.16793 • Published Dec 18, 2025 • 76

JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Paper • 2511.23002 • Published Nov 28, 2025 • 26

Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation

Paper • 2512.08186 • Published Dec 9, 2025 • 23

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Paper • 2512.03000 • Published Dec 2, 2025 • 37

upvoted a paper 9 months ago

ARTDECO: Towards Efficient and High-Fidelity On-the-Fly 3D Reconstruction with Structured Scene Representation

Paper • 2510.08551 • Published Oct 9, 2025 • 34

upvoted a paper 12 months ago

StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling

Paper • 2507.05240 • Published Jul 7, 2025 • 49

upvoted 4 papers about 1 year ago

IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering

Paper • 2506.23329 • Published Jun 29, 2025 • 8

JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Paper • 2506.17612 • Published Jun 21, 2025 • 65

PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published Jun 5, 2025 • 83

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published Jun 9, 2025 • 52
