- Continuous Latent Diffusion Language Model
  Paper • 2605.06548 • Published • 80
- Scaling Latent Reasoning via Looped Language Models
  Paper • 2510.25741 • Published • 231
- Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
  Paper • 2502.05171 • Published • 156
- Pretraining Language Models to Ponder in Continuous Space
  Paper • 2505.20674 • Published • 3
Collections
Collections including paper arxiv:2605.06548
- Continuous Latent Diffusion Language Model
  Paper • 2605.06548 • Published • 80
- SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer
  Paper • 2605.15178 • Published • 84
- A^2RD: Agentic Autoregressive Diffusion for Long Video Consistency
  Paper • 2605.06924 • Published • 15
- Diffusion Policy Policy Optimization
  Paper • 2409.00588 • Published • 20

- ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
  Paper • 2603.25746 • Published • 155
- TAPS: Task Aware Proposal Distributions for Speculative Sampling
  Paper • 2603.27027 • Published • 144
- Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
  Paper • 2603.25716 • Published • 156
- LongCat-Next: Lexicalizing Modalities as Discrete Tokens
  Paper • 2603.27538 • Published • 147

- mHC: Manifold-Constrained Hyper-Connections
  Paper • 2512.24880 • Published • 328
- Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process
  Paper • 2512.23988 • Published • 19
- SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
  Paper • 2512.25075 • Published • 16
- Guiding a Diffusion Transformer with the Internal Dynamics of Itself
  Paper • 2512.24176 • Published • 8

- FAN: Fourier Analysis Networks
  Paper • 2410.02675 • Published • 29
- Tensor Product Attention Is All You Need
  Paper • 2501.06425 • Published • 91
- Scalable-Softmax Is Superior for Attention
  Paper • 2501.19399 • Published • 25
- EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling
  Paper • 2502.09509 • Published • 9

- MolmoAct2: Action Reasoning Models for Real-world Deployment
  Paper • 2605.02881 • Published • 345
- LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding
  Paper • 2605.27365 • Published • 116
- Continuous Latent Diffusion Language Model
  Paper • 2605.06548 • Published • 80

- iioos/llm-evaluation-model
  Updated
- Continuous Latent Diffusion Language Model
  Paper • 2605.06548 • Published • 80
- ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration
  Paper • 2605.03042 • Published • 124
- CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence
  Paper • 2605.12882 • Published • 269

- Multi-agent cooperation through in-context co-player inference
  Paper • 2602.16301 • Published • 24
- TIDE: Every Layer Knows the Token Beneath the Context
  Paper • 2605.06216 • Published • 9
- Continuous Latent Diffusion Language Model
  Paper • 2605.06548 • Published • 80
- Hallucinations Undermine Trust; Metacognition is a Way Forward
  Paper • 2605.01428 • Published • 24

- LTX-2: Efficient Joint Audio-Visual Foundation Model
  Paper • 2601.03233 • Published • 180
- MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head
  Paper • 2601.07832 • Published • 53
- Motion Attribution for Video Generation
  Paper • 2601.08828 • Published • 72
- Post-LayerNorm Is Back: Stable, ExpressivE, and Deep
  Paper • 2601.19895 • Published • 27