Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2605.06548

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80
Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 231
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 156
Pretraining Language Models to Ponder in Continuous Space

Paper • 2505.20674 • Published May 27, 2025 • 3

about 13 hours ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80

about 15 hours ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 16 days ago • 84
A^2RD: Agentic Autoregressive Diffusion for Long Video Consistency

Paper • 2605.06924 • Published 23 days ago • 15
Diffusion Policy Policy Optimization

Paper • 2409.00588 • Published Sep 1, 2024 • 20

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published Mar 27 • 144
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156
LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 328
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Paper • 2512.23988 • Published Dec 30, 2025 • 19
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time

Paper • 2512.25075 • Published Dec 31, 2025 • 16
Guiding a Diffusion Transformer with the Internal Dynamics of Itself

Paper • 2512.24176 • Published Dec 30, 2025 • 8

interesting architecture

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 29
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 91
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31, 2025 • 25
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13, 2025 • 9

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published 26 days ago • 345
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 4 days ago • 116
Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80

LLM Evaluation Frameworks

Collection of LLM Evaluation Frameworks

iioos/llm-evaluation-model

Updated Dec 24, 2025
Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Paper • 2605.03042 • Published 26 days ago • 124
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 17 days ago • 269

Multi-agent cooperation through in-context co-player inference

Paper • 2602.16301 • Published Feb 18 • 24
TIDE: Every Layer Knows the Token Beneath the Context

Paper • 2605.06216 • Published 23 days ago • 9
Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80
Hallucinations Undermine Trust; Metacognition is a Way Forward

Paper • 2605.01428 • Published 28 days ago • 24

Papers I'm going to read

about 21 hours ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 180
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published Jan 12 • 53
Motion Attribution for Video Generation

Paper • 2601.08828 • Published Jan 13 • 72
Post-LayerNorm Is Back: Stable, ExpressivE, and Deep

Paper • 2601.19895 • Published Jan 27 • 27

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80
Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 231
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 156
Pretraining Language Models to Ponder in Continuous Space

Paper • 2505.20674 • Published May 27, 2025 • 3

interesting architecture

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 29
Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11, 2025 • 91
Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published Jan 31, 2025 • 25
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling

Paper • 2502.09509 • Published Feb 13, 2025 • 9

about 13 hours ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published 26 days ago • 345
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 4 days ago • 116
Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80

about 15 hours ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published 16 days ago • 84
A^2RD: Agentic Autoregressive Diffusion for Long Video Consistency

Paper • 2605.06924 • Published 23 days ago • 15
Diffusion Policy Policy Optimization

Paper • 2409.00588 • Published Sep 1, 2024 • 20

LLM Evaluation Frameworks

Collection of LLM Evaluation Frameworks

iioos/llm-evaluation-model

Updated Dec 24, 2025
Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

Paper • 2605.03042 • Published 26 days ago • 124
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 17 days ago • 269

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published Mar 27 • 144
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 156
LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

Multi-agent cooperation through in-context co-player inference

Paper • 2602.16301 • Published Feb 18 • 24
TIDE: Every Layer Knows the Token Beneath the Context

Paper • 2605.06216 • Published 23 days ago • 9
Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 23 days ago • 80
Hallucinations Undermine Trust; Metacognition is a Way Forward

Paper • 2605.01428 • Published 28 days ago • 24

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 328
Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

Paper • 2512.23988 • Published Dec 30, 2025 • 19
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time

Paper • 2512.25075 • Published Dec 31, 2025 • 16
Guiding a Diffusion Transformer with the Internal Dynamics of Itself

Paper • 2512.24176 • Published Dec 30, 2025 • 8

Papers I'm going to read

about 21 hours ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 180
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published Jan 12 • 53
Motion Attribution for Video Generation

Paper • 2601.08828 • Published Jan 13 • 72
Post-LayerNorm Is Back: Stable, ExpressivE, and Deep

Paper • 2601.19895 • Published Jan 27 • 27

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs