CODI zen-E/CODI-gpt2 Updated Jun 4, 2025 zen-E/CODI-llama3.2-1b-Instruct Updated Jun 4, 2025 CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation Paper • 2502.21074 • Published Feb 28, 2025 • 4 zen-E/GSM8k-Aug Viewer • Updated Apr 16, 2025 • 387k • 5.25k • 3
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation Paper • 2502.21074 • Published Feb 28, 2025 • 4
SSA zen-E/SSA-1B 1B • Updated Jan 30 • 3 SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space Paper • 2511.20102 • Published Nov 25, 2025 • 28 EleutherAI/SmolLM-135M-100b Viewer • Updated Mar 18, 2025 • 109M • 260 • 2 zen-E/FullAttn-1B 1B • Updated Jan 30 • 2
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space Paper • 2511.20102 • Published Nov 25, 2025 • 28
CODI zen-E/CODI-gpt2 Updated Jun 4, 2025 zen-E/CODI-llama3.2-1b-Instruct Updated Jun 4, 2025 CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation Paper • 2502.21074 • Published Feb 28, 2025 • 4 zen-E/GSM8k-Aug Viewer • Updated Apr 16, 2025 • 387k • 5.25k • 3
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation Paper • 2502.21074 • Published Feb 28, 2025 • 4
SSA zen-E/SSA-1B 1B • Updated Jan 30 • 3 SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space Paper • 2511.20102 • Published Nov 25, 2025 • 28 EleutherAI/SmolLM-135M-100b Viewer • Updated Mar 18, 2025 • 109M • 260 • 2 zen-E/FullAttn-1B 1B • Updated Jan 30 • 2
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space Paper • 2511.20102 • Published Nov 25, 2025 • 28