In a Training Loop 🔄

Zhenyi Shen

zen-E

https://www.zhenyishen.com/

AI & ML interests

LLM Reasoning

Recent Activity

updated a model 15 days ago

zen-E/qwen3-4b-instruct-grpo-dapo-2epoch-8k

published a model 15 days ago

zen-E/qwen3-4b-instruct-grpo-dapo-2epoch-8k

updated a model 16 days ago

zen-E/qwen3-4b-instruct-grpo-dapo-1epoch-16k

Organizations

None yet

Collections 2

Papers 5

arxiv:2602.03784

arxiv:2512.24618

arxiv:2511.20102

arxiv:2503.01606

Models 28

zen-E/qwen3-4b-instruct-grpo-dapo-2epoch-8k

Updated 15 days ago

zen-E/qwen3-4b-instruct-grpo-dapo-1epoch-16k

Updated 16 days ago

zen-E/qwen3-8b-think-math-step100-opsd

Updated 25 days ago

zen-E/qwen3-8b-think-math-step500-grpo

Updated 25 days ago

zen-E/qwen3-8b-base-math-step700-grpo

zen-E/SSA-1B

1B • Updated Jan 30 • 47

zen-E/FullAttn-1B

1B • Updated Jan 30 • 9

zen-E/MoBA-1B

1B • Updated Jan 30 • 11

zen-E/NSA-1B

1B • Updated Jan 30 • 105

zen-E/opsd_qwen3_1b_hybrid_factor0p01_lennorm_adv_ckpt1160

Datasets 7

zen-E/StrategyQA_CoT_GPT4o

Viewer • Updated May 10, 2025 • 2.04k • 52 • 1

zen-E/StrategyQA_GPT4o_CoTx10

Viewer • Updated May 7, 2025 • 17k • 95

zen-E/CommonsenseQA-GPT4omini

Viewer • Updated May 6, 2025 • 9.42k • 449

zen-E/GSM8k-Aug

Viewer • Updated Apr 16, 2025 • 387k • 2.57k • 4

zen-E/GSM8k-Aug-NL

Viewer • Updated Apr 13, 2025 • 385k • 287 • 1

zen-E/NEWS5M-simcse-roberta-large-embeddings-pca-256

Updated Oct 3, 2023 • 7

zen-E/ANLI-simcse-roberta-large-embeddings-pca-256

Updated Oct 3, 2023 • 16