TongZheng PRO
TongZheng1999
AI & ML interests
Natural Language Processing
Recent Activity
updated a model about 9 hours ago
AutoTTS/history published a model about 9 hours ago
AutoTTS/history upvoted a paper about 22 hours ago
G-Zero: Self-Play for Open-Ended Generation from Zero DataOrganizations
models 394
TongZheng1999/Final-Reasoning-4B-Iter1-Strong-Init-Filtered-RB-by-Judge
4B • Updated • 13
TongZheng1999/Final-Reasoning-4B-Iter1-Strong-Init-Filtered-RB
4B • Updated • 6
TongZheng1999/Initial-Dual-Reasoning-4B-Iter1-Strong-Init-Filtered-RB
4B • Updated • 3
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_
Updated
TongZheng1999/Initial-Dual-Reasoning-4B-Iter1-Strong-Init-Filter-step1200
4B • Updated • 1
TongZheng1999/Initial-Dual-Reasoning-4B-Iter1-Strong-Init-Filter-step1000
4B • Updated • 2
TongZheng1999/Initial-Dual-Reasoning-4B-Iter1-Strong-Init-No-Filter-step300
4B • Updated • 1
TongZheng1999/Initial-Dual-Reasoning-4B-Added-Special-Tokens
4B • Updated • 68
TongZheng1999/Initial-Dual-Reasoning-4B
4B • Updated • 4
TongZheng1999/HS_Reasoning_4B_Filter_1_epoch
4B • Updated • 1
datasets 60
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge_f_by_judge
Viewer • Updated • 22.1k • 67
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_filtered_by_judge
Viewer • Updated • 5.43k • 20
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed_Merge
Viewer • Updated • 33.4k • 55
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_processed
Viewer • Updated • 16.7k • 19
TongZheng1999/Bespoke-Stratos-17k-Processed
Viewer • Updated • 16.7k • 30
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150
Viewer • Updated • 16.7k • 19
TongZheng1999/Bespoke-Stratos-17k-Init-Model-Final-Reinforce-Baseline-Iter1-Strong-Init-Filtered-Merged
Viewer • Updated • 46.5k • 3
TongZheng1999/iter_1_reinforce_baseline_per_sample_200epoch_strong_init_step_150_filtered
Viewer • Updated • 13.1k • 5
TongZheng1999/Reasoning-Gym-Hard
Viewer • Updated • 30 • 3
TongZheng1999/Reasoning-Gym
Viewer • Updated • 30 • 3