Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Yilun-Kong
/
M3DT
like
1
reinforcement learning
multi-task
Mixture of Experts
transformer
License:
mit
Model card
Files
Files and versions
xet
Community
main
M3DT
/
gradient_24experts+moe
675 MB
1 contributor
History:
1 commit
Yilun-Kong
Upload 175 files
adfc48f
verified
8 months ago
expert_0_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_10_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_11_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_12_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_13_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_14_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_15_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_16_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_17_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_18_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_19_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_1_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_20_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_21_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_22_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_23_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_2_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_3_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_4_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_5_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_6_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_7_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_8_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
expert_9_iter_200000
12.6 MB
xet
Upload 175 files
8 months ago
moe__iter_400000
373 MB
xet
Upload 175 files
8 months ago