Pretrained models from the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
Zayd Muhammad Kawakibi Zuhri
zaydzuhri
AI & ML interests
I really like watching loss go down
Recent Activity
upvoted a paper about 17 hours ago
LinguDistill: Recovering Linguistic Ability in Vision-Language Models via Selective Cross-Modal Distillation
updated a dataset 18 days ago
zaydzuhri/anbncn-language-33
published a dataset 18 days ago
zaydzuhri/anbncn-language-33
Organizations
None yet