My series of fully open, state-of-the-art small mixture-of-experts models.
aquiffoo
aquiffoo
AI & ML interests
thanks for everything.
Recent Activity
liked a model 2 days ago
mistralai/Leanstral-2603 liked a model 2 days ago
mistralai/Mistral-Small-4-119B-2603 liked a model 7 days ago
RekaAI/reka-edge-2603Organizations
Mesh-v0.1 Preview
Solo research around the `mesh` architecture: a novel solution to the problems of MoE
-
mesh-labs/v0.1-2x2-stage001
Text Generation • 0.4B • Updated • 2 • 1 -
mesh-labs/v0.1-2x2-stage002
Text Generation • 0.4B • Updated • 2 • 1 -
mesh-labs/v0.1-2x2-stage003
Text Generation • 0.4B • Updated • 1 • 1 -
mesh-labs/v0.1-2x2-stage002-adapter
Text Generation • Updated • 1
neo
my GPT-2-like model, pretrained from scratch
My top picks
Models i really really like
-
zai-org/GLM-5
Text Generation • 754B • Updated • 102k • • 1.82k -
zai-org/GLM-4.7-Flash
Text Generation • 31B • Updated • 1.76M • • 1.61k -
moonshotai/Kimi-K2.5
Image-Text-to-Text • 1.1T • Updated • 3.17M • • 2.3k -
fal/FLUX.2-dev-Turbo
Text-to-Image • Updated • 16.4k • • 340
neo-2
The second generation of my pretrained models.
neo-3
My series of fully open, state-of-the-art small mixture-of-experts models.
My top picks
Models i really really like
-
zai-org/GLM-5
Text Generation • 754B • Updated • 102k • • 1.82k -
zai-org/GLM-4.7-Flash
Text Generation • 31B • Updated • 1.76M • • 1.61k -
moonshotai/Kimi-K2.5
Image-Text-to-Text • 1.1T • Updated • 3.17M • • 2.3k -
fal/FLUX.2-dev-Turbo
Text-to-Image • Updated • 16.4k • • 340
Mesh-v0.1 Preview
Solo research around the `mesh` architecture: a novel solution to the problems of MoE
-
mesh-labs/v0.1-2x2-stage001
Text Generation • 0.4B • Updated • 2 • 1 -
mesh-labs/v0.1-2x2-stage002
Text Generation • 0.4B • Updated • 2 • 1 -
mesh-labs/v0.1-2x2-stage003
Text Generation • 0.4B • Updated • 1 • 1 -
mesh-labs/v0.1-2x2-stage002-adapter
Text Generation • Updated • 1
neo-2
The second generation of my pretrained models.
neo
my GPT-2-like model, pretrained from scratch