Performance Discussion

by IndenScale - opened 9 days ago

9 days ago

能代替 Qwen3 30B 系列吗？

这个尺寸的模型非常适合上生产。或者作为 pre-commit/push hooks 用来构建多层次的 CI Guardrail。

后续会更新在 MLX 上的性能表现。

9 days ago

I wanted to know token per second speed.

9 days ago

I wanted to know token per second speed.

I'll try using it with vllm on my AI max 395+

8 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment