Update README.md
README.md
CHANGED
@@ -12,7 +12,10 @@ pipeline_tag: text-generation
 ---
 
 # mlx-community/Qwen1.5-MoE-A2.7B-4bit
-This model was converted to MLX format from [`Qwen/Qwen1.5-MoE-A2.7B`]() using mlx-lm version
+This model was converted to MLX format from [`Qwen/Qwen1.5-MoE-A2.7B`](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B) using mlx-lm version version [d661440](https://github.com/ml-explore/mlx-examples/commit/d661440dbb8e1970fadad79c5061e786fe1c54ca).
+
+Model added by [Prince Canuma](https://twitter.com/Prince_Canuma).
+
 Refer to the [original model card](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B) for more details on the model.
 ## Use with mlx
 
@@ -24,5 +27,5 @@ pip install mlx-lm
 from mlx_lm import load, generate
 
 model, tokenizer = load("mlx-community/Qwen1.5-MoE-A2.7B-4bit")
-response = generate(model, tokenizer, prompt="
+response = generate(model, tokenizer, prompt="Write a story about Einstein", verbose=True)
 ```