Update README.md
README.md CHANGED

@@ -41,7 +41,7 @@ As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7,

**We do not recommend using base language models for conversations.** Instead, you can apply post-training, e.g., SFT, RLHF, continued pretraining, etc., or fill in the middle tasks on this model.

-For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2.5-coder/), [GitHub](https://github.com/QwenLM/Qwen2.5-Coder), [Documentation](https://qwen.readthedocs.io/en/latest/), [Arxiv](https://arxiv.org/abs/2409.12186).
+For more details, please refer to our [blog](https://qwenlm.github.io/blog/qwen2.5-coder-family/), [GitHub](https://github.com/QwenLM/Qwen2.5-Coder), [Documentation](https://qwen.readthedocs.io/en/latest/), [Arxiv](https://arxiv.org/abs/2409.12186).

## Requirements

@@ -76,7 +76,7 @@ We advise adding the `rope_scaling` configuration only when processing long cont

## Evaluation & Performance

-Detailed evaluation results are reported in this [📑 blog](https://qwenlm.github.io/blog/qwen2.5-coder/).
+Detailed evaluation results are reported in this [📑 blog](https://qwenlm.github.io/blog/qwen2.5-coder-family/).

For requirements on GPU memory and the respective throughput, see results [here](https://qwen.readthedocs.io/en/latest/benchmark/speed_benchmark.html).
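The context above recommends using the base model for fill-in-the-middle (FIM) tasks rather than conversation. As a minimal, illustrative sketch (not part of this change), the snippet below assembles a FIM prompt with the `<|fim_prefix|>` / `<|fim_suffix|>` / `<|fim_middle|>` special tokens described in the Qwen2.5-Coder GitHub repository; the model id `Qwen/Qwen2.5-Coder-7B`, the example function, and the generation settings are assumptions for illustration.

```python
# Minimal fill-in-the-middle (FIM) sketch for a Qwen2.5-Coder *base* model.
# The FIM special tokens follow the Qwen2.5-Coder repo; model id and example are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen2.5-Coder-7B"  # base model, not the -Instruct variant
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")

# The base model completes the missing middle between a given prefix and suffix.
prefix = "def quicksort(arr):\n    if len(arr) <= 1:\n        return arr\n"
suffix = "\n    return quicksort(left) + middle + quicksort(right)\n"
prompt = f"<|fim_prefix|>{prefix}<|fim_suffix|>{suffix}<|fim_middle|>"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)

# Decode only the newly generated tokens, i.e. the proposed middle span.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

The text generated after `<|fim_middle|>` is the model's proposal for the missing span; for conversational use, the instruct variants remain the recommended choice.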