Update to Checkpoint-Two (420B tokens)
Files changed:
- README.md +14 -11
- assets/checkpoint-two.webp +3 -0

README.md CHANGED
@@ -12,12 +12,12 @@ model trained entirely from scratch at the 72 billion parameter scale.
 It is being trained with 20+ globally distributed participants coordinated via
 decentralized infrastructure on the Bittensor blockchain.
 
-**Checkpoint-
-tokens processed**. Model files are available in the [Checkpoint-
-branch](https://huggingface.co/tplr/Covenant72B/tree/Checkpoint-
-. Future
 checkpoints will be updated here.
 
+
 
 ---
 
@@ … @@
 |-----------|--------|
 | **Model size** | 72B |
 | **Architecture** | LLaMA-style |
+| **Target token budget** | 1.2T (420B for current checkpoint) |
 | **Compute participants** | 20+ |
 | **Minimal compute per participant** | 8×B200 or equivalent |
 | **Dataset** | DCLM-baseline |
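
The added "Target token budget" row implies how far along training is. A minimal sketch of that arithmetic follows, using only the figures from the table (72B parameters, 1.2T target, 420B processed). The compute estimate uses the common `6 * N * D` rule of thumb for dense transformers, which is an assumption introduced here and not a figure from the README; the function names are hypothetical.

```python
# Figures taken from the README table in this diff.
N_PARAMS = 72e9            # model size: 72B parameters
TARGET_TOKENS = 1.2e12     # target token budget: 1.2T
CHECKPOINT_TOKENS = 420e9  # tokens processed at the current checkpoint: 420B


def budget_fraction(tokens_done: float, tokens_target: float) -> float:
    """Share of the token budget consumed so far."""
    return tokens_done / tokens_target


def approx_training_flops(n_params: float, n_tokens: float) -> float:
    """Rough training compute in FLOPs.

    Assumption: the 6 * N * D heuristic for dense transformers.
    It is not stated in the README and ignores communication overhead.
    """
    return 6.0 * n_params * n_tokens


if __name__ == "__main__":
    frac = budget_fraction(CHECKPOINT_TOKENS, TARGET_TOKENS)
    flops = approx_training_flops(N_PARAMS, CHECKPOINT_TOKENS)
    print(f"Token budget consumed: {frac:.0%}")
    print(f"Approximate training compute so far: {flops:.3e} FLOPs")
```

Under these assumptions the checkpoint sits at roughly a third of the planned token budget, which is worth keeping in mind when reading the benchmark rows that compare it against models trained on 1T tokens or more.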
@@ … @@
 ---
 
 ## Performance on Benchmarks
+_All results are 0-shot acc-norm (%) unless noted._
 
+| Model | Compute Environment / Permissions | Size | Tokens | ARC-C | ARC-E | PIQA | OpenBookQA | HellaSwag | Winogrande (acc) | MMLU (acc) |
 |:------|:----------------------------------|------:|--------:|------:|------:|------:|------------:|-----------:|-------------:|------:|
+| **Intellect-1** | Internet / Whitelist | 10B | 1T | 44.8 | 71.6 | 77.7 | 43.6 | 70.5 | 63.1 | 32.7 |
+| **Psyche Consilience-7Y9** | Internet / Whitelist | 40B | 1.2T | 31.1 | 55.8 | 76.1 | 34.8 | 63.7 | 57.0 | 24.2 |
+| **Covenant72B (Checkpoint-Two)** | Internet / Permissionless | 72B | **420B** | **53.84** | **77.74** | **80.58** | **44.60** | **77.08** | **71.43** | **47.49** |
+| **LLM360 K2 ckpt_108** | Centralized Cluster | 65B | 420B | 45.73 | 70.54 | 80.90 | 43.20 | 78.23 | 71.90 | 50.01 |
+| **LLM360 K2 Stage 1** | Centralized Cluster | 65B | 1.4T | 53.84 | 75.93 | 82.48 | 48.00 | 82.81 | 76.64 | 63.90 |
+| **LLaMA-2-7B** | Centralized Cluster | 7B | 2T | 45.90 | 74.58 | 75.92 | 44.20 | 75.92 | 68.90 | 40.86 |
+| **LLaMA-2-70B** | Centralized Cluster | 70B | 2T | 57.59 | 80.77 | 82.92 | 48.60 | 83.86 | 77.58 | 65.56 |
 
 ---
 
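
The two rows at the same token count (Covenant72B Checkpoint-Two and LLM360 K2 ckpt_108, both at 420B) are the closest like-for-like comparison in the new table. The sketch below computes per-benchmark differences and an unweighted mean from the values in those two rows. The unweighted mean mixes acc-norm and acc metrics and is a convenience summary introduced here, not a metric reported in the README; `deltas` and `mean_score` are hypothetical helper names.

```python
# Scores copied from the benchmark table in this diff (0-shot, %).
BENCHMARKS = ["ARC-C", "ARC-E", "PIQA", "OpenBookQA",
              "HellaSwag", "Winogrande", "MMLU"]

SCORES = {
    "Covenant72B (Checkpoint-Two)": [53.84, 77.74, 80.58, 44.60, 77.08, 71.43, 47.49],
    "LLM360 K2 ckpt_108": [45.73, 70.54, 80.90, 43.20, 78.23, 71.90, 50.01],
}


def deltas(model_a: str, model_b: str) -> dict[str, float]:
    """Per-benchmark difference (model_a minus model_b), in percentage points."""
    return {
        name: round(a - b, 2)
        for name, a, b in zip(BENCHMARKS, SCORES[model_a], SCORES[model_b])
    }


def mean_score(model: str) -> float:
    """Unweighted mean over the seven benchmarks (a summary, not a README metric)."""
    values = SCORES[model]
    return round(sum(values) / len(values), 2)


if __name__ == "__main__":
    a, b = "Covenant72B (Checkpoint-Two)", "LLM360 K2 ckpt_108"
    for name, diff in deltas(a, b).items():
        print(f"{name:<11} {diff:+.2f}")
    print(f"mean: {mean_score(a):.2f} vs {mean_score(b):.2f}")
```

On these numbers Covenant72B leads on ARC-C, ARC-E, and OpenBookQA and trails on the other four, so the bold formatting in the table marks the highlighted row and not the best value in each column.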
assets/checkpoint-two.webp ADDED