joellidin committed · Commit 2633fe9 · verified · 1 parent: b18e50b

Update to Checkpoint-Two (420B tokens)

Files changed (2)
  1. README.md +14 -11
  2. assets/checkpoint-two.webp +3 -0
README.md CHANGED
@@ -12,12 +12,12 @@ model trained entirely from scratch at the 72 billion parameter scale.
 It is being trained with 20+ globally distributed participants coordinated via
 decentralized infrastructure on the Bittensor blockchain.
 
-**Checkpoint-One** marks the first release, corresponding to **200 billion
-tokens processed**. Model files are available in the [Checkpoint-One
-branch](https://huggingface.co/tplr/Covenant72B/tree/Checkpoint-One). Future
+**Checkpoint-Two** marks the second release, corresponding to **420 billion
+tokens processed**. Model files are available in the [Checkpoint-Two
+branch](https://huggingface.co/tplr/Covenant72B/tree/Checkpoint-Two). Future
 checkpoints will be updated here.
 
-![Checkpoint One](assets/checkpoint-one.webp)
+![Checkpoint Two](assets/checkpoint-two.webp)
 
 ---
 
@@ -27,7 +27,7 @@ checkpoints will be updated here.
 |-----------|--------|
 | **Model size** | 72B |
 | **Architecture** | LLaMA-style |
-| **Target token budget** | 1.2T (210B for current checkpoint) |
+| **Target token budget** | 1.2T (420B for current checkpoint) |
 | **Compute participants** | 20+ |
 | **Minimal compute per participant** | 8×B200 or equivalent |
 | **Dataset** | DCLM-baseline |
@@ -36,14 +36,17 @@ checkpoints will be updated here.
 ---
 
 ## Performance on Benchmarks
-_All results are 0-shot acc-norm (%)_
+_All results are 0-shot acc-norm (%) unless noted._
 
-| Model | Compute Environment / Permissions | Size | Tokens | ARC-C | ARC-E | PIQA | OpenBookQA | HellaSwag | Winogrande | MMLU |
+| Model | Compute Environment / Permissions | Size | Tokens | ARC-C | ARC-E | PIQA | OpenBookQA | HellaSwag | Winogrande (acc) | MMLU (acc) |
 |:------|:----------------------------------|------:|--------:|------:|------:|------:|------------:|-----------:|-------------:|------:|
-| **Intellect-1** | Over the internet / White List | 10B | 1T | 44.8 | 71.6 | 77.7 | 43.6 | 70.5 | 63.1 | 32.7 |
-| **Psyche Consilience-7Y9** | Over the internet / White List | 40B | 1.2T | 31.1 | 55.8 | 76.1 | 34.8 | 63.7 | 57.0 | 24.2 |
-| **Covenant72B Checkpoint One** | Over the internet / Permissionless | 70B | 210B | 46.2 | 72.6 | 79.2 | 43.0 | 73.5 | 70.3 | 38.0 |
-| **K2 Checkpoint 54** | Centralized Cluster | 65B | 210B | 41.8 | 69.5 | 80.1 | 42.4 | 74.9 | 68.9 | 33.7 |
+| **Intellect-1** | Internet / Whitelist | 10B | 1T | 44.8 | 71.6 | 77.7 | 43.6 | 70.5 | 63.1 | 32.7 |
+| **Psyche Consilience-7Y9** | Internet / Whitelist | 40B | 1.2T | 31.1 | 55.8 | 76.1 | 34.8 | 63.7 | 57.0 | 24.2 |
+| **Covenant72B (Checkpoint-Two)** | Internet / Permissionless | 72B | **420B** | **53.84** | **77.74** | **80.58** | **44.60** | **77.08** | **71.43** | **47.49** |
+| **LLM360 K2 ckpt_108** | Centralized Cluster | 65B | 420B | 45.73 | 70.54 | 80.90 | 43.20 | 78.23 | 71.90 | 50.01 |
+| **LLM360 K2 Stage 1** | Centralized Cluster | 65B | 1.4T | 53.84 | 75.93 | 82.48 | 48.00 | 82.81 | 76.64 | 63.90 |
+| **LLaMA-2-7B** | Centralized Cluster | 7B | 2T | 45.90 | 74.58 | 75.92 | 44.20 | 75.92 | 68.90 | 40.86 |
+| **LLaMA-2-70B** | Centralized Cluster | 70B | 2T | 57.59 | 80.77 | 82.92 | 48.60 | 83.86 | 77.58 | 65.56 |
 
 ---
 
assets/checkpoint-two.webp ADDED

Git LFS Details

  • SHA256: 304150dfec39a6576ea1a64d986f2a6bf868c181adf110b4a68b9169cdcb8770
  • Pointer size: 131 Bytes
  • Size of remote file: 372 kB
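
The updated benchmark table places Covenant72B (Checkpoint-Two) next to LLM360 K2 ckpt_108 at the same 420B-token mark. The sketch below is one way to read that pair of rows: it computes the per-benchmark gap in percentage points and an unweighted mean of those gaps. The scores are copied from the table; the variable names, the `score_deltas` helper, and the choice of an unweighted mean are assumptions made for illustration and are not part of the commit. Since Winogrande and MMLU are reported as acc while the other columns are acc-norm, the mean is only a rough summary.

```python
# Scores copied from the "Performance on Benchmarks" table in this commit.
BENCHMARKS = ["ARC-C", "ARC-E", "PIQA", "OpenBookQA",
              "HellaSwag", "Winogrande", "MMLU"]

covenant_ckpt2 = [53.84, 77.74, 80.58, 44.60, 77.08, 71.43, 47.49]  # 420B tokens
k2_ckpt_108 = [45.73, 70.54, 80.90, 43.20, 78.23, 71.90, 50.01]     # 420B tokens


def score_deltas(a, b):
    """Per-benchmark difference a - b, in percentage points (hypothetical helper)."""
    return {name: round(x - y, 2) for name, x, y in zip(BENCHMARKS, a, b)}


deltas = score_deltas(covenant_ckpt2, k2_ckpt_108)

# Unweighted mean of the gaps: an illustrative summary, not a figure from the README.
mean_delta = round(sum(deltas.values()) / len(deltas), 2)

# Benchmarks where Covenant72B (Checkpoint-Two) scores higher than K2 ckpt_108.
wins = [name for name, d in deltas.items() if d > 0]

for name, d in deltas.items():
    print(f"{name:<11} {d:+.2f}")
print(f"mean gap    {mean_delta:+.2f}")
print("ahead on:  ", ", ".join(wins))
```

On these numbers the gap is positive on ARC-C, ARC-E, and OpenBookQA and negative on PIQA, HellaSwag, Winogrande, and MMLU.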