tokyotech-llm/Llama-3.1-8B-code-ablation-exp7-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0007500
8B • Updated • 4
tokyotech-llm/Llama-3.1-8B-code-ablation-exp6-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0007500
Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp9-LR2.5e-5-WD0.1-iter0005000
8B • Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp5-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0007500
Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp7-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0005000
Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp6-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0005000
8B • Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp9-LR2.5e-5-WD0.1-iter0002500
8B • Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp5-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0005000
Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp7-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0002500
8B • Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp6-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0002500
Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp5-Llama-3.3-70B-LR2.5e-5-WD0.1-iter0002500
Updated
tokyotech-llm/Llama-3.1-Swallow-8B-v0.2
Text Generation • 8B • Updated • 706 • 4
tokyotech-llm/Llama-3.1-Swallow-70B-v0.1
Text Generation • 71B • Updated • 15 • 4
tokyotech-llm/Llama-3.1-Swallow-8B-v0.1
Text Generation • 8B • Updated • 172 • 10
tokyotech-llm/edu-classifier
Text Classification • Updated • 447 • 13
tokyotech-llm/Swallow-7b-NVE-hf
Text Generation • 7B • Updated • 11 • 2
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0012500
8B • Updated • 4
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0010000
8B • Updated • 2
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0007500
Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0005000
Updated
tokyotech-llm/Llama-3.1-8B-code-ablation-exp4-LR2.5e-5-MINLR2.5E-6-WD0.1-iter0002500
8B • Updated • 2
tokyotech-llm/Llama-3-Swallow-70B-v0.1
Text Generation • Updated • 9 • 6
tokyotech-llm/Llama-3-Swallow-8B-v0.1
Text Generation • Updated • 240 • 12
tokyotech-llm/Llama-3-Swallow-70B-Instruct-v0.1
Text Generation • 71B • Updated • 32 • 7
tokyotech-llm/Llama-3-Swallow-8B-Instruct-v0.1
Text Generation • 8B • Updated • 11k • 21
tokyotech-llm/Swallow-70b-instruct-v0.1
Text Generation • 69B • Updated • 24
tokyotech-llm/Swallow-13b-instruct-v0.1
Text Generation • 13B • Updated • 42 • 1
tokyotech-llm/Swallow-7b-instruct-v0.1
Text Generation • 7B • Updated • 156 • 3
tokyotech-llm/Swallow-70b-NVE-instruct-hf
Text Generation • 69B • Updated • 5 • 2
tokyotech-llm/Swallow-70b-instruct-hf
Text Generation • 69B • Updated • 892 • 37