·
AI & ML interests
None yet
Organizations
princeton-nlp/warm-start__ppo__think__Qwen2.5-7B
8B • Updated • 5
princeton-nlp/warm-start__dpo__nothink__Qwen2.5-7B-Instruct
8B • Updated • 4
princeton-nlp/warm-start__dpo__nothink__Llama-3.1-8B-Instruct
8B • Updated • 4
princeton-nlp/warm-start__dpo__nothink__Qwen2.5-7B
8B • Updated • 4
princeton-nlp/warm-start__dpo__nothink__Llama-3.1-8B
8B • Updated • 4
princeton-nlp/warm-start__dpo__think__Qwen2.5-7B-Instruct
8B • Updated • 5
princeton-nlp/warm-start__dpo__think__Llama-3.1-8B-Instruct
8B • Updated • 4
princeton-nlp/warm-start__dpo__think__Qwen2.5-7B
8B • Updated • 3
princeton-nlp/warm-start__dpo__think__Llama-3.1-8B
8B • Updated • 4
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B
8B • Updated • 4
princeton-nlp/warm-start__sft__nothink__Llama-3.1-8B
8B • Updated • 4
princeton-nlp/warm-start__sft__think__Qwen2.5-7B-Instruct
8B • Updated • 4
• 1
princeton-nlp/warm-start__sft__nothink__Llama-3.1-8B-Instruct
8B • Updated • 4
princeton-nlp/warm-start__sft__think__Qwen2.5-7B
8B • Updated • 6
princeton-nlp/warm-start__sft__think__Llama-3.1-8B
8B • Updated • 6
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct
8B • Updated • 4
princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct
8B • Updated • 5
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct
8B • Updated • 8.29k
• 26
princeton-nlp/Llama-3-8B-ProLong-512k-Base
8B • Updated • 8.23k
• 9
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct
Text Generation
• 8B • Updated • 8.62k
• • 13
princeton-nlp/Llama-3-8B-ProLong-64k-Base
Text Generation
• 8B • Updated • 8.26k
• • 6
princeton-nlp/Mistral-7B-Base-SFT-CPO
Text Generation
• 7B • Updated • 18
• • 1
princeton-nlp/Mistral-7B-Base-SFT-RRHF
Text Generation
• 7B • Updated • 20
• princeton-nlp/gemma-2-9b-it-SimPO
Text Generation
• 9B • Updated • 915
• • 172
princeton-nlp/gemma-2-9b-it-DPO
Text Generation
• 9B • Updated • 25
• • 9
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2
Text Generation
• 8B • Updated • 25
• • 8
princeton-nlp/Llama-3-Instruct-8B-RDPO-v0.2
Text Generation
• 8B • Updated • 19
• princeton-nlp/Llama-3-Instruct-8B-ORPO-v0.2
Text Generation
• 8B • Updated • 26
• princeton-nlp/Llama-3-Instruct-8B-KTO-v0.2
Text Generation
• 8B • Updated • 20
• princeton-nlp/Llama-3-Instruct-8B-CPO-v0.2
Text Generation
• 8B • Updated • 19
•