rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_10_of_10 1B • Updated 4 days ago • 13
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_9_of_10 1B • Updated 4 days ago • 14
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_8_of_10 1B • Updated 4 days ago • 13
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_7_of_10 1B • Updated 4 days ago • 14
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_6_of_10 1B • Updated 4 days ago • 13
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_5_of_10 1B • Updated 4 days ago • 14
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_4_of_10 1B • Updated 4 days ago • 13
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_3_of_10 1B • Updated 4 days ago • 13
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_2_of_10 1B • Updated 4 days ago • 13
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_v2_ckpt_1_of_10 1B • Updated 4 days ago • 12
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_10_of_10 1B • Updated 6 days ago • 14
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_9_of_10 1B • Updated 6 days ago • 13
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_8_of_10 1B • Updated 6 days ago • 12
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_7_of_10 1B • Updated 6 days ago • 17
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_6_of_10 1B • Updated 6 days ago • 15
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_5_of_10 1B • Updated 6 days ago • 13
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_4_of_10 1B • Updated 6 days ago • 14
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_3_of_10 1B • Updated 6 days ago • 10
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_2_of_10 1B • Updated 6 days ago • 12
rosieyzh/sft_llama1_alma_lr_3e-6_cosine_2_epochs_pretrain_mode_ckpt_1_of_10 1B • Updated 6 days ago • 17
rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_2_epochs_pretrain_mode_ckpt_10_of_10 2B • Updated 6 days ago • 14
rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_2_epochs_pretrain_mode_ckpt_9_of_10 2B • Updated 6 days ago • 20
rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_2_epochs_pretrain_mode_ckpt_8_of_10 2B • Updated 6 days ago • 17
rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_2_epochs_pretrain_mode_ckpt_7_of_10 2B • Updated 6 days ago • 18
rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_2_epochs_pretrain_mode_ckpt_6_of_10 2B • Updated 6 days ago • 17
rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_2_epochs_pretrain_mode_ckpt_5_of_10 2B • Updated 6 days ago • 14
rosieyzh/sft_qwen15_code200_lr_1e-5_cosine_2_epochs_pretrain_mode_ckpt_4_of_10 2B • Updated 6 days ago • 16