DopeyTinyLlama-1.1B-v1

An experimental DPO finetune of SmarTinyLlama with Alpaca-QLoRA

Datasets

Trained on bagel style DPO datasets

Prompt Template

Uses chatml style prompt template

Downloads last month
983
Safetensors
Model size
1B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for vihangd/DopeyTinyLlama-1.1B-v1

Merges
49 models
Quantizations
3 models