HA-DPO
Collection
Collections for Hallucination-aware Direct Preference Optimization • 7 items • Updated
How to use juliozhao/hadpo-minigpt4 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-chat-hf")
model = PeftModel.from_pretrained(base_model, "juliozhao/hadpo-minigpt4")The following bitsandbytes quantization config was used during training: