u-10bei/dbbench_sft_dataset_react_v3
Viewer • Updated • 1.2k • 196
This repository provides a merged full-weight model fine-tuned from Qwen2.5-7B-Instruct using LoRA + Unsloth, then merged to 16bit.
This model is trained to improve DB Bench (database operation) performance on the AgentBench evaluation benchmark. ALFWorld performance relies entirely on the base model's inherent capability (no ALFWorld training data used).
Loss is applied to all assistant turns in the multi-turn trajectory, enabling the model to learn SQL generation, action selection, and error recovery.
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
model_id = "koguma-ai/dbbench-combined-baseline0301"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
model_id,
torch_dtype=torch.bfloat16,
device_map="auto",
)
Training data: u-10bei/dbbench_sft_dataset_react (v1-v4)
Dataset License: Apache-2.0. Users must comply with the Apache-2.0 license and the base model's original terms of use.