QUEST-30B-SFT

QUEST 30B vanilla SFT checkpoint (Qwen3-30B-A3B base, dense). This is an intermediate artifact in the QUEST 30B training pipeline (MT โ†’ SFT โ†’ RL).

We did not run benchmark evaluations on this checkpoint. For full pipeline results, see QUEST-30B-RL.

Training stage

Stage Applied
Mid-training (MT) โœ—
Supervised fine-tuning (SFT) โœ“
Reinforcement learning (RL) โœ—

Quick start

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "osunlp/QUEST-30B-SFT"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, device_map="auto", torch_dtype="auto",
)

License

Released under the Apache License 2.0.

Downloads last month
80
Safetensors
Model size
31B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Collection including osunlp/QUEST-30B-SFT