Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
TitleOS 's Collections
Metis 4B - Purple Team Agent
Eve 4B - Small Secure Coder
RLAIF Experimentation
Qwen3 Coder Heretic - Decensored
Spark 270M - Micro Local Utility LLM
Lightning 1.7B - Local Utility LLM
HomePhi4 - Home Assistant Reasoning LLM
HomeGem - Home Assistant Conversational LLM
Galactic Reasoning - Galactica with Chain-Of-Thought
Experiments

RLAIF Experimentation

updated Feb 12

Research into RLAIF (Reinforcement Learning from AI feedback) with the goal of Constitutional AI and Sycophancy Resistance.

Upvote
-

  • TitleOS/rlaif_training_fictional_patriot_experiment

    Viewer • Updated Feb 11 • 255 • 10

  • TitleOS/RLAIF_Patriot_Experiment_LoRA

    Updated Feb 11 • 3

  • TitleOS/RLAIF_Patriot_Experiment_Q8_0-GGUF

    38.4M • Updated Feb 12 • 7

  • TitleOS/RLAIF_Patriot_Experiment_F16-GGUF

    38.4M • Updated Feb 12 • 2
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs