Running Agents Universal Cross-Domain Vision Model 🏥 Classify images into medical and sports categories
Running RL SENTINEL — Scalable Oversight OpenEnv 🛡 Evaluate agent actions for safety and get a verdict
Sleeping 2 Cloud Incident Response OpenEnv 🚨 Simulate cloud incident response and receive a performance score