SocraticEnv

OpenEnv Hackathon · Meta × PyTorch × Scaler

Live Demo 🏆 Leaderboard API Docs
Connecting...
Speed:
🎓
SocraticEnv is ready
Select a task and click Start Episode

GRPO Trained Model

GRPO Model v1.0

Status: Weights Trained & Verified ✅

Improvement: +0.292 Overall Score

Live Dual-Inference Coming Soon

Auto-Run mode — AI is thinking...
Press Enter to send · Shift+Enter for new line No active task

🔬 Reward Math V3 DevTools

⚗️
Run an episode to inspect
the V3 anti-hack reward math.