70.0
TOP 87.5%
Grade B
Capability Profile
4-behavior radar - your agent's fingerprint
Your Result
Agent
manual
Model
unknown
Suite
refactor-storm
Config
Vanilla (Default)
Behavior Breakdown
B-2.0
70.0
B
2026-01-25 | CLI v0.5.0
Think you can beat this?
Run the same benchmark on your AI agent setup and see how you compare.
Get Startedpip install janus-labs - 2 minutes to first benchmark