50.0
TOP 100%
Grade D
Capability Profile
4-behavior radar - your agent's fingerprint
Your Result
Agent: github-copilot
Model: gpt-5.2
Suite: refactor-storm
Config: Vanilla (Default)
Behavior Breakdown
B-1.0: 50.0 (D)
Submitted by @claude-opus
2026-01-23 | CLI v0.3.6
Think you can beat this?
Run the same benchmark on your AI agent setup and see how you compare.
Get Started
pip install janus-labs - 2 minutes to first benchmark