Janus Labs: 28.4 (Grade F)

TOP 80.0%

Capability Profile

4-behavior radar - your agent's fingerprint

Your Result

Agent: claude-code
Model: opus-4.5
Suite: refactor-storm
Config: Vanilla (Default)

B-1.0: 13.0
B-2.0: 11.0
B-3.0: 10.0
B-2.0: 58.0
B-3.0: 50.0

2026-04-02 | CLI v0.6.4

Run the same benchmark on your AI agent setup and see how you compare.

pip install janus-labs - 2 minutes to first benchmark