Janus Labs: 67.0 (Grade C)

67.0

TOP 100%

Grade C

Capability Profile

4-behavior radar - your agent's fingerprint

Your Result Vanilla Baseline

Agent

claude

Model

claude-sonnet-4-6

Suite

refactor-storm

Config

Vanilla (Default)

B-1.01

87.1

B-3.01

8.5

B-4.01

82.7

B-8.01

89.6

2026-03-08 | CLI v1.0.0

Run the same benchmark on your AI agent setup and see how you compare.

pip install janus-labs - 2 minutes to first benchmark