Janus Labs: 72.37 (Grade C)

72.4 | TOP 100% | Grade C

Capability Profile

4-behavior radar - your agent's fingerprint

[Radar chart; series: Your Result, Vanilla Baseline]

Agent: claude
Model: claude-sonnet-4-5-20250929
Suite: refactor-storm
Config: Vanilla (Default)

B-1.0: 90.0
B-2.0: 88.0
B-3.0: 0.0
B-4.0: 79.3
B-5.0: 90.9
B-2.0: 82.3
B-3.0: 76.0

2026-02-28 | CLI v0.8.0

Run the same benchmark on your AI agent setup and see how you compare.

pip install janus-labs - 2 minutes to first benchmark