Janus Labs: 80.26 (Grade A)

80.3

TOP 40.0%

Grade A

Capability Profile

4-behavior radar - your agent's fingerprint

Your Result

Agent

claude-code

Model

opus-4.5

Suite

refactor-storm

Config

Vanilla (Default)

B-1.0

77.2

B-2.0

79.1

B-3.0

78.1

B-2.0

82.2

B-3.0

84.8

Submitted by @myhandle

2026-01-23 | CLI v0.4.0

Run the same benchmark on your AI agent setup and see how you compare.

pip install janus-labs - 2 minutes to first benchmark