AOF Eval Harness

Self-applied measurement of the Agent Operating Framework. Grades rule adherence, plan→delivery gap, cost, and dispatch quality across every Claude Code session.

Last run: 5/24/2026, 5:45:39 AM · 52 sessions · harness v1.0 · aof v1.5

Sessions: 52
Mean rule: 8.88
Mean plan→delivery: 10.00
Mean composite: 8.92
How to read these numbers. Scores run 0–10. They are uncalibrated against any external benchmark: there is no public dataset of AOF-style framework adherence to compare against. The absolute number isn't the signal; the slope over future runs is. v1.5 is the baseline. If a future v1.6 holds at 9.20, it's working; if it drops to 8.5 after a refactor, the refactor broke something. The session table below shows which individual sessions dragged the average down, and that's where the learning is.
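The comparison described above can be sketched in a few lines. This is a minimal illustration, not part of the harness: the function name `compare_to_baseline` and the `tolerance` of 0.1 are assumptions chosen for the example, and the inputs are the baseline mean composite (8.92) and the two hypothetical v1.6 outcomes mentioned in the text.

```python
def compare_to_baseline(baseline: float, current: float, tolerance: float = 0.1) -> str:
    """Classify a run's mean composite against the baseline run.

    `tolerance` is an assumed noise band, not a harness setting: changes
    smaller than it in either direction are treated as "held".
    """
    delta = current - baseline
    if delta < -tolerance:
        return "regressed"
    if delta > tolerance:
        return "improved"
    return "held"


BASELINE_V1_5 = 8.92  # mean composite reported for aof v1.5

# The two hypothetical v1.6 outcomes from the paragraph above.
print(compare_to_baseline(BASELINE_V1_5, 9.20))
print(compare_to_baseline(BASELINE_V1_5, 8.5))
```

A real trend check would likely look at several runs rather than a single pair, but the pairwise delta is enough to show why the baseline matters more than the absolute score.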

Rule adherence (14d)

Session cost (14d)

Cost data is not populated yet. Handoff frontmatter needs a cost_usd field; cc-analytics will fill it in on future sessions.
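As a rough sketch of what reading that field could look like, the snippet below pulls `cost_usd` out of a handoff file. Only the field name `cost_usd` comes from this page. The `---`-delimited, YAML-style frontmatter layout, the `session` key, and the function name `read_cost_usd` are assumptions made for illustration.

```python
from typing import Optional


def read_cost_usd(handoff_text: str) -> Optional[float]:
    """Return the cost_usd value from a handoff's frontmatter, or None.

    Assumes frontmatter is a block of `key: value` lines between two
    `---` marker lines at the top of the file (an assumed layout).
    """
    lines = handoff_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return None  # no frontmatter block at all
    for line in lines[1:]:
        if line.strip() == "---":
            break  # end of frontmatter, field not found
        key, sep, value = line.partition(":")
        if sep and key.strip() == "cost_usd":
            try:
                return float(value.strip())
            except ValueError:
                return None  # field present but not numeric
    return None


# Hypothetical handoff file; the values are placeholders, not real data.
example = """---
session: example
cost_usd: 0.42
---
Handoff notes go here.
"""
print(read_cost_usd(example))
```

Returning `None` for a missing or malformed field lets a report keep the Cost $ column empty for older sessions instead of failing.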

Sessions

Date        Machine  Rule  Plan→Delivery  Cost $  Composite
2026-05-24  win      7.0   -              -       7.0
2026-05-24  win      4.0   -              -       4.0
2026-05-23  win      9.0   -              -       9.0
2026-05-22  mac      10.0  -              -       10.0
2026-05-22  win      7.5   -              -       7.5
2026-05-22  win      10.0  -              -       10.0
2026-05-22  win      9.0   -              -       9.0
2026-05-22  win      9.0   10.0           -       9.4
2026-05-22  win      8.5   10.0           -       9.1
2026-05-22  win      9.0   -              -       9.0
2026-05-22  win      10.0  10.0           -       10.0
2026-05-22  win      8.5   -              -       8.5
2026-05-22  win      9.0   -              -       9.0
2026-05-22  win      7.5   -              -       7.5
2026-05-22  win      10.0  -              -       10.0
2026-05-22  win      10.0  -              -       10.0
2026-05-22  win      9.0   -              -       9.0
2026-05-22  win      9.0   -              -       9.0
2026-05-22  win      10.0  -              -       10.0
2026-05-21  win      10.0  -              -       10.0
2026-05-21  win      9.0   -              -       9.0
2026-05-21  win      9.0   10.0           -       9.4
2026-05-21  win      10.0  -              -       10.0
2026-05-21  win      8.5   -              -       8.5
2026-05-21  win      7.5   -              -       7.5
2026-05-21  win      10.0  -              -       10.0
2026-05-20  win      10.0  -              -       10.0
2026-05-20  win      6.0   -              -       6.0
2026-05-19  win      10.0  10.0           -       10.0
2026-05-19  win      10.0  -              -       10.0
2026-05-19  win      7.5   -              -       7.5
2026-05-19  win      7.0   -              -       7.0
2026-05-18  win      7.5   -              -       7.5
2026-05-18  win      7.0   -              -       7.0
2026-05-18  win      8.5   -              -       8.5
2026-05-18  win      9.0   -              -       9.0
2026-05-18  win      9.0   -              -       9.0
2026-05-18  win      9.0   -              -       9.0
2026-05-17  win      9.0   -              -       9.0
2026-05-17  win      9.0   -              -       9.0
2026-05-17  win      9.0   -              -       9.0
2026-05-17  win      10.0  -              -       10.0
2026-05-17  win      10.0  -              -       10.0
2026-05-16  win      9.0   -              -       9.0
2026-05-16  win      10.0  -              -       10.0
2026-05-15  mac      10.0  -              -       10.0
2026-05-15  mac      10.0  -              -       10.0
2026-05-15  mac      10.0  10.0           -       10.0
2026-05-15  mac      10.0  -              -       10.0
2026-05-15  win      8.5   -              -       8.5
2026-05-15  win      10.0  -              -       10.0
2026-05-15  win      9.0   10.0           -       9.4
2026-05-15  win      10.0  -              -       10.0
2026-05-14  mac      10.0  -              -       10.0
2026-05-14  win      9.0   -              -       9.0
2026-05-13  mac      9.0   -              -       9.0
2026-05-13  mac      6.0   10.0           -       7.6
2026-05-13  mac      9.0   10.0           -       9.4
2026-05-13  win      10.0  -              -       10.0
2026-05-12  mac      10.0  10.0           -       10.0
2026-05-12  mac      5.5   8.8            -       6.8
2026-05-12  mac      10.0  -              -       10.0
2026-05-12  mac      8.5   -              -       8.5
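
The rows that carry both a Rule and a Plan→Delivery score are consistent with a 60/40 weighted average, and rows with no Plan→Delivery score show a Composite equal to the Rule score. The sketch below reproduces that pattern. The weights are inferred from the table rather than taken from the harness source, so treat them and the function name `composite` as assumptions.

```python
from typing import Optional

# Inferred from the session rows, not from harness documentation.
RULE_WEIGHT = 0.6
PLAN_WEIGHT = 0.4


def composite(rule: float, plan_delivery: Optional[float] = None) -> float:
    """Combine rule adherence and plan→delivery into one 0–10 score.

    When no plan→delivery score exists, fall back to the rule score,
    which matches the rows that have only two values.
    """
    if plan_delivery is None:
        return round(rule, 1)
    return round(RULE_WEIGHT * rule + PLAN_WEIGHT * plan_delivery, 1)


# Checks against rows from the session table.
print(composite(9.0, 10.0))  # 2026-05-22 win row, listed as 9.4
print(composite(5.5, 8.8))   # 2026-05-12 mac row, listed as 6.8
print(composite(7.0))        # 2026-05-24 win row, listed as 7.0
```

If the harness later adds cost to the composite, the weights would need to change and the reconstruction above would no longer match the table.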