Prufs captures the full reasoning chain behind AI-generated code - from your directive through every decision, alternative, and trade-off - as a cryptographically signed, tamper-evident audit trail.
Six node types connected by causal edges. Every line of code traces back to the human directive that spawned it, through every decision and constraint along the way.
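To make the tracing idea concrete, here is a minimal sketch of walking causal edges from a line of code back to its directive. It is an illustration only, not the Prufs SDK: the node IDs, the `causes` field, the `implementation` node type, and the `trace_to_directives` helper are all assumptions made for the example.

```python
from collections import deque

# Hypothetical trail: each node lists the IDs of the nodes that caused it.
# Node type names other than directive/decision/constraint are assumed.
nodes = {
    "d1": {"type": "directive", "causes": []},
    "i1": {"type": "interpretation", "causes": ["d1"]},
    "c1": {"type": "constraint", "causes": ["d1"]},
    "k1": {"type": "decision", "causes": ["i1", "c1"]},
    "code:limiter.py:12": {"type": "implementation", "causes": ["k1"]},
}


def trace_to_directives(nodes, start):
    """Follow causal edges backwards (breadth-first) and collect directives."""
    seen = {start}
    queue = deque([start])
    found = []
    while queue:
        node_id = queue.popleft()
        node = nodes[node_id]
        if node["type"] == "directive":
            found.append(node_id)
        for cause in node["causes"]:
            if cause not in seen:  # a node can be reachable via several paths
                seen.add(cause)
                queue.append(cause)
    return found


origin = trace_to_directives(nodes, "code:limiter.py:12")
```

In this toy trail the code line is reached from the directive by two paths (via the interpretation and via the constraint), which is why the walk tracks visited nodes.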
Every event is Ed25519-signed and hash-chained. Modify one event and the chain fails verification from that point on. Verification is independent of the SDK - any tool with the public key can audit the trail.
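The tamper-evidence property can be sketched with a plain hash chain. This is a simplified illustration under stated assumptions, not the Prufs event format: it uses SHA-256 over canonical JSON, leaves out the Ed25519 signature step entirely, and the field names (`payload`, `prev`, `hash`) and helper functions are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64  # assumed placeholder "previous hash" for the first event


def event_hash(payload, prev_hash):
    """Hash an event together with the hash of its predecessor."""
    body = json.dumps({"payload": payload, "prev": prev_hash}, sort_keys=True)
    return hashlib.sha256(body.encode("utf-8")).hexdigest()


def build_chain(payloads):
    """Link events so each one commits to the one before it."""
    chain, prev = [], GENESIS
    for payload in payloads:
        digest = event_hash(payload, prev)
        chain.append({"payload": payload, "prev": prev, "hash": digest})
        prev = digest
    return chain


def first_broken_index(chain):
    """Return the index of the first event that fails verification, or None."""
    prev = GENESIS
    for index, event in enumerate(chain):
        recomputed = event_hash(event["payload"], event["prev"])
        if event["prev"] != prev or recomputed != event["hash"]:
            return index
        prev = event["hash"]
    return None


chain = build_chain([
    {"type": "directive", "text": "add rate limiting"},
    {"type": "constraint", "text": "no new dependencies"},
    {"type": "decision", "text": "use a token bucket"},
])
```

Editing the payload of any event makes its recomputed hash differ from the stored one. Recomputing that stored hash to cover the edit then breaks the `prev` link of the next event. In a signed chain, rewriting every later event as well would require the private signing key, which this sketch does not model.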
Not just what was chosen, but what was rejected and why. Domain-tagged, confidence-scored, and sensitivity-classified. Security decisions are automatically restricted via RBAC.
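As a rough sketch of what such a decision record might look like, the example below models a decision with its rejected alternatives, a domain tag, a confidence score, and a sensitivity class, plus a simple role filter. All class names, field names, role names, and the rule that security decisions become `restricted` are assumptions for illustration; the actual Prufs schema and RBAC model may differ.

```python
from dataclasses import dataclass, field


@dataclass
class Alternative:
    option: str
    rejected_because: str


@dataclass
class Decision:
    chosen: str
    rationale: str
    domain: str                    # e.g. "security", "performance"
    confidence: float              # assumed to be a score in [0.0, 1.0]
    sensitivity: str = "internal"  # assumed classes: "internal", "restricted"
    alternatives: list[Alternative] = field(default_factory=list)

    def __post_init__(self):
        if not 0.0 <= self.confidence <= 1.0:
            raise ValueError("confidence must be between 0.0 and 1.0")
        # Assumed rule: security decisions are restricted automatically.
        if self.domain == "security":
            self.sensitivity = "restricted"


# Hypothetical role-to-clearance mapping.
ROLE_CLEARANCE = {
    "viewer": {"internal"},
    "security_reviewer": {"internal", "restricted"},
}


def visible_to(role, decisions):
    """Return only the decisions whose sensitivity the role may read."""
    allowed = ROLE_CLEARANCE.get(role, set())
    return [d for d in decisions if d.sensitivity in allowed]


decisions = [
    Decision(
        chosen="token bucket",
        rationale="handles bursts without extra state",
        domain="performance",
        confidence=0.8,
        alternatives=[Alternative("fixed window", "allows bursts at window edges")],
    ),
    Decision(
        chosen="rotate API keys every 30 days",
        rationale="limits exposure of leaked keys",
        domain="security",
        confidence=0.9,
    ),
]
```

With these assumed roles, a `viewer` sees only the performance decision, while a `security_reviewer` sees both.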
Replace diff-reading with trail-walking. Reviewers see the directive, interpretation, decisions, and trade-offs - then approve at the reasoning level, not the line level.
Hover over any line of code to see its decision trail. Gutter icons show trail coverage. Right-click to trace any line back to its originating directive.
Ship with a SKILL.md and agents instrument themselves. Hooks for Claude Code and Cursor. SessionObserver auto-detects decisions and constraints from natural language.
The SDK is free and open source. Pay for cloud when your team needs shared trails.
Prufs was designed alongside the Composite Reliability Index (CRI) framework for evaluating LLM agent reliability in production banking environments. The decision trail directly implements the Explainability Performance Index (EPI) sub-index - making agent-generated code auditable at the reasoning level, not just the output level.
Request a CRI readiness assessment
Phase 3 in progress: cloud platform and sync engine live