A mix of internal pilots inside a regulated agency and freelance systems clients run in
production. The proprietary ones are shown through architecture and sanitized artifacts; the
open ones link straight to the code.
Open source · live
claude-autofix — a self-healing error system
Python · Anthropic API · Claude Code
When a production process throws an error and no one's awake to fix it, this doesn't just
retry — it reads its own source code, reasons about the failure, rewrites the code that
caused it, restarts the process, and emails a report. Autonomously, inside hard guardrails
(rate limits, a daily attempt cap, deduplication, and escalation to a human). Ran in
production for 53 days on a personal data pipeline; this repo is the genericized, open
extraction, with a no-API-key demo and real fix reports the agents wrote themselves.
53 active days in production
42 error types diagnosed
691 errors → 1 fix (one day)
runnable in 10s, no API key
Flagship
Policy Navigator
Python · FastAPI · ChromaDB · Voyage AI · Anthropic API · HTMX
A multi-source RAG over 11,000+ regulatory sections across four authoritative sources —
ask a plain-language policy question, get a cited, synthesized answer. Cross-checks sources
to surface gaps and drafts new policy content from research.
Open source
agent_lab — a fleet of scheduled agents
Python · Claude Code
A framework for running specialized, scheduled Claude Code agents that share a workspace,
message each other through files, and accumulate durable knowledge over time — extracted
and genericized from a live multi-agent system.
Detail coming
Draft-Document Peer Reviewer
Python · Anthropic API · python-docx
An agentic reviewer that checks draft documents against department standards — accessibility,
inclusivity, formatting, completeness — and returns structured feedback for human signoff.
A worked before/after of AI removing toil while keeping a human in the loop.
Detail coming
Regulations Comparator
Python · Anthropic API · ChromaDB · Voyage AI
A 4-stage agentic pipeline (classify → triage → deep investigation → cross-check) detecting
coverage gaps across 480+ regulatory sections before auditors find them — a generalizable
pattern for comparing any two compliance bodies.