Claude Autoresearch: AI That Improves Code Forever
Claude Autoresearch: Turn Claude Code into a Relentless Improvement Engine
"Set the GOAL → Claude runs the LOOP → You wake up to results"
What if your AI could autonomously improve anything measurable—code, content, metrics, processes—without you babysitting? Claude Autoresearch (3.1k ⭐) makes this reality using Karpathy's proven formula: constraint + mechanical metric + autonomous iteration = compounding gains.
From 630 Lines of Python to Claude's Universal Loop
Karpathy showed a simple Python script could run 100 ML experiments overnight. Claude Autoresearch generalizes this to ANY domain:
- Code: Test coverage → 90%, bundle size → 50% smaller
- Performance: API p95 → <100ms
- Security: Autonomous STRIDE + OWASP audits
- Shipping: Universal PR/deployment/content workflows
- Docs: Auto-generate/update/validate documentation
The 8-Phase Autonomous Loop
LOOP (FOREVER):
1. Review git history + results log
2. Pick ONE focused change
3. Git commit (experiment: prefix)
4. Run mechanical verification
5. IMPROVED → keep | WORSE → revert
6. Log TSV results
7. Repeat
8 Critical Rules ensure relentless progress: - One change per iteration (atomic) - Mechanical verification only (no subjectivity) - Auto-rollback failures - Git as memory - Simplicity wins (less code = better)
10 Battle-Tested Commands
| Command | Use Case |
|---|---|
/autoresearch |
Core optimization loop |
/autoresearch:plan |
Goal → config wizard |
/autoresearch:security |
Autonomous security audit |
/autoresearch:ship |
Ship PRs/deployments/content |
/autoresearch:debug |
Hunt ALL bugs scientifically |
/autoresearch:fix |
Crush errors until zero remain |
/autoresearch:scenario |
Explore 12 dimensions of edge cases |
/autoresearch:predict |
5-expert swarm analysis |
/autoresearch:learn |
Autonomous docs engine |
/autoresearch:reason |
Adversarial refinement (v1.9.0) |
Install in 30 Seconds
Plugin (Recommended):
/plugin marketplace add uditgoenka/autoresearch
/plugin install autoresearch@autoresearch
First Run:
/autoresearch
Goal: Increase test coverage from 72% to 90%
Scope: src/**/*.test.ts
Verify: npm test -- --coverage | grep "All files"
Walk away. Claude iterates autonomously. Every improvement stacks.
Real-World Power: Command Chains
# Full quality pipeline
/autoresearch:reason --chain predict,scenario,debug,fix
# Security → Ship
/autoresearch:security --fix --chain ship
# Docs after changes
/autoresearch:learn --mode update
Guard: Regression Protection
Goal: Reduce API response <100ms
Verify: npm run bench:api | grep "p95"
Guard: npm test # Safety net
Metrics improve + tests pass = keep. Anything breaks = rework.
Why It Works (The Science)
- Atomic changes → Clear cause/effect
- Git memory → Learns from every experiment
- Mechanical gates → No human bias
- Unbounded iteration → Compounding gains
- Auto-rollback → Never worse than start
Domains Beyond Code
- Marketing: CTR → 3x, conversion rate ↑
- Sales: Email open rates, reply rates
- Content: Engagement scores, readability
- HR: Policy compliance metrics
- DevOps: Deployment success rate
TSV Results Tracking:
iteration commit metric delta status
0 a1b2c3d 85.2 0.0 baseline
1 b2c3d4e 87.1 +1.9 keep
3 c3d4e5f 88.3 +1.2 keep
🚀 Get Started Today
Install Claude Autoresearch and experience autonomous improvement. No AGI needed—just goals, metrics, and loops that never quit.
Creators: Udit Goenka (AI Product Expert) + contributors. MIT License. 127 commits. v1.9.0 released Apr 2026.