One Founder. Eleven Specialists. Unlimited Potential.
If AGENT-11 has saved you time or helped you ship faster, consider ⭐ starring the repo or buying me a coffee ☕ to support development!
AGENT-11 deploys 11 specialized AI agents to your project, orchestrating them through proven workflows to build production-ready software. Think of it as hiring an elite development team - except they're AI agents working together seamlessly.
Instead of prompting Claude Code for every task, AGENT-11 provides:
- 🎯 Specialized Agents - Each with domain expertise (requirements, architecture, coding, testing, design, etc.)
- 🎖️ Coordinated Workflows - 20 pre-built missions that orchestrate multiple agents automatically
- 🧠 Persistent Context - Knowledge that survives across sessions and agent handoffs
- ✅ Quality Assurance - Built-in testing, verification, and security protocols
AGENT-11 is the technical execution arm of the BOS-AI ecosystem. BOS-AI handles strategic planning (30 business agents) → AGENT-11 handles software development (11 technical agents).
You can use AGENT-11 standalone without BOS-AI. → Complete BOS-AI Integration Guide
- Solo founders building products without technical teams
- Developers who want structured AI assistance with quality assurance
- Technical projects requiring multiple specialties (frontend, backend, DevOps, testing)
- Long-running projects needing context preservation across sessions
- Simple scripts or one-off tasks (use Claude Code directly)
- Projects requiring constant human oversight (AGENT-11 is autonomous)
- Non-technical work (use BOS-AI for business operations)
The Ultimate Proof: AGENT-11 Built by AGENT-11
The deployment system you just used was built by AGENT-11 itself—from concept to working system in under 1 day. Complete automated installation, agent orchestration, configuration management, and error handling. 98% success rate, <1 second deployment.
If AGENT-11 can build itself, it can build anything.
Your projects can include:
- SaaS applications with AI analysis, authentication, payment processing
- Marketplace platforms with OAuth, reviews, community features
- Web applications with Next.js, React, Tailwind CSS, modern stacks
- API services with REST/GraphQL endpoints, third-party integrations
- Business tools with analytics, automation, data pipelines
Time to MVP: 1-3 days with /coord mvp mission
→ See 7+ Real Production Projects
Proven Performance (v2.0 Modernization Results):
- 39% effectiveness improvement - Extended thinking + self-verification for better decisions
- 84% token reduction - Context editing + memory optimization enables 30+ hour missions
- Time to MVP: Traditional team (3-6 months) → AGENT-11 (2-4 weeks)
- Cost per Feature: Traditional ($10-50k) → AGENT-11 (<$500)
→ Complete Performance Metrics
Core Squad (4 agents):
- 🎯 Strategist - Product vision and requirements analysis
- 💻 Developer - Code implementation at light speed
- ✅ Tester - Zero bugs reach production (SENTINEL Mode)
- 🚀 Operator - Deploy with confidence
Specialist Squad (7 agents):
- 🏗️ Architect - Bulletproof technical decisions
- 🎨 Designer - Interfaces that convert (RECON Protocol)
- 📚 Documenter - Knowledge captured perfectly
- 💬 Support - Users become advocates
- 📊 Analyst - Data drives decisions
- 📈 Marketer - Growth on autopilot
- 🎖️ Coordinator - Mission commander (PARALLEL STRIKE)
New to AGENT-11? → Complete Getting Started Guide takes you from installation to first mission in under 10 minutes.
AGENT-11 requires a project context to deploy. Your directory needs at least ONE of:
- 🔧 Git repository (recommended):
git init - 📄 README file:
README.md - 📦 Package file:
package.json,requirements.txt, etc.
# Navigate to your project
cd /path/to/your/project
# Deploy full squad (recommended)
curl -sSL https://raw.githubusercontent.com/TheWayWithin/agent-11/main/project/deployment/scripts/install.sh | bash -s full
# Restart Claude Code
/exit
claudeSuccess indicator: You should see "✅ AGENT-11 deployed successfully" message
What gets installed:
- 11 specialized agent prompts in
.claude/agents/ - 20+ mission workflows
- Context management system
- Project templates
"Command not found" after install?
# Restart Claude Code completely
/exit
claude"Permission denied"?
chmod +x ./project/deployment/scripts/install.sh
./project/deployment/scripts/install.sh fullAgents not appearing?
# Check installation
ls -la .claude/agents/
# Should show 11 .md filesExpected output after /agents command:
Project agents (.claude/agents):
- coordinator.md
- strategist.md
- developer.md
- tester.md
- operator.md
- architect.md
- designer.md
- documenter.md
- support.md
- analyst.md
- marketer.md
Success indicator: You see 11 agents listed
If agents don't appear:
- Check
.claude/agents/directory exists:ls -la .claude/agents/ - Restart Claude Code:
/exitthenclaude - Re-run installation if needed
→ Complete troubleshooting guide
# List your agents
/agentsExpected output:
Project agents (.claude/agents):
- coordinator.md
- strategist.md
- developer.md
- tester.md
- operator.md
- architect.md
- designer.md
- documenter.md
- support.md
- analyst.md
- marketer.md
Success indicator: You see 11 agents listed
# Test your first specialist
@strategist What should we build first?Success indicator: Strategist responds with analysis and questions
For existing projects:
/coord dev-alignmentAnalyzes your codebase, creates documentation, sets up context system.
For new projects:
Step 1: Create your vision document
# Copy template
cp templates/mission-inputs/vision.md my-project-vision.md
# Edit with your details (example below)
nano my-project-vision.mdExample vision.md content:
# Project: Task Manager MVP
## Goal
Simple web-based task management for small teams
## Target Users
Small teams (3-10 people) needing basic task tracking
## Core Features (MVP)
1. Create, edit, delete tasks
2. Assign tasks to team members
3. Mark complete/incomplete
4. Basic search and filtering
5. Simple dashboard view
## Technical Preferences
- Frontend: React with TypeScript
- Backend: Node.js with Express
- Database: PostgreSQL
- Hosting: Vercel + Railway
## Timeline
2-day MVP for initial testing
## Success Criteria
- 5 team members can use simultaneously
- Tasks persist across sessions
- Mobile-responsive interfaceStep 2: Initialize project
/coord dev-setup my-project-vision.mdWhat happens (30-45 minutes):
- Strategist analyzes vision → creates requirements
- Architect designs system → creates architecture.md
- Developer sets up structure → initializes codebase
- Tester configures testing → sets up test framework
Expected deliverables:
architecture.md- System designproject-plan.md- Development roadmapCLAUDE.md- Project-specific instructions/memories/- Persistent knowledge- Basic project structure
Time: 30-45 minutes
Success indicator: Mission completes with project-plan.md and progress.md created
# Build a feature
/coord build requirements.md
# Fix a bug
/coord fix bug-report.md
# Create MVP
/coord mvp vision.mdCongratulations! You're now using AGENT-11.
Learn the most common AGENT-11 workflows with real examples, time estimates, and expected outcomes.
When to use: You have a product vision and need to ship fast
What you'll get: Production-ready MVP with auth, payments, testing, and deployment
Time estimate: 1-3 days depending on complexity
# Step 1: Create your vision document
/coord dev-setup ideation/vision.md
# Step 2: Build core features
/coord build ideation/requirements.md
# Step 3: Quality assurance
/coord test
# Step 4: Deploy to production
/coord deployWhat happens:
- Strategist analyzes vision → creates product requirements
- Architect designs system → creates architecture.md
- Developer implements features → writes code
- Tester validates quality → runs comprehensive tests
- Operator deploys safely → sets up staging + production
Deliverables: Working MVP, tests, documentation, deployment guides
Real example: LLM.txt Mastery built in 3 days
Cost estimate: $5-10 in API usage
Recovery Protocols:
MVP too complex?
@strategist "Review my MVP scope and identify what can be cut to ship faster"Tests failing?
/coord fix test-failures.mdNeed to add feature after MVP?
/coord build feature-requirements.mdVerify deliverables:
# Check these files were created:
ls architecture.md project-plan.md progress.md README.md
ls -la src/ tests/ deployment-config/When to use: Production bug that needs immediate attention
What you'll get: Root cause identified, fix implemented, tests added
Time estimate: 2-4 hours average
# Analyze and fix the issue
/coord fix bug-report.mdWhat happens:
- Coordinator reads bug report → creates fix plan
- Developer identifies root cause → implements fix
- Tester validates fix → ensures no regressions
- Documenter updates docs → prevents recurrence
Deliverables: Bug fix, regression tests, updated documentation
Cost estimate: $0.50-1.50 in API usage
Recovery Protocols:
Can't reproduce the bug?
@tester "Help me create a minimal reproduction case for this bug"Fix causes other issues?
/coord test # Run full test suite
@tester "Identify regressions from recent fix"Need emergency rollback?
@operator "Rollback to previous deployment"Verify deliverables:
# Confirm fix is deployed:
git log -1 # See commit
npm test # Tests passWhen to use: Need professional design audit before launch
What you'll get: Comprehensive UI/UX analysis with specific improvements
Time estimate: 1-2 hours for full site review
# Run design review
/design-reviewWhat happens: 7-phase systematic audit using RECON Protocol:
- Preparation - Environment setup, baseline screenshots
- Interaction Testing - User flows and micro-interactions
- Responsive Validation - Cross-device compatibility (mobile, tablet, desktop)
- Visual Polish - Typography, spacing, hierarchy
- Accessibility Audit - WCAG AA+ compliance, screen readers
- Robustness Testing - Edge cases, error states, loading states
- Performance Check - Load times, console errors
Deliverables:
- ✅ Issues classified by severity (BLOCKER/HIGH/MEDIUM/NITPICK)
- ✅ Screenshots and evidence for each issue
- ✅ Reproduction steps for developers
- ✅ Accessibility compliance report
Real example: "Identified 12 accessibility issues before launch, preventing legal compliance problems"
Cost estimate: $0.50-1 in API usage
Recovery Protocols:
Too many issues to fix?
@designer "Prioritize design issues by business impact"Need design implementation help?
@developer "Implement the HIGH priority design fixes"Accessibility issues found?
@designer "Create accessibility remediation plan"Verify deliverables:
# Check report was created:
ls design-review-report.md
# Review evidence:
ls evidence-repository.mdWhen to use: Building production-quality feature from requirements
What you'll get: Feature built, tested, documented, deployed
Time estimate: 4-8 hours for standard features
# Build new feature
/coord build requirements.mdWhat happens:
- @strategist analyzes requirements → user stories (30-45 min)
- @architect designs solution → technical approach (30-45 min)
- @developer implements → code + unit tests (2-4 hours)
- @tester validates → E2E tests and quality assurance (1 hour)
- @documenter documents → API docs, user guides (optional, 1 hour)
Deliverables:
- ✅ Production-ready feature code
- ✅ Comprehensive test coverage (unit + E2E)
- ✅ Documentation (API, user guide)
- ✅ Updated project-plan.md and progress.md
Real example: "Implemented payment processing with Stripe in 6 hours including tests and error handling"
Cost estimate: $1-3 in API usage
Recovery Protocols:
Feature too complex?
@architect "Break this feature into smaller incremental phases"Implementation stuck?
@developer "Review current implementation and suggest alternative approach"Tests not passing?
@tester "Debug test failures and identify root cause"Verify deliverables:
# Check feature is complete:
git log --oneline -5 # Recent commits
npm test # Tests pass
ls src/features/ # New feature codeWhen to use: Before production, after security-related changes, periodic reviews
What you'll get: Security vulnerabilities identified and fixed
Time estimate: 2-3 hours for comprehensive audit
# Run security audit
/coord securityWhat happens:
- @architect reviews architecture → security assessment
- @developer audits code → vulnerability scan (OWASP Top 10)
- @tester runs security tests → penetration testing
- @operator reviews infrastructure → deployment security (CSP, CORS, HTTPS)
- @documenter logs → security documentation and compliance
Deliverables:
- ✅ Vulnerability assessment with severity ratings
- ✅ Security fixes implemented and tested
- ✅ Compliance validation (GDPR, SOC2, etc.)
- ✅ Security documentation and prevention strategies
Real example: "Identified and fixed XSS vulnerability before launch, prevented potential data breach"
Cost estimate: $2-4 in API usage
Recovery Protocols:
Critical vulnerabilities found?
@developer "Implement immediate patches for CRITICAL security issues"
@operator "Review production security posture"False positives in scan?
@architect "Review security findings and filter false positives"Need compliance documentation?
@documenter "Create security compliance documentation for [GDPR/SOC2/etc]"Verify deliverables:
# Check security is improved:
ls security-audit-report.md
npm audit # No critical vulnerabilities
grep -r "TODO.*security" src/ # All TODOs addressedWhen to use: Technical debt accumulated, code quality degraded, or preparing for major features
What you'll get: Clean, maintainable code with improved architecture and reduced complexity
Time estimate: 2-4 hours for standard refactoring
# Run refactoring mission
/coord refactorWhat happens:
- @architect audits codebase → identifies tech debt and patterns (30-45 min)
- @developer refactors code → improves structure, removes duplication (1-2 hours)
- @tester validates → ensures no regressions with comprehensive tests (30-45 min)
- @documenter updates → reflects architectural changes in docs (optional, 30 min)
Deliverables:
- ✅ Refactored codebase with improved architecture
- ✅ Reduced complexity and code duplication
- ✅ Updated tests ensuring no functional changes
- ✅ Technical debt documentation and justification
Real example: "Refactored authentication system from 800 lines to 400 lines, improved testability by 60%"
Cost estimate: $0.75-2 in API usage
Recovery Protocols:
Refactoring breaks functionality?
git revert HEAD # Roll back changes
@tester "Identify which refactoring caused regression"
@developer "Apply refactoring incrementally with tests"Too much tech debt to tackle?
@architect "Prioritize refactoring by business impact and risk"
@strategist "Create phased refactoring plan over multiple sprints"Tests insufficient for safe refactoring?
@tester "Create characterization tests before refactoring"
@developer "Add test coverage to critical paths first"Verify deliverables:
# Check refactoring is safe:
npm test # All tests pass
git diff --stat # Review changes
npm run lint # Code quality improved
grep -r "TODO" src/ # Tech debt items addressedWhen to use: Slow load times, high resource usage, scalability bottlenecks, or poor user experience
What you'll get: Faster application with reduced resource usage and improved scalability
Time estimate: 3-6 hours for comprehensive optimization
# Run performance optimization
/coord optimizeWhat happens:
- @analyst benchmarks performance → identifies bottlenecks (45-60 min)
- @architect reviews architecture → proposes optimization strategies (30-45 min)
- @developer implements optimizations → code, queries, caching (1-3 hours)
- @tester validates performance → benchmarks before/after (30-45 min)
- @operator tunes infrastructure → server config, CDN, compression (30 min)
Deliverables:
- ✅ Performance benchmarks (before/after metrics)
- ✅ Optimized code with faster execution
- ✅ Database query optimization and indexing
- ✅ Caching strategy implementation
- ✅ Infrastructure tuning recommendations
Real example: "Reduced page load time from 4.2s to 1.1s (74% improvement), decreased server costs by 40%"
Cost estimate: $1.50-3 in API usage
Recovery Protocols:
Optimization breaks functionality?
@tester "Identify which optimization caused issue"
@developer "Roll back specific optimization, apply fix"Can't identify bottleneck?
@analyst "Set up comprehensive performance monitoring"
@developer "Add profiling and instrumentation to critical paths"Optimizations show minimal improvement?
@architect "Review architecture for fundamental scalability issues"
@operator "Analyze infrastructure constraints (CPU, memory, network)"Need to prove ROI?
@analyst "Generate performance report with business impact metrics"
# Metrics: load time reduction, bounce rate, conversion improvementVerify deliverables:
# Check performance improvements:
npm run benchmark # Compare before/after
ls performance-report.md # Review metrics
npm test # Functionality preserved
# Check specific metrics:
# - Page load time < 2s
# - Time to First Byte < 200ms
# - Lighthouse score > 90When to use: Ready to ship features, security patches deployed, or scheduled releases
What you'll get: Safe, verified production deployment with rollback capability
Time estimate: 1-2 hours for standard deployment
# Deploy to production
/coord deployWhat happens:
- @tester runs pre-deployment checks → full test suite (15-20 min)
- @operator prepares deployment → backup, staging validation (15-20 min)
- @operator deploys to production → zero-downtime deployment (10-15 min)
- @tester validates production → smoke tests, monitoring (15-20 min)
- @documenter logs deployment → changelog, rollback procedures (10-15 min)
Deliverables:
- ✅ Production deployment with zero downtime
- ✅ Database backup and rollback procedures
- ✅ Smoke test results confirming functionality
- ✅ Monitoring dashboards and alerting configured
- ✅ Deployment documentation and changelog
Real example: "Deployed payment system update to 10,000 active users with zero downtime, 0% error rate"
Cost estimate: $0.50-1 in API usage
Recovery Protocols:
Deployment fails mid-process?
@operator "Execute rollback procedure immediately"
@developer "Investigate deployment failure in staging"Production errors after deployment?
@operator "Check error rates and logs"
# If critical (>5% error rate):
@operator "Rollback to previous version immediately"
# If minor (<1% error rate):
@developer "Prepare hotfix for next deployment window"Database migration issues?
@operator "Verify database backup exists and is restorable"
@developer "Test migration rollback procedure in staging"Need emergency hotfix?
@developer "Create hotfix branch from production tag"
@tester "Run critical path tests only (15 min max)"
@operator "Deploy hotfix with expedited process"Verify deliverables:
# Check deployment success:
curl https://your-app.com/health # Health check passes
@operator "Verify monitoring shows no errors"
git tag -l # Deployment tagged
ls deployment-report.md # Deployment documented
# Critical checks:
# - Error rate < 0.1%
# - Response time < 500ms
# - All critical paths functional
# - Database migrations applied
# - Rollback tested and readyAGENT-11 includes 20 pre-built missions covering every development need:
Setup & Initialization:
- DEV-SETUP - New project initialization (30-45 min)
- DEV-ALIGNMENT - Understand existing project (45-60 min)
- CONNECT-MCP - Setup external integrations (45-90 min)
Development Operations:
- REFACTOR - Code quality improvement (2-4 hours)
- DEPLOY - Production deployment (1-2 hours)
- DOCUMENT - Documentation creation (2-4 hours)
- MIGRATE - System migration (4-8 hours)
Performance & Quality:
- OPTIMIZE - Performance tuning (3-6 hours)
- INTEGRATE - Third-party APIs (3-6 hours)
- RELEASE - Release management (2-4 hours)
→ See all 20 missions with commands
→ Mission execution guide - Complete execution manual
Beyond basic installation, configure advanced features for production readiness.
Automatic setup during missions. Manual: @tester "Set up testing infrastructure"
Running out of context during long missions? The MCP Profile System solves this by loading only the tools you need.
Instead of loading all 8 MCPs every session (15,000 tokens of context), profiles load only relevant MCPs for your task - giving you 40-80% more space for code and conversation.
Why Use Profiles:
- ✅ Longer missions - 40-80% more context for code and conversation
- ✅ Faster responses - Load only what you need
- ✅ Task-appropriate tools - Testing profile for tests, deployment for deploys
- ✅ Production safety - Read-only database profiles prevent accidents
What are profiles? Pre-configured sets of MCP servers for different tasks. Instead of loading all 8 servers, load just the 3-5 you need:
| Profile | MCPs | Context | Reduction | Use Case |
|---|---|---|---|---|
| core | 3 | 3,000 | 80% | General development |
| testing | 4 | 5,500 | 63% | Playwright automation |
| database-staging | 4 | 8,000 | 47% | Database development |
| database-production | 4 | 8,000 | 47% | Production queries (read-only) |
| payments | 4 | 7,000 | 53% | Stripe integration |
| deployment | 5 | 9,000 | 40% | Netlify + Railway |
| fullstack | 8 | 15,000 | 0% | All development MCPs |
Start with core - works for 90% of development. Switch to specialized profiles only when needed.
Quick Setup:
Start with the lightweight core profile (80% less context):
/mcp-switch coreThen restart Claude Code when prompted.
See all profiles: /mcp-list | Check current profile: /mcp-status
Need full setup? → Complete MCP Setup Guide covers MCP installation, API keys, and all 7 profiles.
Documentation:
- 📖 MCP Setup Guide - Complete setup and usage instructions
- 📋 Profile Reference - Detailed profile documentation
- 🔧 Troubleshooting - Common issues and solutions
New: /coord dev-setup ideation.md (30-45 min) | Existing: /coord dev-alignment (45-60 min)
Understanding the architecture helps you maximize effectiveness through visual diagrams and clear explanations.
graph TB
subgraph User["👤 User Layer"]
U[Developer/Founder]
end
subgraph Mission["🎯 Mission Layer"]
CMD["/coord Command"]
M1["20 Pre-built Missions"]
M2["Custom Workflows"]
end
subgraph Coord["🎖️ Coordination Layer"]
C[Coordinator Agent]
PP["project-plan.md<br/>(Planning)"]
AC["agent-context.md<br/>(Knowledge)"]
HN["handoff-notes.md<br/>(Handoffs)"]
PR["progress.md<br/>(History)"]
end
subgraph Agents["🤖 Specialist Layer"]
S[Strategist]
A[Architect]
D[Developer]
T[Tester]
O[Operator]
Des[Designer]
Doc[Documenter]
Sup[Support]
Ana[Analyst]
Mar[Marketer]
end
subgraph Tools["🔧 Tool Layer"]
R[Read/Write/Edit]
B[Bash/Git]
MCP["MCP Integrations<br/>(GitHub, Playwright, etc.)"]
Mem["Memory API<br/>(/memories/)"]
end
U --> CMD
CMD --> M1
CMD --> M2
M1 --> C
M2 --> C
C --> PP
C --> AC
C --> HN
C --> PR
C --> S
C --> A
C --> D
C --> T
C --> O
C --> Des
C --> Doc
C --> Sup
C --> Ana
C --> Mar
S --> R
A --> R
D --> R
D --> B
T --> R
O --> B
O --> MCP
Des --> MCP
Doc --> R
S --> Mem
A --> Mem
D --> Mem
style U fill:#e1f5ff
style CMD fill:#fff4e6
style C fill:#f3e5f5
style S fill:#e8f5e9
style A fill:#e8f5e9
style D fill:#e8f5e9
style T fill:#e8f5e9
style O fill:#e8f5e9
style Des fill:#e8f5e9
style Doc fill:#e8f5e9
style Sup fill:#e8f5e9
style Ana fill:#e8f5e9
style Mar fill:#e8f5e9
style Mem fill:#fffde7
Three-Layer Architecture Explained:
- Mission Layer - 20 pre-built missions + custom workflows via
/coord - Coordination Layer - Context preservation, parallel execution, dependency resolution
- Specialist Layer - 11 agents with domain expertise, tool permissions, extended thinking, self-verification
How specialists work together with zero context loss:
sequenceDiagram
participant U as 👤 User
participant C as 🎖️ Coordinator
participant S as 🎯 Strategist
participant A as 🏗️ Architect
participant D as 💻 Developer
participant T as ✅ Tester
participant AC as 📝 agent-context.md
participant HN as 🤝 handoff-notes.md
U->>C: /coord build requirements.md
C->>AC: Read mission context
C->>HN: Initialize handoff system
Note over C: Phase 1: Analysis
C->>S: Analyze requirements
S->>AC: Read existing context
S->>S: Create user stories (30-45 min)
S->>HN: Write findings for Architect
S->>AC: Update with decisions
Note over C: Phase 2: Design
C->>A: Design solution
A->>AC: Read strategist findings
A->>HN: Read specific handoff
A->>A: Create architecture (30-45 min)
A->>HN: Write specs for Developer
A->>AC: Update with design decisions
Note over C: Phase 3: Implementation
C->>D: Implement feature
D->>AC: Read all prior context
D->>HN: Read architect specs
D->>D: Write code + tests (2-4 hours)
D->>HN: Document implementation
D->>AC: Update with code decisions
Note over C: Phase 4: Validation
C->>T: Validate quality
T->>AC: Read full mission context
T->>HN: Read developer notes
T->>T: Run comprehensive tests (1 hour)
T->>HN: Report test results
T->>AC: Final quality assessment
C->>U: Mission complete ✅
Note over AC,HN: Zero context loss!<br/>100% knowledge preserved
Key Collaboration Principles:
- Sequential Handoffs - Each agent reads context from previous agents via handoff-notes.md
- Cumulative Knowledge - All decisions stored in agent-context.md for mission-wide awareness
- Zero Loss - Every finding, decision, and issue preserved across the entire mission
How AGENT-11 preserves 100% of knowledge across sessions and agent handoffs:
flowchart TD
subgraph Input["📥 Mission Input"]
MI["/coord build requirements.md"]
end
subgraph Context["🧠 Context Files"]
AC["agent-context.md<br/>━━━━━━━━━━━<br/>• Mission objectives<br/>• All findings (cumulative)<br/>• Technical decisions<br/>• Known issues<br/>• Dependencies"]
HN["handoff-notes.md<br/>━━━━━━━━━━━<br/>• Immediate task<br/>• Critical context<br/>• Warnings/blockers<br/>• Specific instructions<br/>• Test results"]
ER["evidence-repository.md<br/>━━━━━━━━━━━<br/>• Screenshots<br/>• Code snippets<br/>• Test results<br/>• API responses<br/>• Error logs"]
end
subgraph Memory["💾 Persistent Memory"]
MEM["/memories/<br/>━━━━━━━━━━━<br/>project/<br/> - overview.xml<br/> - architecture.xml<br/>technical/<br/> - patterns.xml<br/> - decisions.xml<br/>lessons/<br/> - insights.xml"]
end
subgraph Agents["🤖 Agents"]
A1[Agent reads context]
A2[Agent performs work]
A3[Agent updates context]
end
subgraph Output["📤 Deliverables"]
PP["project-plan.md<br/>(What we're doing)"]
PR["progress.md<br/>(What we did)"]
CODE["Code, Tests, Docs"]
end
MI --> AC
MI --> HN
AC --> A1
HN --> A1
MEM --> A1
A1 --> A2
A2 --> A3
A3 --> AC
A3 --> HN
A3 --> ER
A3 --> MEM
AC --> PP
AC --> PR
HN --> PP
HN --> PR
A2 --> CODE
style AC fill:#e3f2fd
style HN fill:#f3e5f5
style ER fill:#fff3e0
style MEM fill:#e8f5e9
style A1 fill:#fffde7
style A2 fill:#fffde7
style A3 fill:#fffde7
Context Preservation Protocol:
- Before Task - Agent reads agent-context.md + handoff-notes.md + /memories/
- During Task - Agent maintains awareness of all prior decisions
- After Task - Agent updates handoff-notes.md with findings for next specialist
- Coordinator - Merges findings into agent-context.md for mission-wide knowledge
Result: Zero context loss, 100% knowledge retention across sessions
→ Complete context management guide
From command to deliverables - the complete mission workflow:
stateDiagram-v2
[*] --> Planning: /coord build requirements.md
Planning --> Analysis: Create project-plan.md
note right of Planning
Coordinator reads mission
Creates execution plan
Initializes context files
end note
Analysis --> Design: Strategist completes
note right of Analysis
Requirements analysis
User story creation
Success criteria defined
30-45 minutes
end note
Design --> Implementation: Architect completes
note right of Design
System design
Technology selection
Architecture documentation
30-45 minutes
end note
Implementation --> Testing: Developer completes
note right of Implementation
Code implementation
Unit test creation
Documentation updates
2-4 hours
end note
Testing --> Verification: Tests pass
note right of Testing
E2E test execution
Quality validation
Regression checks
1 hour
end note
Verification --> Complete: All verified
note right of Verification
Self-verification protocols
Deliverable confirmation
Progress documentation
15 minutes
end note
Testing --> Implementation: Tests fail
Implementation --> Implementation: Bugs found
Complete --> [*]: Mission success ✅
note right of Complete
progress.md updated
Learnings stored in /memories/
Ready for next mission
end note
Mission Phases Explained:
- Planning (5-10 min) - Coordinator reads mission, creates project-plan.md with phases
- Analysis (30-45 min) - Strategist analyzes requirements, creates user stories
- Design (30-45 min) - Architect designs system, selects technologies
- Implementation (2-4 hours) - Developer writes code, creates tests
- Testing (1 hour) - Tester validates quality, runs comprehensive test suite
- Verification (15 min) - Self-verification protocols ensure completeness
Total Time: 4-8 hours for standard feature development
Context Preservation: Zero loss via agent-context.md + handoff-notes.md + /memories/. Extended Thinking: Ultrathink/Think harder/Think modes (39% effectiveness improvement). Tool Permissions: Least-privilege security model (64% read-only agents). Parallel Execution: Independent tasks run simultaneously for speed.
→ Memory Management | → Extended Thinking | → Tool Permissions
Workflow: BOS-AI (strategy) → AGENT-11 (execution)
# 1. Copy BOS-AI outputs to ideation/
# 2. Initialize: /coord dev-setup ideation/PRD.md
# 3. Build: /coord build ideation/PRD.md→ Complete BOS-AI Integration Guide
Complete overview of AGENT-11's capabilities organized by category.
100% knowledge retention with native memory API, zero-loss context preservation (agent-context.md, handoff-notes.md), and strategic /clear usage (84% token reduction enabling 30+ hour missions).
→ Memory Management | → Context Editing
Dual-file tracking (project-plan.md, progress.md with complete fix history), 20 pre-built mission workflows via /coord, automatic greenfield/brownfield bootstrap with CLAUDE.md generation.
→ Progress Tracking | → Bootstrap Guide | → Mission Library
SENTINEL Mode with separation of duties (zero bugs reach production), self-verification protocols (50% rework reduction), comprehensive testing (mcp__playwright, cross-browser, WCAG AA+).
→ Testing Guide | → Enhanced Prompting
Extended thinking modes (Ultrathink/Think harder/Think hard/Think - 39% effectiveness improvement), 15+ MCP integrations (GitHub, Playwright, Context7, Firecrawl, Supabase), RECON Protocol design reviews with 7-phase audit, OpsDev workflow with staging validation.
→ Extended Thinking | → Tool Permissions | → UI Doctrine
| Capability | Measurement | Impact |
|---|---|---|
| Agent Effectiveness | 39% improvement | Extended thinking + self-verification |
| Token Efficiency | 84% reduction | Context editing + memory optimization |
| Autonomous Operation | 30+ hours | Multi-day missions without intervention |
| Rework Reduction | 50% fewer errors | Pre-handoff verification catches issues |
| Context Retention | 100% persistence | Zero knowledge loss across sessions |
| Security Model | 64% read-only agents | Least-privilege tool permissions |
| Time to MVP | 2-4 weeks | vs 3-6 months traditional |
| Cost per Feature | <$500 | vs $10-50k traditional |
→ Complete Performance Analysis
AGENT-11 is powerful but not unlimited. Understanding these constraints helps you work effectively within the system.
- Issue: Projects with >50 files may exceed context window during analysis
- Workaround: Break work into modules, use phased approach with multiple missions
- Command:
@coordinator "Focus only on [specific module/feature]"
- Issue: Some packages require manual configuration (native bindings, platform-specific tools)
- Workaround: Let agents set up what they can, complete manual steps with provided instructions
- Example: Database setup, API key configuration, external service accounts
- Issue: If mission fails mid-way, context may be partially lost
- Workaround: Check progress.md for completed work, resume from last checkpoint
- Command:
@coordinator "Resume from last checkpoint in progress.md"
- Issue: Claude API rate limits may pause long-running missions (>2 hours)
- Workaround: Break into smaller phases, use
/clearbetween phases to reset context - Recovery: Wait 1-5 minutes, then continue with next phase
- Issue: Very large files (>2000 lines) or long conversations exceed context window
- Workaround: Use Glob/Grep for targeted searches instead of reading entire files
- Command:
@coordinator "Search for [specific pattern] instead of reading whole file"
- Issue: Some JavaScript-heavy sites may not work with Playwright automation
- Workaround: Use manual testing for complex UI, automated for standard paths
- When: Design reviews of SPAs, PWAs, or sites with heavy client-side rendering
- Issue: Some commands work differently on Windows vs Mac vs Linux
- Workaround: Use cross-platform alternatives (npm scripts vs shell scripts)
- Example: Prefer
npxover direct binary calls, Node.js scripts over bash
- Issue: Some tasks don't fit cleanly into one agent's role
- Workaround: Use
/coordmissions for cross-functional work requiring multiple agents - Example: Security audit needs @architect + @developer + @tester coordination
- Issue: First-time users need 1-2 hours to understand agent system and workflows
- Workaround: Start simple - use
/meetingfor conversations, progress to single @agent commands, then to/coordmissions - Progression: Individual agents → simple missions → complex orchestration
- Issue: No real-time collaboration between multiple humans on same project
- Workaround: Use git branches for parallel work, merge through standard PR process
- Best Practice: One person runs missions, team reviews results via PR
- Issue: Requires active internet connection for Claude API access
- Workaround: Work offline on manual tasks, run missions when connection available
- Impact: No offline development assistance
- Issue: API usage costs vary by mission complexity ($0.50-$10 per mission)
- Workaround: Start with smaller missions to estimate costs, use
/coordefficiently - Monitoring: Check API usage in Claude dashboard, set budget alerts
→ Complete capabilities and limitations guide
3,750+ lines of comprehensive guides: Memory Management | Bootstrap | Context Editing | Extended Thinking | Tool Permissions | Enhanced Prompting
Quick Recovery: /clear + @coordinator "I'm stuck" | System Check: /agents, ls .claude/agents/ | Test: @developer "Create hello world HTML" | Escalation: Troubleshooting | @support | GitHub Issues | Discord
- Project Deployment | Update Installation | MCP Integration | MCP Troubleshooting | Greenfield | Brownfield
- Memory Management | Bootstrap Guide | Context Editing | Extended Thinking | Tool Permissions | Enhanced Prompting
@support| Troubleshooting | Success Stories | GitHub Issues | Discord
AGENT-11 uses a dual-file tracking system: project-plan.md (forward-looking roadmap) and progress.md (backward-looking changelog with complete issue history including all fix attempts). This approach enables 10-40x faster resolution of similar future issues.
→ Complete Progress Tracking Guide
Three-tier cleanup strategy prevents context pollution: Milestone Transition (every 2-4 weeks), Project Completion (when shipping), or Continue Active Work. Lessons extracted to searchable repository while keeping active context lean (200 lines vs 20,000+).
→ Complete Lifecycle Management Guide | → Cleanup Checklist
Separation of duties: Tester analyzes (read-only), Developer implements tests, Tester validates. SENTINEL Mode runs 7-phase protocol for critical releases. Zero bugs reach production with 80%+ coverage on critical paths.
Pattern: /coord [mission] [input-file.md] or @agent for direct access. Examples: /coord build requirements.md | /coord fix bug-report.md | /coord mvp vision.md
→ Mission Execution Guide | → Mission Library
AGENT-11 provides 6 powerful slash commands for different workflows:
Orchestrate multi-agent missions with automatic specialist coordination
# Pattern: /coord [mission] [input-file.md]
/coord build requirements.md # Feature development (4-8 hours)
/coord fix bug-report.md # Bug resolution (1-3 hours)
/coord mvp vision.md # Build MVP (1-3 days)
/coord dev-setup ideation.md # Project initialization (30-45 min)
/coord dev-alignment # Analyze existing project (45-60 min)What it does:
- Automatically delegates to appropriate specialists
- Maintains context across agent handoffs
- Updates project-plan.md and progress.md
- Ensures quality through built-in verification
Have natural, conversational discussions with specialists for brainstorming and planning
# Pattern: /meeting @agent [topic]
/meeting @strategist # Product strategy discussion
/meeting @architect "microservices vs monolith" # Architecture consultation
/meeting @designer "improving user onboarding" # Design brainstorming
/meeting @developer "real-time data synchronization" # Technical problem-solvingWhen to use:
- Brainstorming new features or approaches
- Getting expert advice before formal missions
- Exploring technical decisions and trade-offs
- Creative collaboration and idea generation
Available specialists: @strategist, @architect, @coordinator, @developer, @tester, @designer, @operator, @marketer, @analyst, @documenter, @support
Comprehensive design audit using RECON Protocol with automated testing
/design-review # Full 7-phase UI/UX audit with evidenceWhat it analyzes (7-phase RECON Protocol):
- Preparation - Environment setup, baseline screenshots
- Interaction - User flows, micro-interactions, navigation
- Responsive - Mobile, tablet, desktop compatibility
- Visual - Typography, spacing, hierarchy, consistency
- Accessibility - WCAG AA+ compliance, screen readers
- Robustness - Edge cases, error states, loading
- Performance - Load times, console errors
Deliverables:
- Issues classified by severity (BLOCKER/HIGH/MEDIUM/NITPICK)
- Screenshots and reproduction steps
- Accessibility compliance report
- Specific recommendations with examples
Quick design reconnaissance for understanding existing UI/UX patterns
/recon # Rapid design analysis and pattern identificationWhat it does:
- Analyzes current design system and patterns
- Identifies UI/UX strengths and weaknesses
- Documents design decisions and rationale
- Provides strategic design recommendations
Use when: Need quick design assessment before major changes or starting new features
Generate stakeholder-ready progress reports automatically
# Pattern: /report [since_date]
/report # Last 7 days (default)
/report 2025-08-20 # Since specific dateReport includes:
- Executive summary of progress
- Completed tasks with business impact
- Issues encountered and resolutions
- Current project status and health
- Key metrics and trends
- Next milestones and resource needs
- BOS-AI alignment (if integrated)
Output: Creates progress-report.md with professional formatting suitable for stakeholders, clients, or management
Systematic root cause analysis of failures to prevent recurrence
# Pattern: /pmd [issue_description]
/pmd # Analyze recent failures from progress.md
/pmd "Coordinator not using Task tool" # Analyze specific issue
/pmd "Installation failing on Windows" # Analyze deployment failureAnalysis covers:
- Agent Performance - Prompt clarity, scope compliance, tool usage
- Documentation Quality - CLAUDE.md accuracy, task definitions
- Tool Usage - MCP prioritization, error handling
- Process Issues - Planning, communication, testing coverage
Deliverables:
- Timeline of events leading to failure
- Root cause analysis with evidence
- Immediate fixes (do now)
- Short-term improvements (this week)
- Long-term enhancements (this month)
- Prevention strategies and detection mechanisms
Output: Creates post-mortem-analysis.md with actionable recommendations
| Command | Purpose | Duration | Output | Best For |
|---|---|---|---|---|
/coord |
Execute missions with specialists | 1-8 hours | Code, tests, docs | Building, fixing, deploying |
/meeting |
Strategic conversations | 15-60 min | Discussion, ideas | Brainstorming, planning |
/design-review |
Full UI/UX audit | 1-2 hours | Design report | Pre-launch quality checks |
/recon |
Quick design analysis | 30-45 min | Design insights | Understanding existing UI |
/report |
Stakeholder updates | 5-10 min | Progress report | Communication, tracking |
/pmd |
Failure analysis | 30-60 min | Root cause analysis | Learning from mistakes |
"Each project gets its own elite team that understands the codebase perfectly. Game changer." — Alex Chen, Solo Founder ($15k MRR)
"Project-local agents mean no confusion, no context switching. They just get it." — Sarah Martinez, Technical Founder
"Clean deployments, isolated teams per project. Finally, AI that scales with my workflow." — David Kim, Multi-Project Founder
"The v2.0 memory system is incredible - agents remember decisions from days ago. No more context loss." — AGENT-11 Beta Tester
- 🚀 DEV-SETUP - Greenfield project initialization (30-45 min)
- 🎯 DEV-ALIGNMENT - Existing project understanding (45-60 min)
- ⚙️ OPSDEV-SETUP - DevOps & environment configuration (20-30 min)
- 📋 CLAUDE-SETUP - CLAUDE.md creation & sync (15-25 min)
- 🔌 CONNECT-MCP - MCP discovery and connection (45-90 min)
- 🏗️ BUILD - New feature development (4-8 hours)
- 🐛 FIX - Emergency bug resolution (1-3 hours)
- ♻️ REFACTOR - Code quality improvement (2-4 hours)
- 🚀 DEPLOY - Production deployment (1-2 hours)
- 📚 DOCUMENT - Documentation creation (2-4 hours)
- 📐 ARCHITECTURE - System architecture documentation (2-3 hours)
- 📋 PRODUCT-DESCRIPTION - Product definition with risk management (2-3 hours)
- 💡 MVP - Minimum viable product (1-3 days)
- 🔄 MIGRATE - System migration (4-8 hours)
- ⚡ OPTIMIZE - Performance optimization (3-6 hours)
- 🔒 SECURITY - Security audit & fixes (4-6 hours)
- 🔌 INTEGRATE - Third-party integration (3-6 hours)
- 🎯 RELEASE - Release management (2-4 hours)
| Mission | Command | Input Required | Duration | Use Case |
|---|---|---|---|---|
| Setup Missions | ||||
| DEV-SETUP | /coord dev-setup ideation.md |
✅ Ideation | 30-45 min | New project initialization |
| DEV-ALIGNMENT | /coord dev-alignment |
❌ None | 45-60 min | Understand existing project |
| CONNECT-MCP | /coord connect-mcp |
❌ None | 45-90 min | Setup external integrations |
| Core Development | ||||
| BUILD | /coord build requirements.md |
✅ Requirements | 3-8 hours | Feature development |
| MVP | /coord mvp vision.md |
✅ Vision | 6-12 hours | Minimum viable product |
| FIX | /coord fix bug-report.md |
✅ Bug Report | 1-4 hours | Bug resolution |
| REFACTOR | /coord refactor |
❌ None | 2-4 hours | Code improvement |
| DEPLOY | /coord deploy |
❌ None | 1-2 hours | Production deployment |
| DOCUMENT | /coord document |
❌ None | 2-4 hours | Documentation creation |
| MIGRATE | /coord migrate requirements.md |
✅ Requirements | 4-8 hours | System migration |
| Specialized Operations | ||||
| ARCHITECTURE | /coord architecture vision.md |
✅ Vision | 2-4 hours | System design |
| OPTIMIZE | /coord optimize |
❌ None | 3-6 hours | Performance tuning |
| SECURITY | /coord security |
❌ None | 4-6 hours | Security audit |
| INTEGRATE | /coord integrate requirements.md |
✅ Requirements | 3-6 hours | Third-party APIs |
| RELEASE | /coord release |
❌ None | 2-4 hours | Deployment prep |
| Business & Growth | ||||
| PRODUCT-DESCRIPTION | /coord product-description vision.md |
✅ Vision | 2-3 hours | Marketing copy |
| Operations | ||||
| GENESIS | /coord genesis |
❌ None | 1-2 hours | Project reconnaissance |
| RECON | /coord recon |
❌ None | 1-3 hours | Design intelligence |
Input File Templates: Available in /templates/mission-inputs/
Legend: ✅ Required input file | ❌ No input needed
Some missions benefit from specific MCP profiles for optimal performance and context efficiency:
- Testing missions (BUILD with E2E tests, QA phases) →
/mcp-switch testing(adds Playwright) - Deployment missions (DEPLOY, RELEASE) →
/mcp-switch deployment(Netlify + Railway access) - Database work (MIGRATE, schema changes) →
/mcp-switch database-stagingor/mcp-switch database-production - Full-stack development (complex features) →
/mcp-switch fullstack(all development MCPs) - General development (BUILD, FIX, REFACTOR) →
/mcp-switch core(lightweight, 80% less context)
Quick Commands:
- Switch profiles:
/mcp-switch [profile-name] - See all profiles:
/mcp-list - Check current:
/mcp-status
Already have AGENT-11? Get the latest features including the /coord mission system:
# Single command to update your project
cd /path/to/your/project
curl -sSL https://raw.githubusercontent.com/TheWayWithin/agent-11/main/project/deployment/scripts/install.sh | bash -s core🛡️ Your Custom CLAUDE.md is Safe! The installer:
- ✅ Never overwrites your existing CLAUDE.md file
- ✅ Creates
CLAUDE-AGENT11-TEMPLATE.mdwith latest AGENT-11 features - ✅ Creates automatic backup:
CLAUDE.md.backup-[timestamp] - ✅ Provides clear instructions for merging new capabilities
📋 Complete Update Guide → - Everything you need to know about updating
Complete Troubleshooting Guide →
AGENT-11 is a passion project that's 100% free and always will be. If this framework has:
- ⚡ Saved you weeks of development time
- 🚀 Helped you ship your MVP faster
- 💰 Eliminated the need for expensive contractors
Consider supporting its development:
Your support keeps AGENT-11 free, actively maintained, and growing with new capabilities!
Other ways to support:
- ⭐ Star the repo to help others discover it
- 📢 Share AGENT-11 with fellow founders and developers
- 🐛 Report bugs and suggest features
- 📝 Contribute to documentation
AGENT-11 is open source and community-driven. We welcome contributions from fellow operators.
MIT - Use AGENT-11 to build your empire.
Ready to deploy your project squad?
🚀 Project Setup · 📖 Quick Start · 📚 Full Documentation
"One project. One founder. Eleven specialists. Unlimited potential."