summaryrefslogtreecommitdiff
path: root/docs/research/claude-flow-research.md
diff options
context:
space:
mode:
Diffstat (limited to 'docs/research/claude-flow-research.md')
-rw-r--r--docs/research/claude-flow-research.md169
1 files changed, 169 insertions, 0 deletions
diff --git a/docs/research/claude-flow-research.md b/docs/research/claude-flow-research.md
new file mode 100644
index 0000000..479691a
--- /dev/null
+++ b/docs/research/claude-flow-research.md
@@ -0,0 +1,169 @@
+# Claude-Flow (Ruflo v3) Research Summary
+
+> Research conducted 2026-02-24 for makima improvement evaluation
+
+## Overview
+
+claude-flow (marketed as Ruflo v3) is an enterprise AI orchestration system built around Claude Code. It provides 175+ MCP tools, manages 60+ specialized agents, and has accumulated 5,923+ commits. Key performance claims: 84.8% SWE-Bench solve rate, 2.8-4.4x faster task completion vs baseline Claude Code.
+
+**Repository**: https://github.com/ruvnet/claude-flow
+
+## Architecture
+
+### Layered Design
+```
+User Layer: CLI + Claude Code interfaces
+Entry Layer: MCP Server with AIDefence security validation
+Routing Layer: Q-Learning router + MoE (8 experts) + 42 skills + 17 hooks
+Swarm Layer: Topologies (mesh/hierarchical/ring/star) + consensus
+Agent Layer: 60+ specialized agents
+Resources: Memory systems, LLM providers, 12 background workers
+Intelligence: RuVector with 10+ optimization components
+```
+
+### MCP Integration
+- Runs as stdio process providing 175+ tools
+- MCP 2025-11-25 full specification compliance
+- Supports tools, resources, prompts, and tasks
+- Multiple transports: stdio, HTTP, WebSocket, in-process
+
+## Multi-Agent Coordination (Hive Mind)
+
+### Queen Types (Coordinators)
+| Type | Role |
+|------|------|
+| Strategic | Planning and goal decomposition |
+| Tactical | Execution coordination |
+| Adaptive | Optimization and learning |
+
+### Worker Types (8 Specialized Roles)
+1. **Researcher** - Information gathering and analysis
+2. **Coder** - Implementation
+3. **Analyst** - Data analysis and insights
+4. **Tester** - Quality assurance
+5. **Architect** - System design
+6. **Reviewer** - Code review and quality gates
+7. **Optimizer** - Performance tuning
+8. **Documenter** - Documentation generation
+
+### Consensus Algorithms
+- **Byzantine** (f < n/3): 2/3 majority for decisions
+- **Weighted Voting**: Queen has 3x authority
+- **Majority Voting**: Simple democratic decisions
+
+## Task Routing & Scheduling
+
+### Q-Learning Router
+- Combined with MoE (8 experts)
+- 89% routing accuracy
+- 34,798 routes/s throughput
+- Learns which agents perform best per task type through execution trajectories
+
+### Three-Tier Routing Strategy
+| Tier | Handler | Latency | Cost |
+|------|---------|---------|------|
+| Simple | Agent Booster WASM | <1ms | $0 |
+| Medium | Haiku/Sonnet | ~500ms | Low |
+| Complex | Opus + multi-agent swarms | 2-5s | Standard |
+
+### Task Templates (Agent Combinations)
+| Task Type | Recommended Agents |
+|-----------|--------------------|
+| Bug Fix | Coordinator, Researcher, Coder, Tester |
+| Feature | Coordinator, Architect, Coder, Tester, Reviewer |
+| Refactor | Coordinator, Architect, Coder, Reviewer |
+| Performance | Coordinator, Perf-Engineer, Coder |
+| Security | Coordinator, Security-Architect, Auditor |
+
+## Self-Learning Mechanisms
+
+### SONA (Self-Optimizing Neural Architecture)
+- <0.05ms adaptation time
+- Rapid behavior adjustment at runtime
+- Two-tier LoRA + EWC++ + ReasoningBank integration
+
+### EWC++ (Elastic Weight Consolidation)
+- Preserves 95%+ knowledge across tasks
+- Prevents catastrophic forgetting
+
+### ReasoningBank
+- Pattern caching with RETRIEVE → JUDGE → DISTILL → CONSOLIDATE → ROUTE cycle
+- 32% token savings through pattern retrieval instead of full context
+- Stores successful execution trajectories for reuse
+
+### MicroLoRA
+- 128x compressed fine-tuning
+- No full retraining required
+- Lightweight runtime adaptation
+
+## Memory & Context Sharing
+
+### 3-Scope Architecture
+| Scope | Purpose |
+|-------|---------|
+| Project | Task-specific context |
+| Local | Machine/user patterns |
+| User | Cross-project learnings |
+
+### Storage Stack
+- **HNSW Vector Search**: 150x-12,500x faster retrieval, 16,400 QPS
+- **AgentDB**: SQLite with WAL for persistence
+- **LRU Cache**: Sub-millisecond access for hot data
+- **Knowledge Graph**: PageRank + community detection for insight ranking
+
+### 8 Memory Types
+Attention, episodic, procedural, semantic, + 4 additional types for comprehensive knowledge representation.
+
+## Drift Control
+
+Critical for multi-agent alignment:
+1. **Hierarchical Coordinator** validates all outputs against goals
+2. **Small Teams** (6-8 agents) reduce coordination overhead
+3. **Frequent Checkpoints** via post-task hooks verify compliance
+4. **Raft Consensus** maintains authoritative state
+5. **Specialized Roles** enforce clear task boundaries
+
+## Cost Optimization
+
+### Multi-Layer Strategy
+| Layer | Mechanism | Savings |
+|-------|-----------|---------|
+| 1 | Agent Booster WASM | Eliminates tokens entirely |
+| 2 | Haiku/Sonnet routing | 75% lower than Opus |
+| 3 | ReasoningBank | -32% token savings |
+| 4 | Token compression | 30-50% reduction |
+| 5 | Caching | 95% hit rate |
+| **Combined** | **All layers** | **Extends Claude Max 250%** |
+
+## Hook System
+
+33+ hooks across 7 categories:
+- **Session**: start, end
+- **Agent**: pre-spawn, post-spawn, pre-terminate
+- **Task**: pre-execute, post-complete, error
+- **Tool**: pre-call, post-call
+- **Memory**: store/retrieve operations
+- **Swarm**: coordination events
+- **File**: read/write operations
+
+Self-Learning Hooks feed execution insights back into the Q-Learning router.
+
+## Claims System (Human-Agent Coordination)
+- **Claim**: Agent requests task ownership
+- **Release**: Agent returns uncompleted work
+- **Handoff**: Human reassigns to different agent
+- Prevents duplicate effort and maintains clear responsibility
+
+## Fault Tolerance
+- Byzantine fault-tolerant (f < n/3, 2/3 majority)
+- 6 LLM provider failover (Claude, GPT, Gemini, etc.)
+- Checkpoint system prevents cascading failures
+- Persist/Restore/Export session management
+
+## Swarm Topologies
+| Topology | Structure | Best For |
+|----------|-----------|----------|
+| Hierarchical | Coordinator + workers | Structured coding tasks (0.20s, 256MB/agent) |
+| Mesh | Peer-to-peer | Collaborative, high redundancy |
+| Ring | Sequential chain | Pipeline processing |
+| Star | Hub-and-spoke | Centralized control |