Skip to content

Team Structure

Four Teams, One Manager

Team Purpose Agents
Research Theory, math, literature, proofs research-dev/rev
Dev Architecture + implementation architect-dev/rev, code-dev/rev
KB Documentation: cards, API docs, tex kb-dev/rev
QA Full regression, full benchmark benchmark

Plus shadow as a validation mode (not a standing team).

             User (co-plans, gates, reviews math)
               |
             Manager (owns kanban, coordinates)
               |
    +----------+----------+----------+
    |          |          |          |
 Research     Dev        KB         QA
  dev/rev   arch d/r   dev/rev   benchmark
            code d/r

Agent Roster

Research Team

Agent Role Type
research-developer Theory, math, literature, proofs Developer
research-reviewer Challenge research claims, verify proofs Reviewer

Research produces idea cards and tex drafts that land in KB as informal artifacts (no code backing yet).

Dev Team

Agent Role Type
architect-developer Design, abstraction layers, refactoring plans Developer
architect-reviewer Challenge architecture, check feasibility Reviewer
code-developer Implementation + smoke test Developer
code-reviewer Block on issues + small reg test with early stop Reviewer

KB Team

Agent Role Type
kb-developer Cards, API docs (Doxygen/Sphinx), math docs, index, fine-tune tex Developer
kb-reviewer Board view, scoring, staleness detection Reviewer

KB is the project's shared knowledge. All teams feed into it:

  • Research → idea cards, tex drafts
  • Dev → @code-change tags → formal cards + API docs
  • KB-dev fine-tunes tex after cards and API docs are done

QA Team

Agent Role Type
benchmark Full regression, full benchmark, performance reports Standalone

QA runs the full test suite and benchmark. Distinct from code-reviewer's small reg test with early stop.

Shadow (validation mode)

Agent Role Type
shadow-developer Reimplement modules from KB only (no source access) Developer
shadow-reviewer Compare reimplementation against original, score transfer Reviewer

Shadow is activated on demand, not always running.

Coordinator

Agent Role Type
manager Owns kanban, coordinates all teams, co-plans with user Coordinator

Accountability Chain

Every agent has a reviewer. The manager's reviewer is the user.

User (co-plans, gates every phase, reviews tex for math)
  └── Manager (owns kanban, proposes, reports)
        ├── Research team (dev/rev)
        ├── Dev team (arch dev/rev, code dev/rev)
        ├── KB team (dev/rev)
        ├── QA team (benchmark)
        └── Shadow mode (dev/rev, on demand)

Reviewers are blockers

Without this, developers prefer deliverables over safety and performance. Reviewers also run small regression tests (early stop). Only High-severity findings are hard blocks; Medium is "should fix" but doesn't block.

Decisions reserved for the user:

  • Phase transitions (user gates every phase)
  • Merge decisions (shadow blocks)
  • Baseline updates
  • Escalated disputes (medium/high severity)
  • Release gates
  • Scope changes mid-sprint
  • HALT and rollback decisions
  • Math review (tex documents)

Reviewer vs QA Testing

Code-reviewer (dev team) QA (benchmark)
Test set Small Full
Early stop Yes (fail fast) No (complete run)
When Every code change Pre-release / on demand
Blocks Merge to branch Release

Team Pairs

Manager:      user (co-plans, gates)   ←→  manager (owns kanban, reports)
Research:     research-developer       ←→  research-reviewer
Architecture: architect-developer      ←→  architect-reviewer
Code:         code-developer           ←→  code-reviewer
KB:           kb-developer             ←→  kb-reviewer
QA:           benchmark (standalone)
Shadow:       shadow-developer         ←→  shadow-reviewer

Routing

Request type Primary Reviewer
Prove / derive / survey research-dev research-rev
Where to implement / refactor architect-dev architect-rev
Implement / fix bug code-dev code-rev
Write card / annotate / fine-tune tex kb-dev kb-rev
Full benchmark / pre-release benchmark ---
Review KB coverage kb-rev ---
Validate KB transfer quality shadow-dev shadow-rev
Complex multi-team request manager decomposes all relevant

Escalation Policy

When a developer and reviewer disagree:

Severity Resolution
Minor (style, naming) Developer decides, reviewer notes it
Medium (design choice) Manager presents both sides to user; user decides
High (correctness, safety) Reviewer has veto. Must resolve before proceeding.

Warning

In adversarial environments, the default is the reviewer's position. The cost of missing a real problem outweighs the cost of an extra revision.

Owner Abbreviations

Short Agent
res-dev research-developer
res-rev research-reviewer
arch-dev architect-developer
arch-rev architect-reviewer
code-dev code-developer
code-rev code-reviewer
kb-dev kb-developer
kb-rev kb-reviewer
bench benchmark (QA)
shad-dev shadow-developer
shad-rev shadow-reviewer
mgr manager