[DOCS][Maker][Tests][Maker]

kyegomez · kyegomez · commit 38491830c091 · 2026-03-20T23:59:29.000-04:00
diff --git a/README.md b/README.md
@@ -242,6 +242,7 @@ This feature is perfect for rapid prototyping, complex task decomposition, and c
 | **[ForestSwarm](https://docs.swarms.world/en/latest/swarms/structs/forest_swarm/)** | Dynamically selects the most suitable agent or tree of agents for a given task. | Task routing, optimizing for expertise, and complex decision-making trees. |
 | **[HierarchicalSwarm](https://docs.swarms.world/en/latest/swarms/structs/hierarchical_swarm/)** | Orchestrates agents with a director who creates plans and distributes tasks to specialized worker agents. | Complex project management, team coordination, and hierarchical decision-making with feedback loops. |
 | **[HeavySwarm](https://docs.swarms.world/en/latest/swarms/structs/heavy_swarm/)** | Implements a five-phase workflow with specialized agents (Research, Analysis, Alternatives, Verification) for comprehensive task analysis. | Complex research and analysis tasks, financial analysis, strategic planning, and comprehensive reporting. |
+| **[MAKER](https://docs.swarms.world/en/latest/swarms/structs/maker/)** | Long-horizon tasks decomposed into steps; each step uses first-to-ahead-by-k voting and red-flagging on micro-agent samples (from Meyerson et al., 2025). | Extremely long or fragile pipelines where you want statistical agreement and validation on every atomic step—not a hand-designed multi-agent graph. |
 | **[SwarmRouter](https://docs.swarms.world/en/latest/swarms/structs/swarm_router/)** | A universal orchestrator that provides a single interface to run any type of swarm with dynamic selection. | Simplifying complex workflows, switching between swarm strategies, and unified multi-agent management. |
 
 -----
@@ -613,6 +614,31 @@ This architecture is perfect for financial analysis, strategic planning, researc
 
 ---
 
+### MAKER
+
+`MAKER` implements **maximal agentic decomposition** with **first-to-ahead-by-k voting** and **red-flagging**: you supply `format_prompt`, `parse_response`, and optional `validate_response` / `update_state`, then run for a fixed number of steps (or until a stop condition). Each step spins up a focused one-shot `Agent` (or cycles a pool you provide) until one parsed answer leads all others by `k` votes. This matches the error-correction story in [Solving a Million-Step LLM Task with Zero Errors](https://arxiv.org/abs/2511.09030). [Full documentation](https://docs.swarms.world/en/latest/swarms/structs/maker/)
+
+```python
+from swarms.structs.maker import MAKER
+
+maker = MAKER(
+    model_name="gpt-4.1-mini",
+    system_prompt="You solve tasks in one clear line per step.",
+    k=3,
+)
+
+# Optional: override format_prompt / parse_response / validate_response for your domain.
+results = maker.run(
+    task="List three concise benefits of typed APIs, one per step.",
+    max_steps=3,
+)
+print(results)
+```
+
+For lower latency when `k` is large, use `run_parallel_voting` with the same `task` and `max_steps`.
+
+---
+
 ### Social Algorithms
 
 **Social Algorithms** provide a flexible framework for defining custom communication patterns between agents. You can upload any arbitrary social algorithm as a callable that defines the sequence of communication, enabling agents to talk to each other in sophisticated ways. [Learn more about Social Algorithms](https://docs.swarms.world/en/latest/swarms/structs/social_algorithms/)
diff --git a/docs/mkdocs.yml b/docs/mkdocs.yml
@@ -284,6 +284,7 @@ nav:
 
         - DebateWithJudge: "swarms/structs/debate_with_judge.md"
         - MajorityVoting: "swarms/structs/majorityvoting.md"
+        - MAKER: "swarms/structs/maker.md"
         - RoundRobin: "swarms/structs/round_robin_swarm.md"
         - Mixture of Agents: "swarms/structs/moa.md"
         - SelfMoASeq: "swarms/structs/self_moa_seq.md"
diff --git a/docs/swarms/concept/swarm_architectures.md b/docs/swarms/concept/swarm_architectures.md
@@ -1,6 +1,6 @@
 # Multi-Agent Architectures
 
-### What is a Multi-Agent Architecture?
+*What are Multi-Agent Architectures?*
 
 A multi-agent architecture refers to a group of more than two agents working collaboratively to achieve a common goal. These agents can be software entities, such as LLMs that interact with each other to perform complex tasks. The concept of multi-agent architectures is inspired by how humans communicate and work together in teams, organizations, and communities, where individual contributions combine to create sophisticated collaborative problem-solving capabilities.
 
@@ -14,10 +14,6 @@ Multi-agent architectures are designed to establish and manage communication bet
 
 3. **Sequential Communication**: Sequential architectures process tasks in a linear order, where each agent's output becomes the input for the next agent. This ensures that tasks with dependencies are handled in the correct sequence, maintaining the integrity of the workflow.
 
-4. **Mesh Communication**: In mesh architectures, agents are fully connected, allowing any agent to communicate with any other agent. This setup provides high flexibility and redundancy, making it ideal for complex systems requiring dynamic interactions.
-
-5. **Federated Communication**: Federated architectures involve multiple independent systems that collaborate by sharing information and results. Each system operates autonomously but can contribute to a larger task, enabling distributed problem-solving across different nodes.
-
 Multi-agent architectures leverage these communication patterns to ensure that agents work together efficiently, adapting to the specific requirements of the task at hand. By defining clear communication protocols and interaction models, multi-agent architectures enable the seamless orchestration of multiple agents, leading to enhanced performance and problem-solving capabilities.
 
 ## Core Multi-Agent Architectures
@@ -37,6 +33,7 @@ Multi-agent architectures leverage these communication patterns to ensure that a
 | Heavy                             | High-performance architecture for handling intensive computational tasks with multiple agents.                                                                         | [Learn More](https://docs.swarms.world/en/latest/swarms/structs/heavy_swarm/)                       | Large-scale data processing, intensive computational workflows                                    |
 | Council as Judge                  | Multiple agents act as a council to evaluate and judge outputs or decisions.                                                                                           | [Learn More](https://docs.swarms.world/en/latest/swarms/structs/council_of_judges/)                     | Quality assessment, decision validation, peer review processes                                    |
 | Majority Voting                   | Agents vote on decisions with the majority determining the final outcome.                                                                                              | [Learn More](https://docs.swarms.world/en/latest/swarms/structs/majorityvoting/)                   | Democratic decision-making, consensus building, error reduction                                   |
+| MAKER                             | Decomposes work into sequential steps; each step uses repeated micro-agent samples, red-flagging, and first-to-ahead-by-k voting before committing.                     | [Learn More](https://docs.swarms.world/en/latest/swarms/structs/maker/)                             | Very long or high-precision pipelines where every atomic step should be statistically validated      |
 | Round Robin                       | Tasks are distributed cyclically among agents in a rotating order.                                                                                                     | [Learn More](https://docs.swarms.world/en/latest/swarms/structs/round_robin_swarm/)                       | Load balancing, fair task distribution, resource optimization                                     |
 | Auto-Builder                      | Automatically constructs and configures multi-agent systems based on requirements.                                                                                    | [Learn More](https://docs.swarms.world/en/latest/swarms/structs/auto_swarm_builder/)                | Dynamic system creation, adaptive architectures, rapid prototyping                               |
 | Hybrid Hierarchical Cluster      | Combines hierarchical and peer-to-peer communication patterns for complex workflows.                                                                                   | [Learn More](https://docs.swarms.world/en/latest/swarms/structs/hhcs/)     | Complex enterprise workflows, multi-department coordination                                       |
@@ -768,6 +765,37 @@ graph TD
 
 ---
 
+### MAKER
+
+**Overview:**
+MAKER (**M**aximal **A**gentic decomposition, first-to-ahead-by-**K** **E**rror correction, and **R**ed-flagging) breaks a task into many sequential steps. At each step, one-shot micro-agents sample answers; invalid outputs are discarded (red-flagging), and the framework only commits when one parsed result leads every other candidate by `k` votes. It is task-agnostic: you supply prompt formatting, parsing, and optional validation and state updates. Based on Meyerson et al. (2025); see the [paper](https://arxiv.org/abs/2511.09030).
+
+**Use Cases:**
+
+- Very long-horizon workflows where each atomic step must be reliable
+
+- Pipelines where correlated failures are reduced by voting and response validation
+
+- Domains where you can decompose work into explicit steps with clear per-step I/O
+
+**[Learn More](https://docs.swarms.world/en/latest/swarms/structs/maker/)**
+
+```mermaid
+graph TD
+    T[Task + step budget] --> S[Step loop]
+    S --> P[format_prompt → Agent.run]
+    P --> R{Valid sample?}
+    R -->|red-flag| P
+    R -->|ok| V[parse → vote tally]
+    V --> W{Leader ahead by k?}
+    W -->|no| P
+    W -->|yes| U[update_state, record result]
+    U --> S
+    S --> D[Final trajectory]
+```
+
+---
+
 ### Auto-Builder
 
 **Overview:**
diff --git a/docs/swarms/structs/index.md b/docs/swarms/structs/index.md
@@ -35,6 +35,7 @@ Multi-agent systems unlock new levels of intelligence, reliability, and efficien
 | **[SwarmRouter](https://docs.swarms.world/en/latest/swarms/structs/swarm_router/)** | Universal orchestrator that provides a single interface to run any type of swarm with dynamic selection. | Simplifying complex workflows, switching between swarm strategies, unified multi-agent management. |
 | **[HierarchicalSwarm](https://docs.swarms.world/en/latest/swarms/structs/hierarchical_swarm/)** | Director agent coordinates specialized worker agents in a hierarchy. | Complex, multi-stage tasks, iterative refinement, enterprise workflows. |
 | **[Hybrid Hierarchical-Cluster Swarm (HHCS)](https://docs.swarms.world/en/latest/swarms/structs/hhcs/)** | Router agent distributes tasks to specialized swarms for parallel, hierarchical processing. | Enterprise-scale, multi-domain, and highly complex workflows. |
+| **[MAKER](https://docs.swarms.world/en/latest/swarms/structs/maker/)** | Decomposes a task into many steps; each step uses micro-agents, red-flagging, and first-to-ahead-by-k voting before committing. | Long or fragile workflows where you want statistical agreement on every atomic step (see Meyerson et al., 2025). |
 
 ---
 
diff --git a/docs/swarms/structs/maker.md b/docs/swarms/structs/maker.md
@@ -0,0 +1,101 @@
+# MAKER
+
+**MAKER** (**M**aximal **A**gentic decomposition, first-to-ahead-by-**K** **E**rror correction, and **R**ed-flagging) is a task-agnostic orchestrator for long-horizon problems. It decomposes work into many small steps; at each step it samples LLM outputs, discards bad ones (red-flagging), and commits only when one parsed answer leads the next-best by `k` votes (“first-to-ahead-by-k”).
+
+This implementation follows the framework described in *Solving a Million-Step LLM Task with Zero Errors* (Meyerson et al., 2025) — [arXiv:2511.09030](https://arxiv.org/abs/2511.09030).
+
+**Import:** `from swarms.structs.maker import MAKER`
+
+## When to use MAKER
+
+| Use MAKER when… | Consider something else when… |
+|-----------------|-------------------------------|
+| You can express the problem as a fixed or conditionally bounded sequence of steps | You need a fixed DAG of different agents ([GraphWorkflow](graph_workflow.md)) |
+| Each step should be a single focused LLM call with statistical agreement | You want multi-agent debate + judge ([DebateWithJudge](debate_with_judge.md)) |
+| You care about per-step reliability (voting + validation) over raw speed | You only need one-shot or simple majority across agents ([MajorityVoting](majorityvoting.md)) |
+
+## How it works
+
+```mermaid
+flowchart TD
+    T[Task + max_steps] --> S[For each step]
+    S --> V[Sample votes via Agent.run]
+    V --> R{Red-flag?}
+    R -->|invalid / exception| V
+    R -->|valid| P[parse_response → hashable result]
+    P --> C{Leader ahead by k?}
+    C -->|no| V
+    C -->|yes| U[update_state, append result]
+    U --> S
+```
+
+1. **MAD (maximal agentic decomposition)** — You run up to `max_steps` iterations; each iteration is one micro-step with a prompt built from the task, optional state, step index, and the previous step’s result (`format_prompt`).
+2. **First-to-ahead-by-k voting** — Parsed answers are counted until some candidate’s count is at least `k` greater than every other candidate (`do_voting`). Optional **`run_parallel_voting`** batches the first round of samples with a thread pool.
+3. **Red-flagging** — Before parsing, `validate_response` can reject outputs (default rejects empty or overly long text vs `max_tokens`).
+
+## Constructor parameters
+
+| Parameter | Role |
+|-----------|------|
+| `model_name`, `system_prompt`, `max_tokens`, `temperature`, `temperature_first` | Passed through to per-step `Agent` instances (first vote often uses `temperature_first=0`). |
+| `k` | Votes a winner must lead the runner-up by (higher ⇒ more reliable, more cost). |
+| `format_prompt(task, state, step_idx, previous_result)` | Builds the user prompt for the current step. |
+| `parse_response(text)` | Turns raw LLM output into a **hashable** result for voting (strings, numbers, tuples of primitives, etc.). |
+| `validate_response(text, max_tokens)` | Returns `False` to discard a sample. |
+| `update_state(state, result, step_idx)` | Fold step output into state (default: unchanged). |
+| `initial_state` | Starting state for `run` / `run_until_condition`. |
+| `max_workers` | Thread pool size for `run_parallel_voting` (default: `k`). |
+| `max_retries_per_step` | Cap on samples per step before `RuntimeError`. |
+| `agents` | Optional list of pre-built `Agent`s; votes cycle through this pool instead of creating fresh micro-agents. |
+
+## Main methods
+
+| Method | Description |
+|--------|-------------|
+| `run(task, max_steps)` | Run exactly `max_steps` voting rounds; returns `list` of per-step results. |
+| `run_until_condition(task, stop_condition, max_steps=1000)` | Like `run`, but before each step the loop checks `stop_condition(state, results, step_idx)`; if true, it exits without running another vote for that index. |
+| `run_parallel_voting(task, max_steps)` | Like `run` but uses parallel sampling for the first batch of votes per step. |
+| `get_statistics()` | Copy of internal counters (samples, votes, red-flags, per-step vote/sample lists). |
+| `reset()` | Clears stats and conversation. |
+| `estimate_cost(total_steps, target_success_probability=0.95)` | Heuristic cost / `k` guidance from paper-style estimates (uses run statistics when available). |
+
+## Minimal example
+
+```python
+from swarms.structs.maker import MAKER
+
+
+def format_prompt(task, state, step_idx, previous_result):
+    prev = f"\nPrevious: {previous_result}" if previous_result is not None else ""
+    return f"{task}\nStep {step_idx + 1} of the plan. One short line only.{prev}"
+
+
+def parse_response(response: str) -> str:
+    return response.strip().splitlines()[0]
+
+
+def validate_response(response: str, max_tokens: int) -> bool:
+    if not response.strip():
+        return False
+    return len(response) // 4 <= max_tokens  # rough token estimate, same idea as default
+
+
+maker = MAKER(
+    name="LineByLine",
+    model_name="gpt-4.1-mini",
+    system_prompt="Answer in one short line per step.",
+    format_prompt=format_prompt,
+    parse_response=parse_response,
+    validate_response=validate_response,
+    k=2,
+    verbose=True,
+)
+
+results = maker.run(task="List three benefits of unit tests, one per step.", max_steps=3)
+print(results)
+```
+
+## Related
+
+- Source: `swarms/structs/maker.py` (module and class docstrings mirror this behavior).
+- [MajorityVoting](majorityvoting.md) — multi-agent loops with a consensus agent, not step-wise first-to-ahead-by-k on a decomposed trajectory.
diff --git a/docs/swarms/structs/overview.md b/docs/swarms/structs/overview.md
@@ -8,6 +8,7 @@ This page provides a comprehensive overview of all available multi-agent archite
     | Architecture | Use Case | Key Functionality | Documentation |
     |-------------|----------|-------------------|---------------|
     | MajorityVoting | Decision making through consensus | Combines multiple agent opinions and selects the most common answer | [Docs](majorityvoting.md) |
+    | MAKER | Long-horizon precision | Step-wise decomposition with first-to-ahead-by-k voting and red-flagging per step | [Docs](maker.md) |
     | AgentRearrange | Optimizing agent order | Dynamically reorders agents based on task requirements | [Docs](agent_rearrange.md) |
     | RoundRobin | Equal task distribution | Cycles through agents in a fixed order | [Docs](round_robin_swarm.md) |
     | Mixture of Agents | Complex problem solving | Combines diverse expert agents for comprehensive analysis | [Docs](moa.md) |
diff --git a/swarms/structs/llm_council.py b/swarms/structs/llm_council.py
@@ -10,19 +10,20 @@
 often selecting responses from other models as superior to their own.
 """
 
-from typing import Dict, List, Optional
 import random
+from typing import Dict, List, Optional
+
 from swarms.structs.agent import Agent
+from swarms.structs.conversation import Conversation
 from swarms.structs.multi_agent_exec import (
-    run_agents_concurrently,
     batched_grid_agent_execution,
+    run_agents_concurrently,
 )
+from swarms.structs.swarm_id import swarm_id
 from swarms.utils.history_output_formatter import (
     HistoryOutputType,
     history_output_formatter,
 )
-from swarms.structs.conversation import Conversation
-from swarms.structs.swarm_id import swarm_id
 
 
 def get_gpt_councilor_prompt() -> str:
diff --git a/tests/structs/test_maker.py b/tests/structs/test_maker.py