mala

Multi-Agent Loop Architecture

A multi-agent system for processing beads issues in parallel using Claude, Amp, or Codex coder backends.

The name also derives from Sanskrit, where mala means "garland" or "string of beads" - fitting for a system that orchestrates beads issues in a continuous loop, like counting prayer beads.

Why Mala?

The core insight: agents degrade as context grows.

LLM agents become unreliable as their context window fills up. Early in a session, an agent follows instructions precisely, catches edge cases, and produces clean code. But as context accumulates—tool outputs, file contents, previous attempts—performance degrades.

The solution: small tasks, fresh context, automated verification.

Breaking work into atomic issues — Each issue is sized to complete within ~100k tokens
Starting each agent with cleared context — Every issue gets a fresh agent session
Running automated checks after completion — Linting, tests, type checking, and code review
Looping until done — The orchestrator continuously spawns agents for ready issues

Prerequisites

Beads

Beads is the issue tracking system that agents pull work from. See the repo for installation instructions.

Claude Code

Claude Code CLI is the default agent runtime. See the docs for installation instructions.

Amp (Optional, for `coder: amp`)

Mala can drive its per-issue implementation agent on Sourcegraph's Amp instead of Claude. Amp is opt-in via --coder amp / MALA_CODER=amp / coder: amp in mala.yaml.

When coder: amp is selected, the orchestrator runs amp --execute --stream-json under --dangerously-allow-all and relies on a bundled TypeScript safety plugin (plugins/amp/mala-safety.ts) for dangerous-command and lock-ownership enforcement. Before any issue agent is spawned, mala runs a fail-closed runtime self-test that proves the plugin actually loaded; if it doesn't, the run aborts with a clear error.

Prerequisites:

Binary install required. Install Amp via the official binary install documented at https://ampcode.com/manual. The npm package @sourcegraph/amp is not supported for coder: amp: per the Amp plugin API, plugins only load under the binary install with PLUGINS=all set and a working Bun runtime. An npm-installed Amp will fail mala's runtime plugin self-test and abort the run before any issue agent runs.
Amp CLI installed and authenticated/configured in your shell.
Bun runtime present — provided by the Amp binary install; mala does not install Bun separately.
~/.config/amp/plugins/ writable. Mala installs mala-safety to ~/.config/amp/plugins/mala-safety/ idempotently on every run.
Mala always sets PLUGINS=all for you — this is not user-managed.

Tested-against version: see plugins/amp/README.md for the pinned plugin acknowledgment header. Run amp --version to compare your local install, and uv run pytest -m e2e tests/e2e/test_amp_real_cli.py to check the real CLI stream-json contract.

Costs / agent modes: Amp routes to different models based on --amp-mode:

Mode	Model
`smart`	Claude Opus
`rush`	Claude Haiku
`deep` (default)	GPT-5 reasoning

Known limitations under coder: amp (MVP):

MALA_DISALLOWED_TOOLS is a no-op under Amp — the bundled plugin only enforces dangerous-command blocking and lock-ownership. A warning is logged once at run start when the env var is set. Tracked as a follow-up.
Cross-coder session resume is not supported (Amp thread IDs are not interchangeable with Claude session IDs).
--claude-settings-sources is logged as ignored under coder: amp; symmetrically, --amp-mode is logged as ignored under coder: claude.
No devcontainer integration: Amp install/auth is a user prerequisite, not baked into mala's DevContainer image.

Codex (for `coder: codex`)

Mala can drive its per-issue implementation agent on OpenAI's codex app-server (gpt-5.5 family) instead of Claude or Amp. Codex is opt-in via --coder codex / MALA_CODER=codex / coder: codex in mala.yaml; the default remains coder: claude.

When coder: codex is selected, mala drives codex app-server through the codex_app_server Python SDK (in-process JSON-RPC over stdio — no CLI subprocess wrapping by mala). The orchestrator runs Codex with sandbox: danger-full-access and approval_policy: never by default, and relies on a bundled PreToolUse command hook (mala-codex-pre-tool-use) plus the bundled mala-locking MCP server for dangerous-command blocking, lock-ownership enforcement, and MALA_DISALLOWED_TOOLS enforcement. Both are packaged as a Mala-shipped Codex plugin (plugins/codex/mala-safety/.codex-plugin/). Mala installs and trusts that plugin inside a per-run temporary CODEX_HOME seeded from your normal Codex auth/config, then passes that CODEX_HOME only to the codex app-server subprocess it launches. Your normal ~/.codex is not mutated, so ordinary Codex CLI sessions do not load Mala's safety hook. Before any issue agent is spawned, mala runs a fail-closed runtime self-test that proves both SessionStart and PreToolUse hook handlers are active and trusted; if either handler is missing, disabled, untrusted, or stale, the run aborts with a clear error.

Prerequisites:

Codex Python SDK. Installed with mala by default. The SDK is experimental ("expect breaking changes"); mala pins to the upstream tag in pyproject.toml.
Codex runtime binary (openai-codex-cli-bin). The SDK pulls this in as a transitive dependency; it is platform-specific (mac/linux/windows wheels) and pinned to an exact version matching the SDK release. Mala does not vendor the runtime.
Codex auth. Configure Codex auth in your local Codex config (e.g., via codex login) — see Codex docs for the current auth flow. Mala's install_prerequisites() fails closed with CodexNotInstalledError when the SDK, runtime, or auth is missing.
Your normal $CODEX_HOME must be readable enough for mala to detect Codex auth (auth.json, keyring config, or auth env vars). Plugin files and hook trust entries are written only to mala's temporary CODEX_HOME for the run.

Defaults under coder: codex:

Option	Default	Why
`model`	`gpt-5.5`	Latest gpt-5.5 family release
`effort`	`medium`	Shared coder reasoning-effort default for Codex
`approval_policy`	`never`	Unattended-run posture; bundled hook is the gate
`sandbox`	`danger-full-access`	Same posture as Amp's `--dangerously-allow-all`

Known limitations under coder: codex (MVP):

No cross-coder session resume. Codex thr_* thread IDs are not interchangeable with Claude session IDs or Amp T-* thread IDs.
Claude settings and Amp mode settings are only used by their matching coders; other combinations may be ignored.
The bundled mala-locking MCP server is mandatory and cannot be replaced via coder_options.codex.mcp_servers; user-supplied servers are merged with the bundled one (the bundled key wins on conflict).
ReasoningThreadItem content (Codex's internal reasoning) is stripped from AgentEvents in MVP (parity with Amp's stripped-thinking stance).
No devcontainer baking: Codex install/auth is a user prerequisite. The existing DevContainer mounts ~/.codex so an authed local install carries through.

Cerberus Review-Gate (Optional)

Cerberus provides automated code review when reviewer_type: cerberus is enabled in mala.yaml. If you use reviewer_type: agent_sdk, no Cerberus install is required.

To enable Cerberus reviews, install the Cerberus v2 binary and make sure cerberus is available on $PATH.

Installation

uv tool install mala-agent

Usage

mala init                                 # Interactively create mala.yaml
mala init --yes --preset python-uv         # Non-interactive init with defaults
mala run /path/to/repo                    # Run the parallel worker
mala run --max-agents 5 /path/to/repo     # Limit concurrent agents
mala run --scope epic:proj-abc /path/to/repo    # Process children of epic
mala run --scope ids:issue-1,issue-2 --order input /path/to/repo  # Specific issues in order
mala run --resume /path/to/repo            # Include in_progress issues and resume sessions
mala run --review-wip /path/to/repo        # Review in_progress issues first after interrupted reviews
mala run --strict --resume /path/to/repo   # Fail if a resumed issue has no session
mala run --watch /path/to/repo             # Keep polling for new issues
mala run --config mala.codex.yaml /path/to/repo  # Use an alternate project config
mala run --coder amp /path/to/repo         # Use Amp instead of Claude as the per-issue coder
mala run --coder amp --amp-mode rush /path/to/repo  # Amp in rush mode (Haiku)
mala run --coder codex /path/to/repo       # Use Codex (gpt-5.5) as the per-issue coder
mala run --coder codex --model gpt-5.5 --effort high /path/to/repo
mala status                               # Check locks, config, logs
mala status --all                          # Show running instances across directories
mala logs list                            # List recent runs
mala logs sessions --issue ISSUE-123      # Find sessions for an issue
mala logs show <run_id_prefix>            # Show run metadata
mala clean                                # Clean up locks
mala clean --force                         # Clean even if mala is running
mala epic-verify proj-abc /path/to/repo   # Verify and close an epic
mala epic-verify --config mala.strict.yaml proj-abc /path/to/repo

By default, Mala loads <repo>/mala.yaml. --config PATH selects any alternate config filename for run and epic-verify; relative paths are resolved from your current working directory.

How It Works

Orchestrator queries bd ready --json for available issues
Filtering: Epics are skipped - only tasks/bugs are processed
Spawning: Up to N parallel agent tasks (unlimited by default)
Per-session pipeline: Agent implements → quality gate (commit + evidence) → session_end trigger (optional) → external review → close
Trigger validation: periodic, epic_completion, and run_end triggers run configured commands with optional fixer remediation
Epic verification: When all children close, verifies acceptance criteria

Agent Workflow

Understand: Read issue details (injected into prompt)
Lock files: Acquire filesystem locks before editing
Implement: Write code following project conventions
Quality checks: Run the required validations for evidence (see evidence_check in mala.yaml)
Commit: Stage and commit changes locally
Session-end validation: Orchestrator may run additional commands after gate passes
Cleanup: Release locks (orchestrator closes issue after gate + review)

Resolution Markers

Agents can signal non-implementation resolutions:

Marker	Meaning
`ISSUE_NO_CHANGE`	Issue requires no code changes
`ISSUE_OBSOLETE`	Issue is no longer relevant
`ISSUE_ALREADY_COMPLETE`	Work was already done in a prior commit
`ISSUE_DOCS_ONLY`	Documentation-only changes; skip validation evidence

Epics and Parent-Child Issues

Epics are skipped: Issues with issue_type: "epic" are never assigned to agents
Parent-child is non-blocking: Use bd dep add <child> <epic> --type parent-child
Verification before close: When all children complete, the epic is verified against its acceptance criteria

Coordination

Layer	Tool	Purpose
Issue-level	Beads (`bd`)	Prevents duplicate claims via status updates
File-level	Filesystem locks	Prevents edit conflicts between agents

Lock Enforcement

File locks are enforced at two levels:

MCP locking tools: Agents acquire locks before editing files via lock_acquire/lock_release MCP tools
PreToolUse hook: Blocks file-write tool calls unless the agent holds the lock

Git Safety

Dangerous commands are blocked to avoid destructive or conflicting actions:

Destructive git operations: git reset --hard|--soft|--mixed, git reset HEAD, git checkout -f|--force|--, git restore, git clean -f|-fd, git rebase, git commit --amend, git branch -D, git merge --abort, git rebase --abort, git cherry-pick --abort, git worktree remove, git submodule deinit -f, git stash
Dangerous shell patterns: rm -rf /, rm -rf ~, fork bombs, mkfs.*, raw disk writes, curl|wget | bash/sh

The hook errors include safe alternatives where possible.

Creating Issues

Mala's effectiveness depends on well-structured beads issues. Each issue must be self-contained and unambiguous.

Principle	Description
Atomic	One issue = one clear outcome
Sized for agents	Completable within ~100k tokens
Minimal file overlap	Issues touching same files cannot run in parallel
Actionable	Clear acceptance criteria and test plan
Grounded	Include exact file/line pointers when available

See commands/bd-breakdown.md for the full issue creation workflow.

Documentation

Architecture — Layered architecture, module responsibilities, key flows
CLI Reference — CLI options, environment variables, integrations
Project Configuration — mala.yaml schema, alternate config files, presets, coverage settings
Validation — Evidence check, session_end, review gates, trigger validation
Validation Triggers — Trigger-based validation and code review
Development — Type checking, testing, package structure
plans/ — Historical design documents (not actively maintained)

Running in a Sandbox

Mala spawns AI agents with permissive tool access. Running in a container is strongly recommended to limit blast radius if an agent misbehaves.

DevContainer (Recommended)

This repo includes a DevContainer configuration for developing mala:

devcontainer up --workspace-folder .
devcontainer exec --workspace-folder . mala run /workspaces/mala

The DevContainer mounts:

/workspaces/mala — the mala source code
/.claude — Claude Code auth and plugins (including Cerberus)
/.codex — Codex CLI config
/.gemini — Gemini CLI config
/.config/mala — mala logs and run state

Pre-installed tools: Claude Code, Codex CLI, Gemini CLI, bd (Beads), uv, Python 3.12, Node.js

What DevContainers Protect Against

Risk	Protected?
Modifying files outside mounted dirs	✅ Yes
Accessing host processes	✅ Yes
Persisting malware on host	✅ Yes
Reading mounted sensitive files	❌ No
Network exfiltration	❌ No (full network access)

DevContainers provide process isolation (prevent accidents) not security isolation (prevent malice).

Name		Name	Last commit message	Last commit date
Latest commit History 2,531 Commits
.beads		.beads
.devcontainer		.devcontainer
.github/workflows		.github/workflows
docs		docs
plans		plans
plugins		plugins
src		src
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE.txt		LICENSE.txt
README.md		README.md
TODO.md		TODO.md
mala.yaml		mala.yaml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mala

Why Mala?

Prerequisites

Beads

Claude Code

Amp (Optional, for `coder: amp`)

Codex (for `coder: codex`)

Cerberus Review-Gate (Optional)

Installation

Usage

How It Works

Agent Workflow

Resolution Markers

Epics and Parent-Child Issues

Coordination

Lock Enforcement

Git Safety

Creating Issues

Documentation

Running in a Sandbox

DevContainer (Recommended)

What DevContainers Protect Against

About

Uh oh!

Releases 79

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mala

Why Mala?

Prerequisites

Beads

Claude Code

Amp (Optional, for coder: amp)

Codex (for coder: codex)

Cerberus Review-Gate (Optional)

Installation

Usage

How It Works

Agent Workflow

Resolution Markers

Epics and Parent-Child Issues

Coordination

Lock Enforcement

Git Safety

Creating Issues

Documentation

Running in a Sandbox

DevContainer (Recommended)

What DevContainers Protect Against

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 79

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Amp (Optional, for `coder: amp`)

Codex (for `coder: codex`)

Packages