Skip to content

charlieyou/mala

Repository files navigation

mala

PyPI version Ask DeepWiki

Multi-Agent Loop Architecture

A multi-agent system for processing beads issues in parallel using Claude, Amp, or Codex coder backends.

The name also derives from Sanskrit, where mala means "garland" or "string of beads" - fitting for a system that orchestrates beads issues in a continuous loop, like counting prayer beads.

Why Mala?

The core insight: agents degrade as context grows.

LLM agents become unreliable as their context window fills up. Early in a session, an agent follows instructions precisely, catches edge cases, and produces clean code. But as context accumulates—tool outputs, file contents, previous attempts—performance degrades.

The solution: small tasks, fresh context, automated verification.

  1. Breaking work into atomic issues — Each issue is sized to complete within ~100k tokens
  2. Starting each agent with cleared context — Every issue gets a fresh agent session
  3. Running automated checks after completion — Linting, tests, type checking, and code review
  4. Looping until done — The orchestrator continuously spawns agents for ready issues

Prerequisites

Beads

Beads is the issue tracking system that agents pull work from. See the repo for installation instructions.

Claude Code

Claude Code CLI is the default agent runtime. See the docs for installation instructions.

Amp (Optional, for coder: amp)

Mala can drive its per-issue implementation agent on Sourcegraph's Amp instead of Claude. Amp is opt-in via --coder amp / MALA_CODER=amp / coder: amp in mala.yaml.

When coder: amp is selected, the orchestrator runs amp --execute --stream-json under --dangerously-allow-all and relies on a bundled TypeScript safety plugin (plugins/amp/mala-safety.ts) for dangerous-command and lock-ownership enforcement. Before any issue agent is spawned, mala runs a fail-closed runtime self-test that proves the plugin actually loaded; if it doesn't, the run aborts with a clear error.

Prerequisites:

  • Binary install required. Install Amp via the official binary install documented at https://ampcode.com/manual. The npm package @sourcegraph/amp is not supported for coder: amp: per the Amp plugin API, plugins only load under the binary install with PLUGINS=all set and a working Bun runtime. An npm-installed Amp will fail mala's runtime plugin self-test and abort the run before any issue agent runs.
  • Amp CLI installed and authenticated/configured in your shell.
  • Bun runtime present — provided by the Amp binary install; mala does not install Bun separately.
  • ~/.config/amp/plugins/ writable. Mala installs mala-safety to ~/.config/amp/plugins/mala-safety/ idempotently on every run.
  • Mala always sets PLUGINS=all for you — this is not user-managed.

Tested-against version: see plugins/amp/README.md for the pinned plugin acknowledgment header. Run amp --version to compare your local install, and uv run pytest -m e2e tests/e2e/test_amp_real_cli.py to check the real CLI stream-json contract.

Costs / agent modes: Amp routes to different models based on --amp-mode:

Mode Model
smart Claude Opus
rush Claude Haiku
deep (default) GPT-5 reasoning

Known limitations under coder: amp (MVP):

  • MALA_DISALLOWED_TOOLS is a no-op under Amp — the bundled plugin only enforces dangerous-command blocking and lock-ownership. A warning is logged once at run start when the env var is set. Tracked as a follow-up.
  • Cross-coder session resume is not supported (Amp thread IDs are not interchangeable with Claude session IDs).
  • --claude-settings-sources is logged as ignored under coder: amp; symmetrically, --amp-mode is logged as ignored under coder: claude.
  • No devcontainer integration: Amp install/auth is a user prerequisite, not baked into mala's DevContainer image.

Codex (for coder: codex)

Mala can drive its per-issue implementation agent on OpenAI's codex app-server (gpt-5.5 family) instead of Claude or Amp. Codex is opt-in via --coder codex / MALA_CODER=codex / coder: codex in mala.yaml; the default remains coder: claude.

When coder: codex is selected, mala drives codex app-server through the codex_app_server Python SDK (in-process JSON-RPC over stdio — no CLI subprocess wrapping by mala). The orchestrator runs Codex with sandbox: danger-full-access and approval_policy: never by default, and relies on a bundled PreToolUse command hook (mala-codex-pre-tool-use) plus the bundled mala-locking MCP server for dangerous-command blocking, lock-ownership enforcement, and MALA_DISALLOWED_TOOLS enforcement. Both are packaged as a Mala-shipped Codex plugin (plugins/codex/mala-safety/.codex-plugin/). Mala installs and trusts that plugin inside a per-run temporary CODEX_HOME seeded from your normal Codex auth/config, then passes that CODEX_HOME only to the codex app-server subprocess it launches. Your normal ~/.codex is not mutated, so ordinary Codex CLI sessions do not load Mala's safety hook. Before any issue agent is spawned, mala runs a fail-closed runtime self-test that proves both SessionStart and PreToolUse hook handlers are active and trusted; if either handler is missing, disabled, untrusted, or stale, the run aborts with a clear error.

Prerequisites:

  • Codex Python SDK. Installed with mala by default. The SDK is experimental ("expect breaking changes"); mala pins to the upstream tag in pyproject.toml.
  • Codex runtime binary (openai-codex-cli-bin). The SDK pulls this in as a transitive dependency; it is platform-specific (mac/linux/windows wheels) and pinned to an exact version matching the SDK release. Mala does not vendor the runtime.
  • Codex auth. Configure Codex auth in your local Codex config (e.g., via codex login) — see Codex docs for the current auth flow. Mala's install_prerequisites() fails closed with CodexNotInstalledError when the SDK, runtime, or auth is missing.
  • Your normal $CODEX_HOME must be readable enough for mala to detect Codex auth (auth.json, keyring config, or auth env vars). Plugin files and hook trust entries are written only to mala's temporary CODEX_HOME for the run.

Defaults under coder: codex:

Option Default Why
model gpt-5.5 Latest gpt-5.5 family release
effort medium Shared coder reasoning-effort default for Codex
approval_policy never Unattended-run posture; bundled hook is the gate
sandbox danger-full-access Same posture as Amp's --dangerously-allow-all

Known limitations under coder: codex (MVP):

  • No cross-coder session resume. Codex thr_* thread IDs are not interchangeable with Claude session IDs or Amp T-* thread IDs.
  • Claude settings and Amp mode settings are only used by their matching coders; other combinations may be ignored.
  • The bundled mala-locking MCP server is mandatory and cannot be replaced via coder_options.codex.mcp_servers; user-supplied servers are merged with the bundled one (the bundled key wins on conflict).
  • ReasoningThreadItem content (Codex's internal reasoning) is stripped from AgentEvents in MVP (parity with Amp's stripped-thinking stance).
  • No devcontainer baking: Codex install/auth is a user prerequisite. The existing DevContainer mounts ~/.codex so an authed local install carries through.

Cerberus Review-Gate (Optional)

Cerberus provides automated code review when reviewer_type: cerberus is enabled in mala.yaml. If you use reviewer_type: agent_sdk, no Cerberus install is required.

To enable Cerberus reviews, install the Cerberus v2 binary and make sure cerberus is available on $PATH.

Installation

uv tool install mala-agent

Usage

mala init                                 # Interactively create mala.yaml
mala init --yes --preset python-uv         # Non-interactive init with defaults
mala run /path/to/repo                    # Run the parallel worker
mala run --max-agents 5 /path/to/repo     # Limit concurrent agents
mala run --scope epic:proj-abc /path/to/repo    # Process children of epic
mala run --scope ids:issue-1,issue-2 --order input /path/to/repo  # Specific issues in order
mala run --resume /path/to/repo            # Include in_progress issues and resume sessions
mala run --review-wip /path/to/repo        # Review in_progress issues first after interrupted reviews
mala run --strict --resume /path/to/repo   # Fail if a resumed issue has no session
mala run --watch /path/to/repo             # Keep polling for new issues
mala run --config mala.codex.yaml /path/to/repo  # Use an alternate project config
mala run --coder amp /path/to/repo         # Use Amp instead of Claude as the per-issue coder
mala run --coder amp --amp-mode rush /path/to/repo  # Amp in rush mode (Haiku)
mala run --coder codex /path/to/repo       # Use Codex (gpt-5.5) as the per-issue coder
mala run --coder codex --model gpt-5.5 --effort high /path/to/repo
mala status                               # Check locks, config, logs
mala status --all                          # Show running instances across directories
mala logs list                            # List recent runs
mala logs sessions --issue ISSUE-123      # Find sessions for an issue
mala logs show <run_id_prefix>            # Show run metadata
mala clean                                # Clean up locks
mala clean --force                         # Clean even if mala is running
mala epic-verify proj-abc /path/to/repo   # Verify and close an epic
mala epic-verify --config mala.strict.yaml proj-abc /path/to/repo

By default, Mala loads <repo>/mala.yaml. --config PATH selects any alternate config filename for run and epic-verify; relative paths are resolved from your current working directory.

How It Works

  1. Orchestrator queries bd ready --json for available issues
  2. Filtering: Epics are skipped - only tasks/bugs are processed
  3. Spawning: Up to N parallel agent tasks (unlimited by default)
  4. Per-session pipeline: Agent implements → quality gate (commit + evidence) → session_end trigger (optional) → external review → close
  5. Trigger validation: periodic, epic_completion, and run_end triggers run configured commands with optional fixer remediation
  6. Epic verification: When all children close, verifies acceptance criteria

Agent Workflow

  1. Understand: Read issue details (injected into prompt)
  2. Lock files: Acquire filesystem locks before editing
  3. Implement: Write code following project conventions
  4. Quality checks: Run the required validations for evidence (see evidence_check in mala.yaml)
  5. Commit: Stage and commit changes locally
  6. Session-end validation: Orchestrator may run additional commands after gate passes
  7. Cleanup: Release locks (orchestrator closes issue after gate + review)

Resolution Markers

Agents can signal non-implementation resolutions:

Marker Meaning
ISSUE_NO_CHANGE Issue requires no code changes
ISSUE_OBSOLETE Issue is no longer relevant
ISSUE_ALREADY_COMPLETE Work was already done in a prior commit
ISSUE_DOCS_ONLY Documentation-only changes; skip validation evidence

Epics and Parent-Child Issues

  • Epics are skipped: Issues with issue_type: "epic" are never assigned to agents
  • Parent-child is non-blocking: Use bd dep add <child> <epic> --type parent-child
  • Verification before close: When all children complete, the epic is verified against its acceptance criteria

Coordination

Layer Tool Purpose
Issue-level Beads (bd) Prevents duplicate claims via status updates
File-level Filesystem locks Prevents edit conflicts between agents

Lock Enforcement

File locks are enforced at two levels:

  1. MCP locking tools: Agents acquire locks before editing files via lock_acquire/lock_release MCP tools
  2. PreToolUse hook: Blocks file-write tool calls unless the agent holds the lock

Git Safety

Dangerous commands are blocked to avoid destructive or conflicting actions:

  • Destructive git operations: git reset --hard|--soft|--mixed, git reset HEAD, git checkout -f|--force|--, git restore, git clean -f|-fd, git rebase, git commit --amend, git branch -D, git merge --abort, git rebase --abort, git cherry-pick --abort, git worktree remove, git submodule deinit -f, git stash
  • Dangerous shell patterns: rm -rf /, rm -rf ~, fork bombs, mkfs.*, raw disk writes, curl|wget | bash/sh

The hook errors include safe alternatives where possible.

Creating Issues

Mala's effectiveness depends on well-structured beads issues. Each issue must be self-contained and unambiguous.

Principle Description
Atomic One issue = one clear outcome
Sized for agents Completable within ~100k tokens
Minimal file overlap Issues touching same files cannot run in parallel
Actionable Clear acceptance criteria and test plan
Grounded Include exact file/line pointers when available

See commands/bd-breakdown.md for the full issue creation workflow.

Documentation

  • Architecture — Layered architecture, module responsibilities, key flows
  • CLI Reference — CLI options, environment variables, integrations
  • Project Configuration — mala.yaml schema, alternate config files, presets, coverage settings
  • Validation — Evidence check, session_end, review gates, trigger validation
  • Validation Triggers — Trigger-based validation and code review
  • Development — Type checking, testing, package structure
  • plans/ — Historical design documents (not actively maintained)

Running in a Sandbox

Mala spawns AI agents with permissive tool access. Running in a container is strongly recommended to limit blast radius if an agent misbehaves.

DevContainer (Recommended)

This repo includes a DevContainer configuration for developing mala:

devcontainer up --workspace-folder .
devcontainer exec --workspace-folder . mala run /workspaces/mala

The DevContainer mounts:

  • /workspaces/mala — the mala source code
  • /.claude — Claude Code auth and plugins (including Cerberus)
  • /.codex — Codex CLI config
  • /.gemini — Gemini CLI config
  • /.config/mala — mala logs and run state

Pre-installed tools: Claude Code, Codex CLI, Gemini CLI, bd (Beads), uv, Python 3.12, Node.js

What DevContainers Protect Against

Risk Protected?
Modifying files outside mounted dirs ✅ Yes
Accessing host processes ✅ Yes
Persisting malware on host ✅ Yes
Reading mounted sensitive files ❌ No
Network exfiltration ❌ No (full network access)

DevContainers provide process isolation (prevent accidents) not security isolation (prevent malice).

About

Multi-agent orchestrator for parallel issue processing using Claude Agent SDK

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages