You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
MetaGPT generates multi-agent systems that execute code, modify files, and interact with APIs. But there's no governance layer between "MetaGPT designs an agent" and "that agent executes in a user's repository."
This matters because MetaGPT is a library — users embed it in their workflows. When a MetaGPT-generated agent runs, it has the same permissions as the process that spawned it.
There's no step where the user or repo can say: "Generated agents may write to output/ but not to src/", or "Agents need approval before sending API requests."
MetaGPT is the only multi-agent framework where agents design and spawn other agents. This creates a transitive trust problem: if MetaGPT generates an agent that itself generates code, the user needs governance at every layer.
Without a policy boundary, MetaGPT-generated agents inherit the full permissions of the process — creating the same eval() vulnerability class that has surfaced in other frameworks (e.g., CVE-2026-2275).
Next steps
Discuss whether capability declaration should be part of the core agent model or a plugin
Context
MetaGPT generates multi-agent systems that execute code, modify files, and interact with APIs. But there's no governance layer between "MetaGPT designs an agent" and "that agent executes in a user's repository."
This matters because MetaGPT is a library — users embed it in their workflows. When a MetaGPT-generated agent runs, it has the same permissions as the process that spawned it.
The gap
Currently, the model is:
There's no step where the user or repo can say: "Generated agents may write to
output/but not tosrc/", or "Agents need approval before sending API requests."Proposal: Agent capability declaration + policy boundary
Two complementary mechanisms:
1. Agent-generated capability manifest
When MetaGPT generates an agent, it should produce a capability declaration alongside the agent definition:
2. Repo-level policy file (META.yml or AGENTOWNERS.yml)
A governance file at the repo root that gates what ANY agent (MetaGPT-generated or otherwise) may do:
Why this matters for MetaGPT specifically
MetaGPT is the only multi-agent framework where agents design and spawn other agents. This creates a transitive trust problem: if MetaGPT generates an agent that itself generates code, the user needs governance at every layer.
Without a policy boundary, MetaGPT-generated agents inherit the full permissions of the process — creating the same
eval()vulnerability class that has surfaced in other frameworks (e.g., CVE-2026-2275).Next steps
I've been working on deterministic policy evaluation for AI agents and happy to discuss patterns that fit MetaGPT's architecture.