Sandboxing Code Execution: The RCE Defense
Abstract
When an autonomous agent transitions from generating text to executing code, the operational paradigm shifts from probabilistic reasoning to deterministic system mutation. Allowing an LLM to dynamically write and execute Python scripts to "analyze data" without strict hardware-level isolation is equivalent to offering Remote Code Execution (RCE) as a feature. A prompt injection attack can trivially pivot a data-analysis agent into reading your .env files, exfiltrating credentials, or deleting host directories. This artifact establishes the non-negotiable architecture for ephemeral, network-isolated code sandboxing, ensuring that even a completely compromised agent remains physically contained.
1. Why This Topic Matters
The most severe architectural sin in AI engineering is running LLM-generated code directly on your host infrastructure using eval(), exec(), or an unprotected local subprocess.
LLMs are non-deterministic threat actors. Even without malicious user intent (prompt injection), a hallucinating model might decide the most efficient way to clear a dataframe is to run os.system('rm -rf /'). If your agent has access to a REPL (Read-Eval-Print Loop) to verify its own math or analyze a CSV, you have introduced a massive attack surface. We do not rely on system prompts like "Never run malicious code" to secure infrastructure. We rely on physical and cryptographic isolation.
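To make the attack surface concrete, here is a minimal sketch (the file path and variable names are illustrative): anything passed to exec() runs with the full privileges of the host process, so any secret the process can read, the generated code can read too.

```python
import os
import tempfile

# Simulate a secret the host process can reach (a stand-in for a real .env file).
secret_path = os.path.join(tempfile.gettempdir(), "fake_env_demo")
with open(secret_path, "w") as f:
    f.write("AWS_SECRET_ACCESS_KEY=not-a-real-key")

# This string stands in for "helpful" model output after a prompt injection.
llm_output = f"leaked = open({secret_path!r}).read()"

# exec() runs the untrusted string with the host process's full privileges:
# the "agent" has just read a credential file, no exploit required.
ns = {}
exec(llm_output, ns)
print(ns["leaked"])  # AWS_SECRET_ACCESS_KEY=not-a-real-key

os.remove(secret_path)
```

No jailbreak, no container escape, no CVE: this is simply what exec() does when the boundary is a string filter instead of an operating system.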
2. Core Concepts & Mental Models
The fundamental mental model for code-executing agents is Zero Trust Ephemerality.
- Assume Hostility: You must assume every line of code generated by the LLM is actively attempting to breach your system.
- Ephemerality: The execution environment should not exist before the code needs to run, and it must be destroyed milliseconds after the execution completes.
- The Airgap: Unless explicitly required (and proxy-filtered) for a specific task, the sandbox must have no route to the public internet and zero access to the host network (e.g., your internal databases or AWS metadata endpoints).
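The ephemerality principle can be sketched on the host side in a few lines (the function name is mine, not a standard API): the workspace is created on demand and destroyed the instant execution finishes, even if the run raised, so nothing the agent wrote can linger between invocations.

```python
import contextlib
import os
import shutil
import tempfile

@contextlib.contextmanager
def ephemeral_workspace():
    """A directory that exists only for the duration of a single execution."""
    path = tempfile.mkdtemp(prefix="agent-sbx-")
    try:
        yield path
    finally:
        # Destroyed unconditionally, even if the execution raised.
        shutil.rmtree(path, ignore_errors=True)

with ephemeral_workspace() as ws:
    with open(os.path.join(ws, "script.py"), "w") as f:
        f.write("print('hello')")
    existed_during_run = os.path.isdir(ws)

print(existed_during_run, os.path.isdir(ws))  # True False
```

The same lifecycle applies one level down: the container or MicroVM that executes the code should be created and torn down with exactly this shape.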
3. Theoretical Foundations
Sandboxing relies on boundary enforcement at the operating system or hypervisor level:
- Namespaces and Cgroups (Docker): Isolates the process view of the filesystem, network, and process tree, while limiting CPU and memory usage. However, the container still shares the host's OS kernel.
- Hardware Virtualization (Firecracker MicroVMs): Provides a lightweight, dedicated kernel for each execution environment. If an exploit compromises the kernel, it only compromises the VM's kernel, not the host machine's. This is the technology powering AWS Lambda.
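The resource-limiting half of the cgroup story can be approximated from pure Python with POSIX rlimits. This is only a sketch of resource ceilings (Linux; function names are mine), and it provides none of the filesystem or network isolation that namespaces or a hypervisor add.

```python
import resource
import subprocess
import sys

def run_with_rlimits(code: str, mem_bytes: int = 256 * 1024 * 1024,
                     cpu_seconds: int = 2) -> subprocess.CompletedProcess:
    """Run code in a child process under kernel-enforced resource ceilings."""
    def apply_limits():
        # Cap the virtual address space: oversized allocations fail.
        resource.setrlimit(resource.RLIMIT_AS, (mem_bytes, mem_bytes))
        # Cap CPU time: the kernel kills the process past the limit.
        resource.setrlimit(resource.RLIMIT_CPU, (cpu_seconds, cpu_seconds))

    return subprocess.run(
        [sys.executable, "-c", code],
        preexec_fn=apply_limits,  # runs in the child, after fork, before exec
        capture_output=True, text=True, timeout=30,
    )

# A 1 GiB allocation blows past the 256 MiB address-space ceiling,
# so the child process exits with an error instead of eating host memory.
result = run_with_rlimits("x = bytearray(1024 * 1024 * 1024); print('allocated')")
print(result.returncode != 0)
```

Note the enforcement point: the kernel, not the Python interpreter, refuses the allocation. That is the pattern every layer of this chapter repeats at increasing strength.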
4. Production-Grade Implementation
Resolving the Trade-off: Isolation Startup Time vs. Security Guarantees
Heavy virtualization (like traditional EC2 instances) provides strong security but takes minutes to boot, making synchronous agentic loops impossibly slow. Docker containers start in milliseconds but carry kernel-sharing risks (container escape vulnerabilities).
The Resolution: For single-tenant applications executing trusted internal data, a heavily locked-down Docker container (dropping all capabilities, read-only root filesystem, no network) is acceptable to maintain sub-second latency. However, for multi-tenant, user-facing agentic platforms, Docker is insufficient. You must use MicroVMs (Firecracker) or managed execution APIs (like E2B). MicroVMs boot in ~150ms, resolving the latency trade-off while providing hardware-level security guarantees.
Furthermore, Network Isolation is non-negotiable. The sandbox must be instantiated with loopback access only. If the agent needs to download a dataset, the host downloads it, sanitizes it, and mounts it to the sandbox. The sandbox itself cannot curl the outside world.
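A sketch of that host-side staging pattern (the allow-list and function names are assumptions, not a standard API): the host validates the source, performs the download itself, and the sandbox only ever sees the resulting file via a read-only mount.

```python
import os
import tempfile
import urllib.request
from urllib.parse import urlparse

# Assumption: an explicit allow-list your platform maintains.
ALLOWED_HOSTS = {"data.example.com"}

def stage_dataset_for_sandbox(url: str, staging_root: str) -> str:
    """HOST-side fetch. The sandbox runs with --network none and never touches
    the network; it only sees the staged file through a :ro mount."""
    host = urlparse(url).hostname
    if host not in ALLOWED_HOSTS:
        raise PermissionError(f"host {host!r} is not on the download allow-list")
    local_path = os.path.join(staging_root, "dataset.csv")
    urllib.request.urlretrieve(url, local_path)  # the download happens on the host
    # Sanitization / checksum verification would go here before mounting.
    return local_path

# A non-allow-listed source is refused before any bytes move.
try:
    stage_dataset_for_sandbox("http://evil.example.net/x.csv", tempfile.gettempdir())
except PermissionError as e:
    print(f"blocked: {e}")
```

The important property is the direction of trust: the sandbox can never initiate a fetch, so an injected "please download this helper script" instruction has nowhere to go.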
5. Hands-On Project / Exercise
Constraint: You will orchestrate a strictly sandboxed Python environment using Docker. The agent will submit code to:
- Generate a matplotlib graph and save it to a specific mounted output directory (Must Succeed).
- Attempt to run curl http://google.com via os.system (Must Fail).
- Attempt to read /etc/passwd or the host's directory (Must Fail).
This demonstrates responsibility through engineered boundaries, proving that even hostile code executes safely.
(See Section 8 for the implementation).
6. Ethical, Security & Safety Considerations
Security: The Uselessness of Static Analysis
A common misconception is that you can secure LLM code by running a regex or AST (Abstract Syntax Tree) parser over the output to block import os or subprocess.
This is fundamentally flawed. Python is a highly dynamic language. An LLM can trivially bypass static analysis by using __import__('o'+'s'), exploiting built-in evaluation functions, or using memory manipulation. Never attempt to sanitize execution by filtering the text. The only valid security boundary is the operating system itself.
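To make the failure concrete, here is a toy blocklist filter (deliberately naive, as all such filters effectively are) and a one-line bypass:

```python
import re

# A "security" filter of the kind that should never be trusted.
BLOCKLIST = re.compile(r"\b(import\s+os|subprocess|eval|exec)\b")

def looks_safe(code: str) -> bool:
    return BLOCKLIST.search(code) is None

# The obvious payload is caught...
print(looks_safe("import os; os.system('id')"))  # False

# ...but trivial string assembly never mentions 'os' as a token,
# so the filter waves it straight through.
bypass = "__import__('o' + 's')"
print(looks_safe(bypass))  # True

# Yet at runtime it resolves to the real os module.
mod = __import__('o' + 's')
print(mod.__name__)  # os
```

Every pattern you add invites an equally trivial encoding (getattr chains, base64, chr() arithmetic). The filter is playing a game it cannot win; the kernel boundary is not.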
7. Business & Strategic Implications
Deploying vulnerable code execution environments voids enterprise compliance certifications (SOC2, HIPAA, ISO 27001). A breach originating from an AI agent's sandbox escape is not just a technical failure; it is an existential business liability.
By offloading execution to specialized MicroVM infrastructure (either internally managed Firecracker clusters or audited third-party vendors like E2B), you shift the liability of the kernel boundary and maintain a defensible security posture for enterprise clients.
8. Code Examples / Pseudocode
This example demonstrates how to wrap a standard Docker container into a hardened, ephemeral sandbox using Python's subprocess module.
import subprocess
import tempfile
import os

def execute_sandboxed_code(llm_generated_code: str) -> str:
    """
    Executes untrusted LLM code in a heavily restricted, ephemeral Docker container.
    """
    # 1. Create a temporary directory for strictly controlled I/O
    with tempfile.TemporaryDirectory() as temp_dir:
        code_path = os.path.join(temp_dir, "script.py")
        output_path = os.path.join(temp_dir, "output")
        os.makedirs(output_path)
        # The container runs as 'nobody'; the mounted output directory must be
        # world-writable or every artifact write fails with a permission error.
        os.chmod(output_path, 0o777)

        # Write the untrusted code to the temp file
        with open(code_path, "w") as f:
            f.write(llm_generated_code)

        # 2. Define the hardened Docker run command
        # Security constraints applied:
        #   --rm:             Ephemeral (destroyed immediately after run)
        #   --network none:   Absolute airgap. No internet, no host network.
        #   --read-only:      Cannot modify the container's own filesystem.
        #   --user nobody:    Runs as the lowest-privileged user.
        #   --cap-drop ALL:   Removes all Linux capabilities (shrinks escape paths).
        #   --security-opt no-new-privileges: Blocks setuid privilege escalation.
        #   --pids-limit 10:  Prevents fork bombs.
        #   -m 128m:          Strict memory limit.
        #   -v ...:           Only mounts the required script/output paths.
        docker_cmd = [
            "docker", "run", "--rm",
            "--network", "none",
            "--read-only",
            "--user", "nobody",
            "--cap-drop", "ALL",
            "--security-opt", "no-new-privileges",
            "--pids-limit", "10",
            "-m", "128m",
            "-v", f"{code_path}:/app/script.py:ro",  # Read-only mount for code
            "-v", f"{output_path}:/app/output:rw",   # Read-write mount for artifacts
            "-w", "/app",
            "python:3.11-slim",
            "timeout", "5", "python", "script.py",   # Hard in-container timeout of 5s
        ]

        try:
            # 3. Execute with a host-level timeout as a secondary fallback
            result = subprocess.run(
                docker_cmd,
                capture_output=True,
                text=True,
                timeout=10,
            )
            if result.returncode == 0:
                return (
                    f"Success:\n{result.stdout}\n"
                    f"Artifacts written to: {os.listdir(output_path)}"
                )
            # Return stderr to the LLM so it can self-correct
            return f"Execution Failed:\n{result.stderr}"
        except subprocess.TimeoutExpired:
            return "Execution Failed: Process timed out (possible infinite loop)."
        except Exception as e:
            return f"System Error: {e}"
# --- Test Cases ---
# 1. Safe execution (writes a math result to the mounted output directory)
safe_code = """
import math
with open('/app/output/result.txt', 'w') as f:
f.write(str(math.pi * 2))
print("Math calculated successfully.")
"""
# print(execute_sandboxed_code(safe_code))
# -> Success: Math calculated successfully.
# 2. Hostile Execution (Network Access)
hostile_network = """
import urllib.request
try:
urllib.request.urlopen("http://google.com")
except Exception as e:
print(f"Network error: {e}")
"""
# print(execute_sandboxed_code(hostile_network))
# -> Fails safely: Name or service not known (Network none enforced)
# 3. Hostile Execution (Filesystem / Fork Bomb)
hostile_fs = """
import os
os.system('cat /etc/shadow')
while True:
os.fork()
"""
# print(execute_sandboxed_code(hostile_fs))
# -> Fails safely due to pids-limit and user privileges.
9. Common Pitfalls & Misconceptions
- Misconception: "I'm using ast.literal_eval, so I'm safe." ast.literal_eval only parses data literals (dicts, lists, strings, numbers). If your agent needs to run loops, data transformations, or logic, you are executing raw Python, and AST parsing cannot protect you.
- Pitfall: Mounting the Docker daemon socket (/var/run/docker.sock) into the sandbox container. This allows the sandbox to spin up sibling containers with root privileges, entirely defeating the isolation.
- Pitfall: Relying solely on the LLM provider's "Code Interpreter." While useful for consumer chats, relying on an opaque, external code execution environment for enterprise data means you are shipping sensitive data outside your VPC. You must own your execution boundaries.
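The ast.literal_eval point is easy to verify: it happily parses data literals but raises on anything with behavior, which is exactly why it cannot power an agent's REPL.

```python
import ast

# Data literals parse fine — this is all literal_eval is for.
print(ast.literal_eval("{'rows': [1, 2, 3]}"))  # {'rows': [1, 2, 3]}

# Anything with behavior (calls, attribute access, loops) is rejected,
# so an agent that needs real logic cannot live inside literal_eval.
try:
    ast.literal_eval("__import__('os').system('id')")
except ValueError as e:
    print(f"rejected: {e}")
```

The moment you step beyond literal_eval to get real computation, you are back to full Python execution, and the OS-level sandbox above becomes mandatory.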
10. Prerequisites & Next Steps
- Prerequisites: Strong understanding of Linux namespaces, capabilities, and Docker networking.
- Next Steps: Integrate the hardened Docker execution function as a deterministic tool within the ReAct loop we built in Day 61, capturing the stdout/stderr as the Observation.
- Day 65: Human-in-the-Loop: Engineering the Authorization Gate.
11. Further Reading & Resources
- Firecracker MicroVMs: AWS Open Source whitepaper on lightweight virtualization for serverless computing.
- E2B (English2Bits): Documentation on managed, secure sandbox environments tailored for AI agents.
- Docker Security Best Practices: Official documentation on dropping capabilities and preventing privilege escalation.