OpenAI has released the “Next Evolution” of its Agents SDK, fundamentally transforming how developers build and deploy autonomous AI agents. This update moves beyond simple API wrappers to provide a standardized infrastructure for “long-horizon” tasksโcomplex, multi-step operations that require a computer to think, plan, and execute over time.
The update is a direct response to Anthropic’s Managed Agents launch last week and is currently available in Python, with TypeScript support expected shortly.

1. The Core Innovation: Native Sandboxing
The most significant addition is native sandbox execution. Previously, developers had to build their own secure “containers” to prevent an agent from accidentally deleting files or running malicious code on a host machine.
- Isolated Workspaces: Agents now run in a controlled compute environment where they can safely install dependencies, read/write files, and execute terminal commands.
- Separation of Concerns: The “Harness” (the model’s reasoning logic) is kept separate from the “Compute” (the sandbox). This ensures that even if a model generates “hallucinated” code, it is trapped in a sterile environment.
- Unix-Style Permissions: Developers can now define “Users” within the sandbox, giving an agent read-only access to a data folder while allowing it full write access to an output folder.
2. The “Model-Native” Harness
OpenAI has introduced a new orchestration harness that aligns execution with how frontier models (like GPT-5.4) perform best.
- Resume Bookkeeping: If an agent is performing a task that takes hours (like refactoring a large codebase) and the connection drops, the SDK now includes built-in snapshotting and rehydration. The agent can “wake up” and continue exactly where it left off.
- Standardized Primitives: The SDK now includes built-in support for common agentic patterns like handoffs (transferring a task from one specialized agent to another) and approvals (pausing for human-in-the-loop review).
- Configurable Memory: Unlike basic context windows, this memory allows agents to keep track of their progress and state across complex, multi-tool workflows without “forgetting” the original goal.
3. Key Technical Features
| Feature | Description | Benefit |
| Manifests | A portable file that defines the agent’s workspace. | Move workflows easily between local and cloud environments. |
| Codex-Style Tools | Enhanced filesystem and shell access tools. | Allows agents to edit code and run scripts like a human dev. |
| External Mounts | Native support for S3, Azure Blob, and Google Cloud Storage. | Agents can “mount” your company data directly into their sandbox. |
| Standardized Tracing | Integrated logs for every decision and tool call. | Essential for debugging why an agent “went off the rails.” |
4. Why This Matters for Developers
As someone tracking the 27 million developer surge in India and the rise of AI automation at TCS, this SDK update removes the “infrastructure tax” of building agents:
- Production-Ready: By providing the execution layer “out of the box,” OpenAI is making it much easier for startups to move from a cool demo to a reliable production agent.
- Safety First: The sandboxing features address the #1 concern of enterprise CTOs: “What happens if the AI runs
rm -rf /on my server?” - Agentic Efficiency: The new harness reduces “looping” and hallucinations, ensuring that agents stay closer to their “natural operating pattern” for higher reliability.
5. Getting Started
If you are currently building with the older version, you can upgrade via pip:pip install "openai-agents>=0.14.0"
“This launch is about making the SDK compatible with all the major sandbox providers,” noted Karan Sharma from the OpenAI Product team. “If you do a task more than twice, it shouldn’t just be a promptโit should be an agent.”


