Following an intense wave of community backlash regarding severe restrictions introduced at Google I/O 2026, Google’s Antigravity team has executed a staggering, back-to-back rollout of emergency corrective measures, ultimately tripling user quotas twice over to hand developers a massive 9x permanent increase in baseline rate limits.
The dramatic turnabout, announced directly by core Antigravity team member Varun Mohan (former founder of Windsurf), effectively undoes a highly criticized “compute-used” token nerf that had previously rendered the company’s flagship autonomous agent platform virtually unusable for heavy-duty engineering workflows.
1. The Timeline of the Developer Backlash
The rapid-fire policy pivot follows a highly volatile 72-hour window that left paid subscribers threatening a mass exodus from Google’s enterprise development suite.
The ‘Compute-Used’ Nerf at I/O
Tuesday, May 19, 2026
Google announces a migration away from generous daily prompt counts to a rigid token-and-compute consumption model. This forces all downstream platforms, including the newly rebranded Antigravity CLI and SDK ecosystems, onto strict five-hour and weekly caps.
The Quota Outrage
Wednesday, May 20, 2026
Devs flood forums after exhausting their entire weekly Google AI Pro allocations in under two hours of light multi-turn agent execution, largely due to the high context tax of the new Gemini 3.5 Flash model repeating code repositories during error debugging loops.
The First 3x Concession
Thursday, May 21, 2026
The Antigravity core team issues a public apology acknowledging “poor choices” regarding feature resource consumption. They announce a permanent 3x increase in rate limits for all paid tiers and initiate an immediate reset of all weekly user quotas.
The Second 3x Upgrade (9x Total)
Friday, May 22, 2026
Realizing the initial tripling still wasn’t enough to appease developers trapped in intensive agentic code loops, Google stacks an additional 3x multiplier on top of the previous day’s adjustment, cementing a permanent 900% capacity expansion over the launch baselines.
2. Slicing Open the New Plan Thresholds
To accommodate these newly expanded ceilings without collapsing its global data centers, Google concurrently restructured its subscription tiers, introducing an intermediate high-performance gateway between the standard Pro tier and its ultra-premium enterprise layers.
| Subscription Tier | Monthly Cost | New Slicing Framework | Operational Context Moat |
| Google AI Pro | $20 / month | High baseline capacity with 5-hour rolling refreshes | Ideal for intermediate software engineering and basic agent prototyping. |
| Google AI Ultra (New) | $100 / month | 5X higher capacity limits than the AI Pro tier | Slotted explicitly to counter specialized enterprise tools like Claude Code. Includes a limited-time bonus of $100 in Antigravity credits if claimed before May 25, 2026. |
| Google AI Ultra (Legacy) | $200 / month (Price dropped from $250) | Highest priority tier with 20TB dataset storage | Tailored for multi-agent enterprise deployments requiring native support for third-party model hooks. |
3. Under the Hood: Why the Limits Were Melted
The friction that forced Google’s hand highlights a fundamental structural issue with how advanced AI agent platforms interact with LLMs.
Unlike standard text chats, autonomous programming tools operating in the Antigravity SDK or IDE utilize “Managed Agents” that spin up isolated Linux environments to run test workflows, analyze errors, and execute edits. Every time an agent loops to correct a syntax mistake, it resubmits massive file path context blocks.
Under the short-lived compute-used model, a single runaway agent debugging an edge-case error could accidentally burn through an entire week’s worth of token quotas in a matter of minutes. By introducing the massive 9x headroom adjustment and resetting global weekly tracking matrices, Google is attempting to give developers the breathing room needed to build fully autonomous digital pipelines without hitting a sudden wall mid-workflow.
