Google triples Gemini usage limits for Antigravity

Following an intense wave of community backlash regarding severe restrictions introduced at Google I/O 2026, Google’s Antigravity team has executed a staggering, back-to-back rollout of emergency corrective measures, ultimately tripling user quotas twice over to hand developers a massive 9x permanent increase in baseline rate limits.

The dramatic turnabout, announced directly by core Antigravity team member Varun Mohan (former founder of Windsurf), effectively undoes a highly criticized “compute-used” token nerf that had previously rendered the company’s flagship autonomous agent platform virtually unusable for heavy-duty engineering workflows.

1. The Timeline of the Developer Backlash

The rapid-fire policy pivot follows a highly volatile 72-hour window that left paid subscribers threatening a mass exodus from Google’s enterprise development suite.

The ‘Compute-Used’ Nerf at I/O

Tuesday, May 19, 2026

Google announces a migration away from generous daily prompt counts to a rigid token-and-compute consumption model. This forces all downstream platforms, including the newly rebranded Antigravity CLI and SDK ecosystems, onto strict five-hour and weekly caps.

The Quota Outrage

Wednesday, May 20, 2026

Devs flood forums after exhausting their entire weekly Google AI Pro allocations in under two hours of light multi-turn agent execution, largely due to the high context tax of the new Gemini 3.5 Flash model repeating code repositories during error debugging loops.

The First 3x Concession

Thursday, May 21, 2026

The Antigravity core team issues a public apology acknowledging “poor choices” regarding feature resource consumption. They announce a permanent 3x increase in rate limits for all paid tiers and initiate an immediate reset of all weekly user quotas.

The Second 3x Upgrade (9x Total)

Friday, May 22, 2026

Realizing the initial tripling still wasn’t enough to appease developers trapped in intensive agentic code loops, Google stacks an additional 3x multiplier on top of the previous day’s adjustment, cementing a permanent 900% capacity expansion over the launch baselines.

2. Slicing Open the New Plan Thresholds

To accommodate these newly expanded ceilings without collapsing its global data centers, Google concurrently restructured its subscription tiers, introducing an intermediate high-performance gateway between the standard Pro tier and its ultra-premium enterprise layers.

Subscription Tier	Monthly Cost	New Slicing Framework	Operational Context Moat
Google AI Pro	$20 / month	High baseline capacity with 5-hour rolling refreshes	Ideal for intermediate software engineering and basic agent prototyping.
Google AI Ultra (New)	$100 / month	5X higher capacity limits than the AI Pro tier	Slotted explicitly to counter specialized enterprise tools like Claude Code. Includes a limited-time bonus of $100 in Antigravity credits if claimed before May 25, 2026.
Google AI Ultra (Legacy)	$200 / month (Price dropped from $250)	Highest priority tier with 20TB dataset storage	Tailored for multi-agent enterprise deployments requiring native support for third-party model hooks.

3. Under the Hood: Why the Limits Were Melted

The friction that forced Google’s hand highlights a fundamental structural issue with how advanced AI agent platforms interact with LLMs.

Unlike standard text chats, autonomous programming tools operating in the Antigravity SDK or IDE utilize “Managed Agents” that spin up isolated Linux environments to run test workflows, analyze errors, and execute edits. Every time an agent loops to correct a syntax mistake, it resubmits massive file path context blocks.

Under the short-lived compute-used model, a single runaway agent debugging an edge-case error could accidentally burn through an entire week’s worth of token quotas in a matter of minutes. By introducing the massive 9x headroom adjustment and resetting global weekly tracking matrices, Google is attempting to give developers the breathing room needed to build fully autonomous digital pipelines without hitting a sudden wall mid-workflow.

Get the day’s top stories in your inbox

One concise email. No spam, unsubscribe anytime.

1. The Timeline of the Developer Backlash

2. Slicing Open the New Plan Thresholds

3. Under the Hood: Why the Limits Were Melted

Related Stories

Google Gemini Overtakes ChatGPT in Instagram Creator Conversations Across Influencer Categories

NVIDIA Launches ‘Cosmos 3 Edge’ to Power Real-Time AI on Robots and Autonomous Systems

Asia Pacific cyberscam losses hit as much as $114 billion

Leave a Comment Cancel reply