Troubleshooting · Provider quota

Fix a Gemini quota exceeded error.

By ReduzReduzUpdated May 11, 2026 Fix guide

Google Gemini quota errors — usually shown as "RESOURCE_EXHAUSTED" or "Quota exceeded for quota metric" — come from two completely different systems depending on how you connected. The free tier through AI Studio caps you at modest per-minute and per-day request quotas; the paid tier through Vertex AI or paid AI Studio has higher per-project quotas governed by Google Cloud. The fix depends on which tier you're on, which Gemini model (Pro vs Flash vs Flash Lite), and whether the issue is per-minute throttling or a daily-quota cap that won't reset until tomorrow.

Check these first

  • You're on the AI Studio free tier, which has tight per-minute (15-60 RPM depending on model) and per-day quotas.
  • A long PDF or transcript exceeded the tokens-per-minute cap in one request.
  • Billing is not enabled on the Google Cloud project, so you're stuck at free-tier limits even though you generated a paid key.
  • The specific Gemini model (2.5 Pro especially) has lower quotas than Flash or Flash Lite.
  • Daily quotas reset at midnight Pacific time — if you hit a daily cap in the morning Pacific, it won't reset for many hours.

Fix it in this order

  1. 1

    Identify your tier (free AI Studio vs paid Vertex AI)

    Open aistudio.google.com/api-keys. If your key is from AI Studio with no billing attached, you're on the free tier with strict quotas. If billing is enabled or you're using Vertex AI, you're on the paid tier with much higher quotas — different fix.

  2. 2

    Check the quota error message for the specific limit

    Google returns specific quota names: "GenerateRequestsPerMinutePerProjectPerModel-FreeTier" or "GenerateContentInputTokensPerMinutePerProjectPerModel-FreeTier." The exact metric tells you whether to wait 60 seconds (per-minute) or wait until tomorrow (per-day).

  3. 3

    Switch to Gemini Flash or Flash Lite

    Gemini 3 Pro has the lowest free-tier quotas. Flash and Flash Lite have substantially higher RPM and daily caps. For most summarization work, Flash is comparable quality at a fraction of the quota cost. In Reduz settings under Gemini, switch the active model and retry.

  4. 4

    Enable billing for higher quotas

    Open console.cloud.google.com/billing and attach a billing account to the project tied to your Gemini API key. Paid-tier quotas are 10-100x higher than free-tier, and Gemini's paid pricing remains among the most cost-competitive (Flash is often pennies per long PDF).

  5. 5

    Wait for the right reset window

    Per-minute quotas reset on a 60-second rolling window — wait that long. Daily quotas reset at midnight Pacific time. If you hit a daily cap, switching providers is faster than waiting; switch back tomorrow.

  6. 6

    Switch to another provider in Reduz

    For urgent work, switch Reduz to OpenAI, Anthropic Claude, DeepSeek, or xAI Grok. The same source runs through a different provider. Switch back to Gemini once the quota window resets.

Diagnosis

Free tier vs paid tier

AI Studio without billing = free tier (~15-60 RPM, modest daily cap). AI Studio with billing or Vertex AI = paid tier (much higher quotas). Same UI, very different limits.

Per-minute vs per-day

Per-minute quotas (RPM, TPM) reset on a 60-second rolling window. Daily quotas reset at midnight Pacific time. The error metric name identifies which one you hit.

Model-specific

Gemini 3 Pro has lower quotas than Flash. Flash has lower quotas than Flash Lite. For high-volume summarization, Flash is usually the right default — quality is excellent and quotas accommodate daily use.

Project-specific

Quotas attach to the Google Cloud project, not just the key. If you have multiple projects, each has its own quota. The error response identifies the project ID.

Free tier may be enough for moderate use

For casual summarization (~50 articles/day), AI Studio free tier on Gemini Flash usually fits. If you're consistently hitting daily caps, enable billing — Gemini paid is among the cheapest summarization options.

Switch models or providers, or enable billing

Gemini quota errors are usually free-tier specific — enabling billing on your Google Cloud project unlocks paid-tier quotas that are 10-100x higher, and Gemini paid pricing remains among the most cost-competitive in the category. Gemini Flash specifically is often pennies per long PDF and a strong default for daily-volume summarization. While you sort out billing or wait for quota reset, Reduz lets you switch to another provider in one click — OpenAI, Anthropic Claude, DeepSeek, or xAI Grok. Hosted Free also gives you a Gemini-independent fallback with 100 monthly credits. Switch back to Gemini once your quota resets.

Frequently asked questions

Does Reduz set my Gemini quota?

No. Gemini quotas are set by Google for your API key, project, model, and tier (free AI Studio vs paid). Reduz cannot raise them — but it lets you switch to another provider in one click while the quota window resets.

Why did Gemini quota fail on one PDF?

Large PDFs send 30,000+ tokens in a single request. On the free tier, that can immediately exceed the tokens-per-minute (TPM) cap. Either reduce source size, switch to Flash (which has higher TPM caps on the free tier), or enable billing for paid-tier limits.

Is Gemini still free for daily use through Reduz?

Free AI Studio access is real but capped. For casual summarization (~50 articles/day on Flash), the free tier usually fits. For heavier daily work, enabling billing is the right move — Gemini Flash paid is often pennies per long document, well below most other paid summarization options.

How long until my Gemini quota resets?

Per-minute quotas (RPM, TPM) reset on a 60-second rolling window. Daily quotas reset at midnight Pacific time. The error response identifies which quota you hit — check the metric name (it includes "PerMinute" or daily timing).

Which Gemini model has the highest free-tier quota?

Flash Lite > Flash > Pro on the free tier. For summarization specifically, Flash is the typical default — quality is excellent and quotas are generous. Reserve Pro for harder content where quality difference is worth the lower quota.

Is Reduz free?

Yes. Reduz includes 100 free credits a month. Using your own AI key removes the credit limit.

Do I need an account?

Not when you use your own AI key. An account is only needed for free credits, paid plans, or cloud backup.

Where is my data stored?

Summary history is stored in your browser. Cloud backup is opt-in and encrypted on your device before upload.

Which AI providers does Reduz support?

Reduz supports OpenAI, Anthropic Claude, Google Gemini, DeepSeek, and xAI Grok. You can also use free credits without setting up an AI account.