GitHub Copilot Usage-Based Billing

On June 1, 2026, GitHub Copilot transitions from request-based billing to usage-based billing across every plan, from Copilot Free through Copilot Enterprise. The shift changes how usage is measured, how overages are handled, and what teams should do to keep costs predictable.


Beginning June 1, 2026, premium request units (PRUs) are retired in favor of GitHub AI Credits. Charges now line up with real consumption rather than a fixed per-request count.

Credits are drawn down based on token usage, including input tokens, output tokens, and cached tokens, priced according to the published per-model rates.

Notable changes

  • Copilot capabilities backed by AI models draw from your AI credits. That includes Copilot Chat, Agent mode in the IDE, Copilot CLI, Copilot cloud agent, Copilot Code Review, Copilot Spaces, Spark, and third-party coding agents.
  • Code completions and Next Edit suggestions stay included in every paid plan and don’t draw down AI Credits.
  • Fallback experiences are going away. Today, users who run out of PRUs can drop to a cheaper model and keep going. Going forward, consumption is governed by available credits and any admin-defined budget limits.
  • Copilot code review also consumes GitHub Actions minutes in addition to GitHub AI Credits, billed at the same per-minute rates as other GitHub Actions workflows.

What are GitHub AI Credits?

GitHub AI Credits are the billing unit for Copilot consumption across both individual plans (Copilot Free, Pro, Pro+, Max) and organization plans (Copilot Business, Copilot Enterprise).

Every Copilot interaction uses tokens:

  • Input tokens (sent to the model)
  • Output tokens (produced by the model)
  • Cached tokens (context the model stores or reuses)

Token cost depends on the model chosen, and the total converts into AI credits at a fixed rate where 1 AI credit = $0.01 USD.

The cost of any single interaction depends on two factors: the model in use and the token volume consumed. A short chat exchange on a lightweight model might cost a fraction of a credit. A long agent session on a frontier model spanning multiple files will cost more, simply because there’s more work happening.

How do AI credits work for individual plans?

Each individual subscription includes a monthly AI credit allowance. Paid plans get a higher cap than the free plan.

PlanPrice/monthBase creditsFlex allotmentTotal monthly AICs
Copilot Pro$10 USD1,0005001,500
Copilot Pro+$39 USD3,9003,1007,000
Copilot Max$100 USD10,00010,00020,000

Base credits match the subscription price and stay constant. The flex allotment is an additional monthly amount layered on top, designed to flex as model pricing and efficiency evolve. Base credits are spent first, and the flex allotment kicks in automatically across the IDE, GitHub.com, and Copilot CLI with no extra setup.

Copilot Free includes 2,000 code completions a month, an AI credits allowance, and auto model selection.

How do AI credits work for Business and Enterprise?

For organizations, every assigned Copilot license carries a monthly allotment of included AI credits:

PlanTotal AI credits per user per month
Copilot Business1,900
Copilot Enterprise3,900

Existing Copilot Business and Copilot Enterprise customers get a higher promotional allotment for the first three months of the new model (June 1 – September 1, 2026):

PlanTotal AI credits per user per month
Copilot Business3,000
Copilot Enterprise7,000

After that promotional window, the standard allotments above apply.

The included credits per user are pooled at the billing entity level. As an example, an organization running 100 Copilot Business licenses ends up with a shared pool of 190,000 AI credits rather than 100 separate buckets. Heavier users can pull more from the pool when needed, while lighter users effectively offset that consumption.

Adding licenses mid-cycle grows the pool right away. Removing licenses mid-cycle doesn’t shrink the pool until the next billing cycle starts.

What happens when included AI credits run out?

For individuals, two options remain when the included allowance is exhausted:

  • Set an additional usage budget and pay extra to keep working.
  • Wait for the next monthly cycle when the included allowance refreshes.

For organizations and enterprises, what happens depends on the policies configured for additional usage:

  • Additional usage allowed: Consumption continues at published per-credit rates, with the extra spend billed to the organization or enterprise.
  • Additional usage not allowed: Usage is blocked until the next billing cycle when monthly allotments reset.

When a user-level budget is set and a user hits it, that user’s Copilot access stops, even if the organization’s pool still has room. There is no automatic fallback to a cheaper model once a budget is exhausted.

Additional usage budgets are configured in US dollars while consumption shows in GitHub AI Credits. The conversion stays fixed: 1 AI credit = $0.01 USD, so a $10 budget covers 1,000 AI credits.

Controlling costs with budgets

For organizations and enterprises, budgets can be set at four levels to govern AI Credit spend:

  • Enterprise-level budgets track spending across every organization, repository, and cost center under the enterprise.
  • Cost-center-level budgets track spending for a specific cost center.
  • User-level budgets track spending per user. A $0 user-level budget means no Copilot access at all.
  • Individual-level budgets track spending for power users. Overwrites user-level budgets.

Budgets can trigger alerts as limits approach and enforce hard stops on usage. For example, to allow a small amount of overage while keeping it controlled, set a user-level budget slightly above the included allotment.

Cost Optimization Best Practices

For Copilot users:

  • Match the model to the task. Frontier models burn more credits per interaction than lightweight models. Reserve premium models for complex reasoning, large refactors, and tough debugging, and use lighter models for routine edits, simple completions, and boilerplate. Switching to a cheaper model is one of the most effective ways to stretch the included allowance.
  • Share the minimum context needed. Attach only the files, selections, or references the model truly needs. Oversharing context inflates token usage and can dilute response quality.
  • Watch conversation length and complexity. Longer threads and more elaborate tasks involve more back-and-forth and higher token consumption. Start fresh sessions for new tasks rather than extending long-running threads.
  • Use Auto-mode. Auto model selection has been refined for usage-based billing and now comes with a 10% discount. Check out the change log.

For GitHub admins:

  • Plan for pooled AI credits. Each license includes a monthly allotment that’s pooled across the organization. Teams with uneven usage patterns can benefit, since power users naturally draw more while lighter users offset that spend.
  • Communicate the change internally. Give teams a heads-up about how billing is changing and what it means for their daily Copilot work.
  • Update IDEs, clients, and Copilot extensions. Older versions keep working but may show inaccurate model pricing, outdated billing terminology, or miss usage alerts.
  • Preview your bill before the cutover. From the announcement banner on the enterprise home page, billing overview, or premium request analytics page, use Preview your usage to download a CSV report. Two extra columns show the estimated equivalent under usage-based billing: aic_quantity (AI credits consumed) and aic_gross_amount (estimated USD cost). Upload the CSV to the billing preview tool for a side-by-side view of PRU vs. AI Credit projections. Data stays local in the browser.
  • Create cost budgets. to control the shared pool usage and extra spend for the whole company.

References

[The article is written by me and AI]