EN FR
Developer Productivity β€” GitHub Copilot

Usage-Based Billing with Confidence

A practical guide to using GitHub Copilot intentionally as it moves from request-based to usage-based billing on June 1, 2026.

πŸ“… June 1, 2026 πŸͺ™ GitHub AI Credits 🏒 Microsoft

01 What Changes on June 1, 2026

GitHub Copilot Business and Copilot Enterprise usage is measured with GitHub AI Credits. Interactions consume tokens β€” input, output, and cached β€” and GitHub converts that usage into AI credits.

Important. GitHub says the pricing and billing methods described here start on June 1, 2026. Review the live GitHub docs before making billing or procurement decisions, because model availability and pricing can change.

The goal is not to use Copilot less

The goal is to use Copilot with clearer intent: choose the right mode, give the right context, verify output quickly, and avoid expensive loops that do not improve the result.

πŸͺ™

AI Credits replace requests

1 GitHub AI Credit equals $0.01 USD. Usage is based on token consumption at published per-model rates.

∞

Completions stay unlimited

Code completions and next edit suggestions are not billed in AI credits for paid plans.

πŸ€–

AI features consume credits

Chat, CLI, cloud agent, Spaces, Spark, and third-party coding agents consume AI credits.

🚧

No automatic fallback

There is no automatic fallback to lower-cost models when a budget is exhausted.

Key facts to remember

02 Included AI Credits & Pooling

Each Copilot license contributes its AI credit value to a single shared pool. All licensed users draw from this pool first; usage beyond it is additional spend.

$0.01
1 AI Credit (USD)
1,900
Business credits / user / month
3,900
Enterprise credits / user / month
Pooled
At the billing entity level
PlanStandard AI credits / user / monthPromotional credits / user / month (Jun 1 – Sep 1, 2026)
Copilot Business1,9003,000
Copilot Enterprise3,9007,000

Example: an organization with 100 Copilot Business users receives a shared pool of 190,000 standard AI credits per month. During the promotional period, that same organization receives 300,000 AI credits per month.

Tokens drive cost

⬆️

Input tokens

What you send: prompts and new context. Can grow with large files and codebases.

⬇️

Output tokens

What you get: AI-generated responses and tool use. Highest cost per token.

♻️

Cached tokens

What is reused: context from earlier in a session. Improves speed; lowest cost per token.

03 How Model Pricing Works

All prices below are per 1 million tokens β€” the usage-based billing rates GitHub lists for June 1, 2026. A small prompt to a lightweight model can cost a fraction of a credit; a long agent session on a frontier model across many files costs more.

OpenAI models

ModelStatusCategoryInputCached inputOutput
GPT-4.1GAVersatile$2.00$0.50$8.00
GPT-5 miniGALightweight$0.25$0.025$2.00
GPT-5.2GAVersatile$1.75$0.175$14.00
GPT-5.2-CodexGAPowerful$1.75$0.175$14.00
GPT-5.3-CodexGAPowerful$1.75$0.175$14.00
GPT-5.4GAVersatile$2.50$0.25$15.00
GPT-5.4 miniGALightweight$0.75$0.075$4.50
GPT-5.4 nanoGALightweight$0.20$0.02$1.25
GPT-5.5GAPowerful$5.00$0.50$30.00

GPT-4.1 and GPT-5 mini are included models. GPT-5.4 pricing applies to prompts with 272K tokens or fewer.

Anthropic models

Anthropic models include a cache write cost in addition to cached input.

ModelStatusCategoryInputCached inputCache writeOutput
Claude Haiku 4.5GAVersatile$1.00$0.10$1.25$5.00
Claude Sonnet 4GAVersatile$3.00$0.30$3.75$15.00
Claude Sonnet 4.5GAVersatile$3.00$0.30$3.75$15.00
Claude Sonnet 4.6GAVersatile$3.00$0.30$3.75$15.00
Claude Opus 4.5GAPowerful$5.00$0.50$6.25$25.00
Claude Opus 4.6GAPowerful$5.00$0.50$6.25$25.00
Claude Opus 4.7GAPowerful$5.00$0.50$6.25$25.00

Google models

ModelStatusCategoryInputCached inputOutput
Gemini 2.5 ProGAPowerful$1.25$0.125$10.00
Gemini 3 FlashPublic previewLightweight$0.50$0.05$3.00
Gemini 3.1 ProPublic previewPowerful$2.00$0.20$12.00

Gemini 2.5 Pro and Gemini 3.1 Pro pricing applies to prompts with 200K tokens or fewer. Gemini 3 Flash has no long-context surcharge.

xAI & fine-tuned GitHub models

ModelStatusCategoryInputCached inputOutput
Grok Code Fast 1GALightweight$0.20$0.02$1.50
Raptor miniPublic previewVersatile$0.25$0.025$2.00
GoldeneyePublic previewPowerful$1.25$0.125$10.00

Raptor mini uses GPT-5 mini pricing. Goldeneye uses GPT-5.1-Codex pricing.

04 User Guide β€” Token-Smart Habits

Keep Copilot useful while being thoughtful about AI credits. The main habit is simple: start narrow, give Copilot the context it needs, and expand only when the task requires it.

1. Define the outcome in one or two sentences.
2. Attach or reference only the files needed for the task.
3. Ask Copilot to inspect before editing when the codebase is unfamiliar.
4. Prefer small implementation steps over one broad instruction.
5. Review diffs after each meaningful change.
6. Run targeted tests before asking for a larger verification pass.
7. Save repeated prompts as reusable instructions or prompt files.

Choose the right Copilot mode

TaskGood starting modeWhy
Fill in local implementation detailsCode completionsUnlimited on paid plans and fast for narrow edits
Explain one file or functionChatFocused context and quick iteration
Modify several related filesEdit or agent modeUseful when changes cross file boundaries
Diagnose terminal or command usageCopilot CLIKeeps command reasoning close to the shell
Review a pull requestCopilot code reviewBroad review, but consumes AI credits and Actions minutes

Use a prompt template

Goal: <specific outcome>
Scope: <files, folders, or APIs Copilot should consider>
Constraints: <what must stay unchanged>
Verification: <tests, commands, or manual checks to run>
Before editing: summarize the likely change in two sentences.

Prompt for less waste

Use specific prompts that reduce unnecessary context and retries. Avoid vague prompts like Look at this repo and improve it or Fix everything and make the code better, which trigger broad exploration.

Review only this diff for functional regressions, missing tests, and surprising
behavior. Prioritize concrete findings with file references.

05 Admin Guide β€” Budgets & Controls

Prepare budget controls and operating habits so high-value Copilot usage stays available while spend stays visible and manageable. Budgets are set in USD; usage appears in AI credits ($10 covers 1,000 credits).

Budget levels

Budget levelUse it forWatch out for
EnterpriseBroad guardrails across organizations, repos, and cost centersA hard stop can affect many teams at once
OrganizationTeam or product-area accountabilityShared infrastructure teams may span organizations
Cost centerFinance-aligned spend ownershipNeeds clear mapping to engineering work
UserCoaching, experiments, and individual safeguardsA $0 budget disables access; exhausted budgets halt Copilot access

Recommended policy starting points

Team patternAdditional usage policyBudget posture
New rollout or pilotAllow with alertsWatch behavior before hard limits
Mature team with stable usageAllow with user and org budgetsTune based on normal monthly usage
Sensitive cost centerLimit with stricter alertsUse clear escalation for exceptions
Short-term migration or incidentTemporarily allow more usageReview after the event and reset budgets

Coaching signals

βœ…

Healthy high usage

Multi-file migrations with measurable value, test generation for risky paths, review on complex PRs, and production incident support where speed matters.

🧭

Deserves coaching

Repeated broad prompts like "fix this repo", long agent sessions without checkpoints, premium models for trivial edits, and sending large unrelated folders.

Monthly review template

Month:
Total AI credits used:
Included pool:
Additional spend:
Top repositories by usage:
Top workflows by usage:
High-value examples:
Coaching opportunities:
Budget changes for next month:
Reusable prompts or instructions to publish:

06 Training β€” Agent Quality & Token Optimization

When tokens are cheap, agent accuracy is not important. Once they are not, quality becomes the better focus. Instead of counting tokens, make every token count.

The compound error problem. Even at 99% accuracy per step, a 50-step workflow only lands near 60% success. Agent errors compound quickly, so quality work upstream protects both ROI and credits.

The two biggest levers

🎚️

Model choice & auto mode

Reasoning models (Opus, GPT-5.5) for planning, architecture, and debugging. Mid-tier (Sonnet, GPT-5.4) for implementation. Low-tier (Haiku, GPT-mini) for small refactors and docs. Lean on Auto mode as the default.

🧩

Context engineering

As much as required, as little as necessary. Provide only relevant context, use /clear for each new task, and use session compaction cautiously to avoid losing valuable information.

Research β†’ Plan β†’ Implement

Step 1

Research

  • "I want to change X. What files are relevant?"
  • Discovery before edits
Step 2

Plan

  • Turn findings into a precise spec
  • Agree the change before code
Step 3

Implement

  • Small steps from the plan
  • Deterministic guardrails (tests, linters)
Step 4

Verify

  • Review the diff, run targeted tests
  • Log agent misses for next time

Persistent guidance that pays off

07 Quick Transition Checklist

πŸ‘©β€πŸ’»

For users

Start with the narrowest useful prompt. Use completions freely. Use chat for focused work, agents for multi-file autonomy, and review diffs before continuing.

πŸ› οΈ

For admins

Confirm billing ownership, decide post-credit policy, configure enterprise/org/cost-center/user budgets, start with alerts, then add limits, and monitor both credits and engineering value.

Make every token count

The repository goal is to help teams answer three questions: which Copilot features to use for which work, how model and context choices affect AI credit consumption, and which budgets and habits keep high-value Copilot usage available.

08 References