A practical guide to using GitHub Copilot intentionally as it moves from request-based to usage-based billing on June 1, 2026.
GitHub Copilot Business and Copilot Enterprise usage is measured with GitHub AI Credits. Interactions consume tokens β input, output, and cached β and GitHub converts that usage into AI credits.
Important. GitHub says the pricing and billing methods described here start on June 1, 2026. Review the live GitHub docs before making billing or procurement decisions, because model availability and pricing can change.
The goal is to use Copilot with clearer intent: choose the right mode, give the right context, verify output quickly, and avoid expensive loops that do not improve the result.
1 GitHub AI Credit equals $0.01 USD. Usage is based on token consumption at published per-model rates.
Code completions and next edit suggestions are not billed in AI credits for paid plans.
Chat, CLI, cloud agent, Spaces, Spark, and third-party coding agents consume AI credits.
There is no automatic fallback to lower-cost models when a budget is exhausted.
Each Copilot license contributes its AI credit value to a single shared pool. All licensed users draw from this pool first; usage beyond it is additional spend.
| Plan | Standard AI credits / user / month | Promotional credits / user / month (Jun 1 β Sep 1, 2026) |
|---|---|---|
| Copilot Business | 1,900 | 3,000 |
| Copilot Enterprise | 3,900 | 7,000 |
Example: an organization with 100 Copilot Business users receives a shared pool of 190,000 standard AI credits per month. During the promotional period, that same organization receives 300,000 AI credits per month.
What you send: prompts and new context. Can grow with large files and codebases.
What you get: AI-generated responses and tool use. Highest cost per token.
What is reused: context from earlier in a session. Improves speed; lowest cost per token.
All prices below are per 1 million tokens β the usage-based billing rates GitHub lists for June 1, 2026. A small prompt to a lightweight model can cost a fraction of a credit; a long agent session on a frontier model across many files costs more.
| Model | Status | Category | Input | Cached input | Output |
|---|---|---|---|---|---|
| GPT-4.1 | GA | Versatile | $2.00 | $0.50 | $8.00 |
| GPT-5 mini | GA | Lightweight | $0.25 | $0.025 | $2.00 |
| GPT-5.2 | GA | Versatile | $1.75 | $0.175 | $14.00 |
| GPT-5.2-Codex | GA | Powerful | $1.75 | $0.175 | $14.00 |
| GPT-5.3-Codex | GA | Powerful | $1.75 | $0.175 | $14.00 |
| GPT-5.4 | GA | Versatile | $2.50 | $0.25 | $15.00 |
| GPT-5.4 mini | GA | Lightweight | $0.75 | $0.075 | $4.50 |
| GPT-5.4 nano | GA | Lightweight | $0.20 | $0.02 | $1.25 |
| GPT-5.5 | GA | Powerful | $5.00 | $0.50 | $30.00 |
GPT-4.1 and GPT-5 mini are included models. GPT-5.4 pricing applies to prompts with 272K tokens or fewer.
Anthropic models include a cache write cost in addition to cached input.
| Model | Status | Category | Input | Cached input | Cache write | Output |
|---|---|---|---|---|---|---|
| Claude Haiku 4.5 | GA | Versatile | $1.00 | $0.10 | $1.25 | $5.00 |
| Claude Sonnet 4 | GA | Versatile | $3.00 | $0.30 | $3.75 | $15.00 |
| Claude Sonnet 4.5 | GA | Versatile | $3.00 | $0.30 | $3.75 | $15.00 |
| Claude Sonnet 4.6 | GA | Versatile | $3.00 | $0.30 | $3.75 | $15.00 |
| Claude Opus 4.5 | GA | Powerful | $5.00 | $0.50 | $6.25 | $25.00 |
| Claude Opus 4.6 | GA | Powerful | $5.00 | $0.50 | $6.25 | $25.00 |
| Claude Opus 4.7 | GA | Powerful | $5.00 | $0.50 | $6.25 | $25.00 |
| Model | Status | Category | Input | Cached input | Output |
|---|---|---|---|---|---|
| Gemini 2.5 Pro | GA | Powerful | $1.25 | $0.125 | $10.00 |
| Gemini 3 Flash | Public preview | Lightweight | $0.50 | $0.05 | $3.00 |
| Gemini 3.1 Pro | Public preview | Powerful | $2.00 | $0.20 | $12.00 |
Gemini 2.5 Pro and Gemini 3.1 Pro pricing applies to prompts with 200K tokens or fewer. Gemini 3 Flash has no long-context surcharge.
| Model | Status | Category | Input | Cached input | Output |
|---|---|---|---|---|---|
| Grok Code Fast 1 | GA | Lightweight | $0.20 | $0.02 | $1.50 |
| Raptor mini | Public preview | Versatile | $0.25 | $0.025 | $2.00 |
| Goldeneye | Public preview | Powerful | $1.25 | $0.125 | $10.00 |
Raptor mini uses GPT-5 mini pricing. Goldeneye uses GPT-5.1-Codex pricing.
Keep Copilot useful while being thoughtful about AI credits. The main habit is simple: start narrow, give Copilot the context it needs, and expand only when the task requires it.
| Task | Good starting mode | Why |
|---|---|---|
| Fill in local implementation details | Code completions | Unlimited on paid plans and fast for narrow edits |
| Explain one file or function | Chat | Focused context and quick iteration |
| Modify several related files | Edit or agent mode | Useful when changes cross file boundaries |
| Diagnose terminal or command usage | Copilot CLI | Keeps command reasoning close to the shell |
| Review a pull request | Copilot code review | Broad review, but consumes AI credits and Actions minutes |
Goal: <specific outcome>
Scope: <files, folders, or APIs Copilot should consider>
Constraints: <what must stay unchanged>
Verification: <tests, commands, or manual checks to run>
Before editing: summarize the likely change in two sentences.
Use specific prompts that reduce unnecessary context and retries. Avoid vague prompts like Look at this repo and improve it or Fix everything and make the code better, which trigger broad exploration.
Review only this diff for functional regressions, missing tests, and surprising
behavior. Prioritize concrete findings with file references.
Prepare budget controls and operating habits so high-value Copilot usage stays available while spend stays visible and manageable. Budgets are set in USD; usage appears in AI credits ($10 covers 1,000 credits).
| Budget level | Use it for | Watch out for |
|---|---|---|
| Enterprise | Broad guardrails across organizations, repos, and cost centers | A hard stop can affect many teams at once |
| Organization | Team or product-area accountability | Shared infrastructure teams may span organizations |
| Cost center | Finance-aligned spend ownership | Needs clear mapping to engineering work |
| User | Coaching, experiments, and individual safeguards | A $0 budget disables access; exhausted budgets halt Copilot access |
| Team pattern | Additional usage policy | Budget posture |
|---|---|---|
| New rollout or pilot | Allow with alerts | Watch behavior before hard limits |
| Mature team with stable usage | Allow with user and org budgets | Tune based on normal monthly usage |
| Sensitive cost center | Limit with stricter alerts | Use clear escalation for exceptions |
| Short-term migration or incident | Temporarily allow more usage | Review after the event and reset budgets |
Multi-file migrations with measurable value, test generation for risky paths, review on complex PRs, and production incident support where speed matters.
Repeated broad prompts like "fix this repo", long agent sessions without checkpoints, premium models for trivial edits, and sending large unrelated folders.
Month:
Total AI credits used:
Included pool:
Additional spend:
Top repositories by usage:
Top workflows by usage:
High-value examples:
Coaching opportunities:
Budget changes for next month:
Reusable prompts or instructions to publish:
When tokens are cheap, agent accuracy is not important. Once they are not, quality becomes the better focus. Instead of counting tokens, make every token count.
The compound error problem. Even at 99% accuracy per step, a 50-step workflow only lands near 60% success. Agent errors compound quickly, so quality work upstream protects both ROI and credits.
Reasoning models (Opus, GPT-5.5) for planning, architecture, and debugging. Mid-tier (Sonnet, GPT-5.4) for implementation. Low-tier (Haiku, GPT-mini) for small refactors and docs. Lean on Auto mode as the default.
As much as required, as little as necessary. Provide only relevant context, use /clear for each new task, and use session compaction cautiously to avoid losing valuable information.
Start with the narrowest useful prompt. Use completions freely. Use chat for focused work, agents for multi-file autonomy, and review diffs before continuing.
Confirm billing ownership, decide post-credit policy, configure enterprise/org/cost-center/user budgets, start with alerts, then add limits, and monitor both credits and engineering value.
The repository goal is to help teams answer three questions: which Copilot features to use for which work, how model and context choices affect AI credit consumption, and which budgets and habits keep high-value Copilot usage available.