GitHub Copilot: Claude Opus 4.8 Fast Mode in Preview

GitHub CopilotView original changelog

GitHub rolled out Claude Opus 4.8 in fast mode as a preview for GitHub Copilot on June 29, 2026, delivering 2.5x faster output token speeds while maintaining the same intelligence as the standard Opus 4.8 model. The fast mode variant is priced at $10 per million input tokens and $50 per million output tokens under usage-based billing, roughly three times cheaper than the fast mode tier that shipped with Claude Opus 4.7. It is available to Copilot Pro+, Max, Business, and Enterprise users across all major IDEs and developer surfaces, with Business and Enterprise administrators required to manually enable the policy in Copilot settings.

Key Takeaways

  • Fast mode for Claude Opus 4.8 brings 2.5x faster output token speeds, running at up to 62 tokens per second without degrading the intelligence of the underlying model.
  • Pricing is roughly three times cheaper than prior fast modes: at $10/$50 per million tokens, it costs significantly less than the Opus 4.7 fast tier while offering the same speed profile.
  • All major Copilot surfaces are supported, including VS Code, Visual Studio, JetBrains, Xcode, Eclipse, Copilot CLI, GitHub Mobile, and the Copilot cloud agent.
  • Access is gated to Pro+, Max, Business, and Enterprise plans, leaving Copilot Free and standard Pro users on standard model options only.
  • Business and Enterprise admins must manually enable the policy in Copilot settings before their developers can select fast mode from the model picker.
  • The fast mode release builds on the Opus 4.8 GA launch from May 28, which itself delivered a 4x reduction in self-missed code flaws, meaning fast mode inherits a meaningfully stronger base model than prior fast tiers ran on.

GitHub Copilot Adds Claude Opus 4.8 Fast Mode in Preview

GitHub introduced a fast mode preview for Claude Opus 4.8 in GitHub Copilot on June 29, 2026, offering developers access to Anthropic's most capable coding model at significantly higher throughput than the standard variant.

What Fast Mode Delivers

Fast mode for Claude Opus 4.8 targets the responsiveness requirements of interactive development and long-running agentic sessions. The variant delivers 2.5x faster output token speeds compared to standard Opus 4.8, measured at up to 62 tokens per second by independent benchmarking. Critically, this speed increase does not come at the expense of model quality: fast mode runs the same underlying Opus 4.8 weights, preserving the reasoning depth that made the base model notable when it launched in general availability on May 28, 2026.

Pricing: Significantly Cheaper Than Previous Fast Modes

GitHub prices fast mode under usage-based billing at $10 per million input tokens and $50 per million output tokens. This represents a meaningful cost improvement over earlier fast mode iterations: the Opus 4.7 fast tier cost approximately three times more per token for the same speed profile. The pricing positions Opus 4.8 fast mode as a practical option for teams running agentic workflows at scale, where latency and cost both factor into model selection decisions.

Standard Claude Opus 4.8 remains available at its existing price point of $5 per million input tokens and $25 per million output tokens for developers who prioritize cost over throughput.

Availability Across Surfaces

Fast mode for Opus 4.8 is accessible through all major GitHub Copilot surfaces where model selection is supported, including Visual Studio Code (in chat, ask, edit, and agent modes), Visual Studio, JetBrains IDEs, Xcode, Eclipse, the Copilot CLI, GitHub Mobile on iOS and Android, github.com, and the Copilot cloud agent.

Eligible plans include Copilot Pro+, Max, Business, and Enterprise. Copilot Free and standard Pro plan users do not have access to fast mode.

Administrator Requirements for Business and Enterprise

Fast mode for Claude Opus 4.8 is disabled by default for Business and Enterprise organizations. Administrators must navigate to Copilot settings and explicitly enable the fast mode policy before developers on their plan can select it from the model picker. This follows the same administrative gate pattern introduced for earlier premium model tiers.

Context: A Faster Variant of an Already-Strong Model

Claude Opus 4.8 launched in general availability for GitHub Copilot in late May 2026 with a reported 4x reduction in self-missed code flaws compared to Opus 4.7. The addition of fast mode extends the model's utility to scenarios where response latency directly affects developer flow, such as real-time inline completions, back-and-forth chat sessions, and multi-step agent loops that would otherwise stall waiting for output.

GitHub notes that rollout is gradual, and users who do not yet see the model in their picker should check back within the coming days.


Mentioned onX (Twitter)