Anthropic Files Confidentially for IPO at $965 Billion

At Build 2026, Microsoft revealed a complete in-house AI model family trained without OpenAI data. The strategic implications for GitHub Copilot, enterprise compliance, and the AI model market are enormous.

By Katrina Voss, Competitive Intelligence · Jun 3, 2026 · 12 min read

Microsoft Builds an Insurance Policy Against OpenAI

At Microsoft Build 2026, Satya Nadella announced a complete family of in-house AI models — the MAI family, developed under the internal codename Project Solara, trained independently of OpenAI datasets, and beginning deployment across Microsoft's own products immediately. The announcement was framed as "model diversity," but every enterprise AI team that heard it understood the strategic subtext: Microsoft has built the technical capability to operate without OpenAI if it ever needs to.

The timing is not accidental. Microsoft's OpenAI partnership, initiated with a $1 billion investment in 2019 and expanded to $13 billion through 2023, comes up for renegotiation on key exclusivity terms in 2026. The Build announcement — complete with live deployment of MAI-Code-1-Flash inside GitHub Copilot — ensures that Microsoft enters those renegotiations with a credible in-house alternative rather than a roadmap dependency.

For enterprise buyers, this is significant independent of whether they ever touch a MAI model directly. The existence of Microsoft's in-house AI capability changes the pricing leverage, compliance documentation, and multi-vendor optionality of every AI contract being signed on Azure today.

The MAI Family: Seven Models, Seven Use Cases

Microsoft released seven MAI models at Build 2026, spanning the full spectrum of enterprise AI workloads:

Model	Capability	Primary Use Case	Release Status
MAI-Thinking-1	Extended reasoning, 200K context	Enterprise analysis, complex research	Preview
MAI-Code-1	Full-precision code generation	Production code review, refactoring	GA
MAI-Code-1-Flash	Fast, low-latency code completion	GitHub Copilot inline suggestions	GA, live in Copilot
MAI-DS-R1	Data science and structured analytics	SQL generation, Jupyter automation	Preview
MAI-Vision-1	Multimodal text and image understanding	Document analysis, diagram extraction	Preview
MAI-Mini	Sub-3B parameter model	On-device deployment, EU data residency	GA
MAI-Embed-1	Embeddings and semantic retrieval	Azure AI Search, RAG pipeline indexing	GA

The flagship deployment at Build is MAI-Code-1-Flash shipping live in GitHub Copilot. This is the first time Microsoft has displaced an OpenAI model at the product layer in a generally available product. Previous GitHub Copilot releases ran on OpenAI Codex and GPT-4o for inline completions. MAI-Code-1-Flash is now the default completion model for Copilot subscribers, with OpenAI models available as an alternative routing option.

According to Microsoft's Build documentation, MAI-Code-1-Flash achieves comparable performance to GPT-4o-mini on standard code generation benchmarks while running at approximately 40% lower inference cost on Azure's compute infrastructure. The latency profile — sub-100ms median time-to-first-token at high load — is specifically optimized for the inline completion experience, where developer satisfaction drops measurably above 200ms latency.

"No OpenAI Data": The Enterprise Compliance Angle

The most strategically significant phrase in Microsoft's Build AI announcement — buried in a breakout session rather than the keynote — is that MAI models were trained without OpenAI-licensed datasets.

This distinction matters because enterprise AI governance is evolving rapidly. The EU AI Act's General-Purpose AI provisions, which took effect for major model providers in August 2025, require technical documentation of training data sources and licensing provenance. For regulated industries — financial services, healthcare, legal services, and defense contractors — procurement teams are beginning to ask AI vendors for training data attestations as a standard part of vendor qualification.

Microsoft's training data architecture for MAI relies on three sources: first-party licensed datasets, synthetic data generated from Microsoft's own models, and public domain corpora. None of these sources involve OpenAI-licensed data pools or outputs derived from ChatGPT. This gives Microsoft's legal team a clean provenance story that is materially different from models where training involved distillation from or reference to commercial model outputs.

The GitHub Copilot token billing changes announced at Build — moving to consumption-based pricing for agentic use cases — pair naturally with the MAI family's cost structure. Microsoft can offer MAI-Code-1-Flash at a lower per-token rate than GPT-4o, creating a pricing path where enterprise customers who migrate completions to MAI models reduce their per-seat costs while remaining on the Microsoft platform.

For Chief Legal Officers and Chief Compliance Officers tracking AI training data provenance as an emerging procurement risk, the MAI announcement gives Microsoft's sales teams a new answer to a question that has been getting harder: can you document exactly what data trained the model I am deploying in my regulated environment?

Project Solara: Microsoft's Parallel AI Research Track

Microsoft's in-house AI capability did not appear overnight. Project Solara — the internal program that produced the MAI family — has been running since approximately 2022, initially as a contingency planning exercise and subsequently as a strategic research investment.

The program currently employs more than 400 researchers and engineers operating in a dedicated track separate from Microsoft's Azure OpenAI Service team. Solara has its own compute clusters, training infrastructure, evaluation protocols, and data sourcing agreements. This operational independence from the OpenAI integration team was intentional: it ensures that Solara's research roadmap can proceed independently of the partnership's commercial terms.

Microsoft's Phi series — small language models that Microsoft has been releasing openly since 2023, starting with Phi-1 and progressing through Phi-4 by late 2025 — was an early output of this research track. The Phi series demonstrated that Microsoft could train competitive models at smaller parameter scales and provided the team with the infrastructure, data sourcing practices, and evaluation frameworks that transferred directly to the larger MAI family.

The strategic logic of maintaining Solara was articulated by Microsoft CTO Kevin Scott at Build: a cloud platform that depends entirely on a single foundation model supplier has no pricing leverage, limited customization ability, and a single point of failure in its AI architecture. Model diversity is not a competitive feature — it is a resilience requirement.

This framing mirrors the argument Microsoft made to enterprise customers when it built Azure's multi-cloud compatibility — that lock-in to a single provider is a structural business risk, and that optionality is worth paying for. Applied to AI models, the same argument creates a compelling pitch for Azure as the platform where multi-model optionality lives.

The OpenAI Partnership: What Changes and What Does Not

Microsoft was careful at Build to frame MAI as model diversity rather than a departure from the OpenAI partnership. Satya Nadella described the relationship as "the most important partnership in AI" and confirmed that OpenAI's latest models remain available through Azure OpenAI Service.

What this framing obscures is the material change in negotiating dynamics. Microsoft now enters every contract renewal and partnership renegotiation with OpenAI holding a credible in-house alternative. The specific performance characteristics of MAI-Code-1-Flash — competitive benchmarks, lower inference cost, production deployment already live in GitHub Copilot — mean that Microsoft's leverage in pricing negotiations with OpenAI has increased materially.

Microsoft's Agent 365 launch earlier this spring established the product architecture that makes this leverage operational: Agent 365 routes enterprise AI tasks to the most appropriate model based on cost, capability, and compliance requirements. MAI models are now a first-class routing option in Agent 365 alongside OpenAI, Anthropic, Meta, and Mistral models. Enterprise customers who use Agent 365 can shift workload between model providers without application-layer changes.

As AI infrastructure continues to commoditize, the platform that wins enterprise distribution may not be the one with the best individual model — it may be the one that makes multi-model orchestration easiest to operate. Microsoft's architecture positions Azure as that neutral platform, and SAP's Anthropic MCP integration earlier this spring suggests that third-party enterprise platforms are moving in the same direction: abstracting the model layer to give buyers routing optionality.

MAI-Thinking-1 vs. Claude and GPT-4.5: The Benchmark Picture

The model Microsoft faces the most scrutiny on is MAI-Thinking-1, its extended reasoning model competing directly with Anthropic's Claude 3.7 Sonnet and OpenAI's GPT-4.5 on enterprise analysis tasks.

Microsoft's published benchmark comparisons show MAI-Thinking-1 performing strongly on mathematical and scientific reasoning while trailing on software engineering:

Benchmark	MAI-Thinking-1	Claude 3.7 Sonnet	GPT-4.5
MATH (competition)	82.4%	81.2%	80.8%
GPQA Diamond	68.1%	71.0%	67.3%
SWE-bench Verified	49.3%	70.3%	55.1%
MMLU Pro	74.2%	75.1%	73.8%

MAI-Code-1 — the larger code-specific model — performs at 61.2% on SWE-bench, substantially better than MAI-Thinking-1 but still 9 percentage points below Claude 3.7 Sonnet with extended thinking. For software engineering tasks, Anthropic retains a meaningful capability advantage that the MAI launch does not close.

On reasoning benchmarks outside software engineering, MAI-Thinking-1 is genuinely competitive at the frontier. This pattern suggests Microsoft's training data and optimization focused on scientific and mathematical reasoning, while Anthropic's code-grounded training methodology continues to produce superior software engineering performance.

MAI-Thinking-1's pricing of $6/M input tokens and $18/M output tokens on Azure positions it as a premium reasoning model rather than a commodity alternative. This is not a price-based competitive strategy; it is a capability-based positioning argument for enterprise customers who want an alternative to Anthropic or OpenAI reasoning models with better data provenance documentation.

The Enterprise Strategy Playbook

For enterprise AI teams evaluating the MAI announcement, the strategic questions are more important than any single benchmark comparison.

1. Map your current model exposure. Identify which Microsoft products in your environment run on OpenAI models today — Azure OpenAI Service, GitHub Copilot, Microsoft 365 Copilot. This is your potential MAI migration surface area. The Agent 365 control plane makes workload migration possible without application-layer changes for many use cases.

2. Assess your data provenance requirements. If your compliance or legal team has raised training data documentation as a procurement criterion, request Microsoft's technical data sheets for MAI models under NDA. This documentation exists as of Build 2026 and Microsoft's enterprise sales teams are briefed to provide it.

3. Deploy MAI-Code-1-Flash in GitHub Copilot and measure developer acceptance. The inline completion model is GA and included in existing Copilot seats at no additional cost. Measure developer acceptance rates — what percentage of MAI suggestions are accepted versus rejected compared to previous OpenAI-based completions. This is the fastest, lowest-risk evaluation available for the MAI family.

4. Price the optionality value explicitly. Multi-model routing — the ability to shift workloads between OpenAI, MAI, Anthropic, and open-source models — has real option value even if you never exercise it. An AI infrastructure architecture that can reroute away from a supply disruption, a price increase, or a capability gap is worth more than equivalent single-vendor architecture. Build this into your AI vendor evaluation scoring.

The Competitive Stakes for Anthropic

Anthropic's reported $965 billion IPO valuation is partly a bet on the durability of Claude's enterprise positioning and code reasoning advantages. The MAI family launch, and specifically the narrowing of the benchmark gap on reasoning tasks, suggests that this advantage will face more structural competition in the next 12 months.

The SWE-bench gap — 70.3% for Claude 3.7 Sonnet versus 61.2% for MAI-Code-1 — is real and meaningful for software engineering use cases. Anthropic's code reasoning advantage has been one of the clearest sources of enterprise differentiation, and the GitHub Copilot token billing analysis shows that Claude Code users pay three to ten times the per-seat cost of GitHub Copilot precisely because of this advantage.

Microsoft's direct entry into the code generation model market with a GA product deployed in GitHub Copilot means that the benchmark gap Anthropic currently holds needs to widen — not hold steady — for the enterprise pricing premium to be sustainable. The model-level competition and the distribution-level competition will run on separate tracks: Claude's Anthropic-native enterprise distribution is not threatened by Microsoft's in-house capability, but the pricing premium on Azure deployments is under direct pressure.

For 2026, the question the MAI launch raises is simple: if Microsoft can close the SWE-bench gap by 12 to 15 percentage points in the next model generation — a realistic target given where MAI-Code-1 already stands — does Claude's enterprise code reasoning premium survive? The answer will define both companies' revenue trajectories for the next three years.

Takeaway: Microsoft's MAI family is the most consequential AI announcement at Build 2026 not because any individual model definitively outperforms OpenAI or Anthropic — the benchmarks are mixed, and the SWE-bench gap with Claude 3.7 Sonnet is real — but because it permanently changes the structure of enterprise AI procurement. Microsoft now has pricing leverage with OpenAI, a training data provenance story that satisfies EU AI Act documentation requirements, and a live production deployment in GitHub Copilot that proves the capability is ready. For enterprise AI teams, the question is no longer whether to evaluate MAI models — the Copilot integration means most enterprises already are. The question is how to use MAI's existence as negotiating leverage with every AI vendor, including Microsoft itself.

Frequently Asked Questions

What models did Microsoft announce at Build 2026?

Microsoft announced seven MAI (Microsoft AI) models at Build 2026: MAI-Thinking-1 for extended reasoning, MAI-Code-1 for full-precision code generation, MAI-Code-1-Flash for low-latency inline completions, MAI-DS-R1 for data science and structured analytics, MAI-Vision-1 for multimodal document understanding, MAI-Mini as a sub-3B parameter edge model for on-device deployment, and MAI-Embed-1 for embeddings and semantic retrieval. MAI-Code-1-Flash launched as the default completion model in GitHub Copilot at Build, replacing the previous OpenAI Codex and GPT-4o completions. MAI-Code-1, MAI-Mini, and MAI-Embed-1 are generally available on Azure; MAI-Thinking-1, MAI-DS-R1, and MAI-Vision-1 are in preview.

How does MAI-Thinking-1 compare to Claude 3.7 Sonnet and GPT-4.5?

On mathematical and scientific reasoning benchmarks, MAI-Thinking-1 is competitive at the frontier: 82.4% on MATH competition problems versus Claude 3.7 Sonnet's 81.2% and GPT-4.5's 80.8%, and 68.1% on GPQA Diamond expert reasoning versus Claude's 71.0% and GPT-4.5's 67.3%. The gap is most pronounced on software engineering: MAI-Code-1 scores 61.2% on SWE-bench Verified while Claude 3.7 Sonnet with extended thinking scores 70.3%. For enterprise teams using AI in code review, refactoring, and software development workflows, Anthropic retains a meaningful capability advantage. For scientific analysis, financial modeling, and research summarization, MAI-Thinking-1 benchmarks within margin of error of the frontier models at a pricing tier comparable to Claude 3.5 Sonnet on Azure.

What does 'no OpenAI data' mean for enterprise AI compliance?

Microsoft's MAI models were trained without OpenAI-licensed datasets, synthetic data derived from ChatGPT outputs, or content from OpenAI training data pools. This has practical implications for enterprises operating under EU AI Act requirements, which mandate training data provenance documentation for General-Purpose AI models. Regulated industries — financial services, healthcare, and legal services — increasingly include AI training data provenance in vendor qualification requirements. Microsoft's clean provenance architecture means enterprise legal and compliance teams can obtain documentation showing exactly what licensed data trained the MAI models, without exposure to third-party intellectual property licensing entanglements associated with models that used GPT-4 outputs during training. This is a meaningful differentiator for procurement in regulated verticals.

Does the MAI launch change Microsoft's OpenAI partnership?

Microsoft has framed the MAI launch as model diversity rather than a departure from OpenAI. OpenAI models remain available through Azure OpenAI Service, and Microsoft describes the partnership as intact. What changes materially is Microsoft's negotiating leverage: it now enters partnership renegotiations with a credible in-house alternative that has demonstrated production readiness in GitHub Copilot. The financial impact on OpenAI — which distributes a substantial share of API revenue through Azure — depends on how much workload Microsoft shifts to MAI models over the next 12 months. Enterprise customers evaluating the MAI announcement should also assess how multi-model routing through Agent 365 changes their own negotiating leverage with both Microsoft and OpenAI on upcoming contract renewals.

What is Project Solara and when did Microsoft start it?

Project Solara is Microsoft's internal AI model research program, reportedly operational since approximately 2022. It employs more than 400 researchers and engineers in a track separate from the Azure OpenAI Service team, with dedicated compute clusters, training infrastructure, and data sourcing agreements independent of the commercial OpenAI partnership. The program's early outputs include Microsoft's Phi series of small language models, released openly starting in 2023. Solara's operational independence from the OpenAI integration team was intentional: it ensures Microsoft's in-house AI roadmap can proceed regardless of partnership terms. The MAI family announced at Build 2026 is the program's first major generally available product output, with MAI-Code-1-Flash already deployed to GitHub Copilot's global subscriber base.