June 2nd, 2026
celebrate1 reaction

What’s new in Microsoft Foundry | Build Edition

Senior Program Manager, Developer Experience

TL;DR

At Build 2026, Microsoft Foundry added more of the platform pieces developers need for production agents: runtime, tools, memory, grounding, models, observability, and governance. This recap highlights what shipped, what is in preview, and where to start.

  • Agent development got faster: Microsoft Agent Framework adds stable orchestration building blocks, and Foundry Toolkit for VS Code is now generally available.
  • Production agent hosting matured: hosted agents are expected to reach general availability by early July 2026, with sandboxed sessions, state, filesystem access, and framework flexibility.
  • Agents can connect to more tools and channels: Toolboxes in Foundry is in public preview, Voice Live adds real-time voice paths, and Foundry agents can publish to Microsoft Teams and Microsoft 365 Copilot, planned for general availability in June 2026.
  • Agent memory expanded: Memory in Foundry Agent Service, in public preview, now includes procedural, user, and session memory.
  • Foundry IQ became a broader knowledge plane: serverless retrieval, new knowledge sources, knowledge bases, Web IQ, security updates, and agentic retrieval improvements reduce custom retrieval-augmented generation (RAG) plumbing.
  • Model and compute options expanded: new MAI models, Fireworks AI on Foundry, Managed Compute, fine-tuning, and Frontier Tuning give developers more choices for inference, training, and customization.
  • Trust, evaluation, and observability moved closer to the development loop: ASSERT, Agent Control Specification (ACS), Guided Guardrail Setup, Rubric, tracing, Agent Optimizer, and Agent ROI help teams evaluate, govern, and improve agents.

Start here: what to try first

If you only have time to try one thing, start with the row that matches your current build problem:

If you are building… Start with… Why it matters
A production agent runtime Hosted agents in Foundry Agent Service Managed sandboxed sessions, state, filesystem access, and framework flexibility
An agent with many tools Toolboxes in Foundry One governed endpoint for tools, skills, Model Context Protocol (MCP) clients, and enterprise data
An enterprise knowledge experience Foundry IQ knowledge bases SLA-backed retrieval and MCP access without custom indexing glue
A real-time voice agent Voice Live Speech recognition, text-to-speech, turn detection, interruption handling, and avatars in one API
Agent safety and evaluation workflows ASSERT, ACS, and Rubric Policy-driven evaluations, deterministic runtime controls, and generated scoring criteria
Model experimentation or custom inference Fireworks AI on Foundry, MAI models, and Managed Compute More model options through Azure endpoints with enterprise controls

Developer tools and SDKs

The updates in Microsoft Agent Framework give developers more stable building blocks for agent applications:

  • Agent harness with skills, memory, and middleware, now in stable release.
  • Integrations with GitHub Copilot SDK and Claude Agent SDK, now in stable release.
  • Multi-agent orchestration patterns including Magentic-One, now in stable release.
  • File system tools, memory tools, and the deep research agent, now in public preview.

Foundry Toolkit for VS Code is now generally available. Use it to create agents from templates or with GitHub Copilot, debug runs locally with trace visualization, connect to Toolboxes, and deploy to Foundry Agent Service from VS Code.

Animated demo of the Foundry Toolkit for VS Code showing agent development and debugging workflows.

Try this next: Open Foundry Toolkit in VS Code, create an agent from a template, and run it locally before deploying it to Foundry Agent Service.

Watch: Build smarter AI systems in Foundry as models and costs evolve for model selection, benchmarking, and developer workflow guidance in Microsoft Foundry.

Agents

Build 2026 adds more of the pieces developers need to move agents from prototypes to production:

Capability Status Use when…
Hosted agents Expected general availability by early July 2026 You need managed runtime, sandboxing, state, durable filesystem access, and framework flexibility
Routines Public preview You want an agent to run on a timer or schedule, such as overnight issue triage or daily reporting
Toolboxes Public preview You need one managed endpoint for tools, skills, MCP clients, and governance
Voice Live prompt agents Generally available You want the fastest path to a real-time voice experience
Hosted voice agents Public preview You want your own runtime or orchestration framework connected to Voice Live
Memory Public preview You want agents to retain procedural, user, or session context
Teams and Microsoft 365 Copilot publishing Planned for general availability in June 2026 You want users to access your Foundry agent where they already work

Deploy production agents with hosted agents and routines

Hosted agents in Foundry Agent Service, expected to reach general availability by early July 2026, provide a managed runtime for production agents. Every session runs in its own sandbox with dedicated compute, memory, and filesystem access.

The runtime is framework-agnostic, so agents built with Microsoft Agent Framework, GitHub Copilot SDK, LangGraph, or other software development kits (SDKs) can be deployed without rewrites. Two protocols are supported:

  • Responses API for OpenAI-compatible stateful interactions.
  • Invocations protocol for schema-free, pass-through scenarios where you control the request and response format.

Hosted agents now support long-running autonomous agents like OpenClaw and Hermes with durable state and filesystem access, and routines, in public preview, for running any agent on a timer or schedule.

Try this next: Explore the sample code with Microsoft Agent Framework, then deploy an agent to hosted agents.

Watch: From prototype to production: build and run agents at scale for the hosted-agent runtime story.

Toolboxes and skills

Toolboxes in Foundry, in public preview, gives your agent a single managed endpoint for every tool type. Configure tools once, point any MCP client at one URL, and let Foundry handle auth, lifecycle, and governance.

Skills, in preview, are now first-class: versioned in a project-scoped catalog and discoverable as MCP resources by any agent in the project. Tool search, also in preview, helps select the right tools per task instead of surfacing every tool to the model.

Toolbox also connects to Microsoft IQ, including Work IQ in preview, Fabric IQ in preview with Fabric data agent, Ontology, and semantic models, and Foundry IQ, so agents can tap enterprise data without custom plumbing.

Animated demo of the Microsoft Foundry Toolboxes interface showing a catalog of tools, versions, and descriptions.

Try this next: Create a Toolbox, connect it to an MCP-compatible client, and test the Microsoft Agent Framework and Toolbox sample.

Watch: Turn your agents into action: Connect tools, APIs, and documents for tool, API, and Toolbox patterns.

Voice Live

Voice Live unifies speech recognition, text-to-speech, turn detection, interruption handling, avatars, and other real-time conversational features into a single API.

For teams building with prompt agents, Voice Live is now generally available as the fastest path to adding real-time voice experiences. Existing agent capabilities, including tool calling, knowledge, memory, guardrails, and enterprise integrations, can now use low-latency speech interactions.

For teams that need full control over runtime and orchestration, hosted agents with Voice Live are available in public preview. Developers can use Microsoft Agent Framework, LangChain, or a custom stack, host on Foundry Agent Service, and connect directly to Voice Live.

Try this next: Use Voice Live with a prompt agent if you want the shortest path to voice, or pair hosted agents with Voice Live if you need custom orchestration.

Memory

Memory in Foundry Agent Service, in public preview, now includes three types:

  • Procedural memory, new at Build in public preview, helps agents learn how to do the work across runs, not just what was said. Early Tau-bench results show +7–14% absolute success-rate gains at near-baseline cost.
  • User memory remembers preferences and facts across sessions, such as “user is allergic to dairy.”
  • Session memory maintains context within a conversation thread.

Try this next: Enable memory on a test agent and compare task success, tool calls, and token use across repeated runs.

Publish agents where people work

With publishing to Microsoft Teams and Microsoft 365 Copilot, planned for general availability in June 2026, any Foundry agent can be deployed directly into the tools employees already use, with identity, permissions, and policy flowing through automatically.

Animated demo showing a Foundry agent published into a Microsoft business application experience.

Try this next: Publish a low-risk internal agent to Teams, validate permissions, and then expand to Microsoft 365 Copilot.

Knowledge and grounding

Grounding an agent in enterprise knowledge usually means building a retrieval-augmented generation (RAG) pipeline from scratch: chunking, indexing, retrieval, and a different integration for every data source.

Foundry IQ replaces that with a dedicated knowledge layer behind your Foundry agents. It unifies Work IQ, Fabric IQ, Azure SQL, File Search, and MCP sources behind one service-level agreement (SLA)-backed retrieval endpoint. Foundry IQ is available today and is wired into Toolboxes in Foundry, so agents can tap enterprise data without custom plumbing.

Foundry IQ update Status Developer action
Foundry IQ Serverless Public preview Create a Foundry IQ resource or review the SKU documentation
New knowledge sources Public preview Ground agents across Work IQ, Fabric IQ, File Search, Azure SQL, and MCP with a multi-source knowledge base; start with the cookbook
Microsoft Web IQ in Foundry IQ Limited access Use live web, licensed publisher, and marketplace data when your agent needs fresh external context
Foundry IQ knowledge bases Generally available Create an SLA-backed knowledge base and query it with the quickstart
Agentic retrieval quality improvements Available Compare answer quality, latency, and token use with the agentic retrieval quickstart
Security updates Public preview Test encryption, permission sync, and sensitivity-label governance with the security cookbook
Data pipeline updates Public preview Use layout-aware ingestion, image serving, and broader SharePoint indexing when documents include more than raw text

When your agent needs the live web, Web IQ provides sub-200 ms web grounding with zero data retention. It surfaces browse, news, web, video, and image results and is available today for select Azure customers.

Both Foundry IQ and Web IQ roll up into Microsoft IQ, the generally available intelligence layer for connecting work data, business data, and agent knowledge.

Watch: Foundry IQ: Fuel agents with enterprise knowledge and agentic retrieval and Build context-aware agents: From data to decisions for the grounding and IQ deep dive.

Models and compute

New MAI models in Microsoft Foundry

Microsoft’s MAI model family expands in Microsoft Foundry with four first-party models entering public preview at Build:

  • MAI-Thinking-1, a mid-weight large language model for chat and reasoning.
  • MAI-Image-2.5, an updated image generator with image-to-image editing.
  • MAI-Transcribe-2, a speech-to-text model with speaker diarization and content biasing.
  • MAI-Voice-2, a multilingual text-to-speech model with voice cloning.

Together, they cover the four core generative modalities: text, image, transcription, and voice.

Action: Try the new models in the Foundry catalog.

Fireworks AI on Foundry

Fireworks AI on Microsoft Foundry is now generally available, bringing open-model inference through a single Azure endpoint with enterprise SLAs, zero-setup onboarding, no separate infrastructure, and no separate contracts.

Fireworks AI on Foundry provides day-zero access, low latency, high throughput, and support for custom-weight models. It is validated for enterprise use with provisioned throughput unit (PTU) Data Zone support and SOC 2 readiness, governed by the same access controls and audit logging as the rest of Foundry.

Action: Spin up a Fireworks endpoint from the Foundry catalog and benchmark latency, quality, and throughput against your current setup.

Managed Compute in Foundry Models

Use Managed Compute in Foundry Models when regional GPU capacity is the bottleneck and you want Foundry to route workloads globally without managing infrastructure.

  • Capacity: route around regional GPU constraints without managing infrastructure.
  • Fine-tuning: spin up flexible compute for bespoke model training.
  • Operations: reduce infrastructure management for model workloads.

Action: Try Managed Compute for a workload that currently depends on capacity in one region.

Fine-tuning and Frontier Tuning

Fine-tuning in Foundry delivers higher-quality results than prompt engineering alone, broader example coverage, token savings, and lower-latency requests on smaller models. Frontier Tuning is more than 10x more cost-efficient than GPT-5.5 on tasks like producing technical Microsoft documentation.

Action: Try the new developer tier for fine-tuning: no hosting fees, just experiment.

Watch: Post-Training and Deploying Open Source Reasoning Models in Foundry, Turn foundation models into production AI on Microsoft Foundry, and Orchestrate special agents with NVIDIA Nemotron models on Foundry for model customization and open-model deployment patterns.

Trust, observability, and security

Build also introduced new ways to evaluate, control, and improve agents across the development lifecycle:

Capability Status Use when…
ASSERT Open source You want to turn written policies into executable agent evaluations
Agent Control Specification Open source You need deterministic controls at input, model, state, tool, and output checkpoints
Guided Guardrail Setup Public preview You want recommended guardrails based on your agent’s audience, data, and use case
Rubric evaluator Public preview You want generated, weighted quality criteria for an agent-specific scorecard
Tracing and evaluations for any framework Public preview You need observability for agents built on LangChain, Semantic Kernel, or custom frameworks
Agent Optimizer Coming soon in public preview You want production traces to feed ranked, reviewable improvement suggestions
Agent ROI Private preview You need to connect agent performance to business impact

Watch: Observe and control agents across any framework with open source tools, From observability to ROI for AI agents on any framework, and Build secure and enterprise-ready agents with Agent 365 for the trust, observability, and governance track.

ASSERT: open-source policy-driven agent evaluation

ASSERT is Microsoft’s new framework for policy-driven agent evaluation, built on a proven Microsoft Research approach. ASSERT converts your policies into concrete, measurable evaluations, systematically generates targeted evaluation scenarios, and surfaces safety and quality defects before they reach production.

ASSERT is open source and works across LangChain, CrewAI, LightLLM, OpenAI, and more.

Resources:

Action: Clone the ASSERT GitHub repository and run it against your agent’s policies.

Agent Control Specification: an open standard

Agent Control Specification (ACS) is an open industry specification for placing deterministic safety and security controls at five checkpoints in an agent’s lifecycle: input, large language model (LLM), state, tool execution, and output.

ACS is expressed as a portable YAML contract: versionable, auditable, and framework-agnostic. Reference implementations are available for major platforms, and the partner ecosystem at launch includes Infosys, KPMG, IBM, Aviatrix, BigSpin, and CrewAI.

Resources:

Action: Review the Agent Control Specification, then use the Agent Governance Toolkit to apply controls at checkpoints.

Guided Guardrail Setup in Foundry Agent Builder

Guided Guardrail Setup in Foundry Agent Builder is in public preview. A short questionnaire about your agent’s audience, data access, and use case surfaces the risks relevant to your scenario and recommends the right controls, such as personally identifiable information (PII) filters, jailbreak protection, and task adherence, with no security expertise required.

Action: Run the Guided Guardrail Setup wizard on your next agent.

Rubric evaluator

Rubric is a new evaluator type in Microsoft Foundry, in public preview, that auto-generates evaluation criteria based on your agent’s specific context. It creates custom quality criteria from your agent definition and use case, supports weighted dimensions for aggregate scoring, and runs alongside existing safety and quality evaluators for a unified scorecard. It also feeds directly into Agent Optimizer.

Action: Generate a Rubric for your agent and replace your static benchmarks.

Tracing, evaluations, optimization, and ROI

Tracing and evaluations for any agent framework, in public preview, brings Foundry’s production-grade tracing and evaluations to agents built on LangChain, Semantic Kernel, or any custom framework, so no team has to choose between its stack and observability.

Agent Optimizer in Foundry Agent Service, coming soon in public preview, runs Foundry’s full evaluation suite directly within Foundry AI Operations Service and feeds results into Foundry Optimizer, closing the loop from production signal to ranked, reviewable improvements.

ROI for agents in Foundry, in private preview, measures the real business impact of your agents, including task completion rates, time saved, and cost efficiency, giving stakeholders the data they need to justify investment and prioritize what to improve.

Resources and community

Author

Nick Brady
Senior Program Manager, Developer Experience

Since 2018, Nick has been the unofficial spokesperson for Microsoft's AI Platform having delivered hundreds of engagements to customers and partners in his time at Microsoft. Nick leads developer experience for Microsoft Foundry, focusing on content, code, and community to empower every developer to build, customize, and scale AI applications.

0 comments