{"id":312,"date":"2025-04-29T08:30:38","date_gmt":"2025-04-29T15:30:38","guid":{"rendered":"https:\/\/devblogs.microsoft.com\/foundry\/?p=312"},"modified":"2025-04-29T13:53:48","modified_gmt":"2025-04-29T20:53:48","slug":"whats-new-in-azure-ai-foundry-april-2025","status":"publish","type":"post","link":"https:\/\/devblogs.microsoft.com\/foundry\/whats-new-in-azure-ai-foundry-april-2025\/","title":{"rendered":"What&#8217;s new in Azure AI Foundry | April 2025"},"content":{"rendered":"<h2>TL;DR<\/h2>\n<p>Long-context GPT-4.1, GPT-image-1, new o-series reasoning and GPT-4o audio models headline this month\u2019s releases. On the agent side we get cross-cloud A2A, BYO thread storage, an MCP server starter, and a turnkey AI Red Team. Developers also gain a VS Code extension, richer evaluation metrics, persistent memory via Mem0, a full RAG demo suite, and new Content Understanding &amp; Document Intelligence endpoints\u2014everything you need to build, test, and ship safer GenAI apps on a single platform.<\/p>\n<h2>Join the new Azure AI Foundry Developer Forum on GitHub<\/h2>\n<p>We launched the new GitHub Discussions Developer Forum last week and we&#8217;re inviting you to connect with engineers and peers to ask questions, showcase your projects, vote in polls, and shape the roadmap\u2014all in one place. Bring your ideas, code, and curiosity!<\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-primary mb-24\" href=\"https:\/\/aka.ms\/azureaifoundry\/forum\" target=\"_blank\">Open Discussions<\/a><\/div><\/p>\n<h2>Models<\/h2>\n<h3>GPT-4.1 One-Million-Token Context<\/h3>\n<p>GPT-4.1 (and its nano\/mini variants) lifts Azure\u2019s context ceiling to <strong>1 million tokens<\/strong>, letting you pass entire codebases or multi-gigabyte corpora in one shot, while retaining GPT-4-class reasoning and function calling. That means fewer chunk-and-stitch hacks, simpler prompts, and major latency savings for large-document RAG or full-repo code reviews. Learn how to call it with the Responses API:<\/p>\n<div><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/ai-services\/openai\/how-to\/responses\" target=\"_blank\">Learn more<\/a><\/div><\/div>\n<h3>GPT-image-1 Text-&amp;-Image Generation<\/h3>\n<p>GPT-image-1 arrives in limited preview with sharper fidelity, reliable text rendering, editing \/ in-painting, and image-as-input support\u2014so you can build marketing creatives, design mocks, and visual KB answers directly in Foundry using the same REST patterns as DALLE 3.<\/p>\n<p><figure id=\"attachment_315\" aria-labelledby=\"figcaption_attachment_315\" class=\"wp-caption aligncenter\" ><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/foundry_happy_building_wp.png\"><img decoding=\"async\" class=\"size-full wp-image-315\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/foundry_happy_building_wp.png\" alt=\"foundry happy building wp image\" width=\"400\" height=\"400\" srcset=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/foundry_happy_building_wp.png 400w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/foundry_happy_building_wp-300x300.png 300w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/foundry_happy_building_wp-150x150.png 150w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/foundry_happy_building_wp-24x24.png 24w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/foundry_happy_building_wp-48x48.png 48w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/foundry_happy_building_wp-96x96.png 96w\" sizes=\"(max-width: 400px) 100vw, 400px\" \/><\/a><figcaption id=\"figcaption_attachment_315\" class=\"wp-caption-text\">Prompt: A bustling metallurgical foundry is portrayed in an isometric illustration. The scene features developers operating computers, typing into keyboards, speaking into microphones and overseeing an &#8220;ai agent factory&#8221; creation process. The environment is filled with datacenter elements such as long rows of server racks, neatly organized networking cables, and large cooling apparatuses.\u200b<br \/>Above the main workspace, a prominent banner stretches across the foundry floor, displaying the words &#8220;Happy Building&#8221; in bold, industrial-style lettering. The sign adds a touch of positivity and motivation amidst the intense industrial setting.<\/figcaption><\/figure><\/p>\n<p>Happy Building!<\/p>\n<div class=\"d-flex\"><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/learn.microsoft.com\/azure\/ai-services\/openai\/how-to\/dall-e\" target=\"_blank\">Generate images<\/a><\/div><\/div>\n<h3>o4-mini &amp; o3 Reasoning Models<\/h3>\n<p>Need faster, cheaper reasoning? The new o-series pairs GPT-4-level logical depth with lower latency and aggressive pricing, making them ideal for agent planning, re-ranking, or embedded analytics where every millisecond (and penny) counts.<\/p>\n<p><figure id=\"attachment_316\" aria-labelledby=\"figcaption_attachment_316\" class=\"wp-caption aligncenter\" ><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/reasoning_devblog.png\"><img decoding=\"async\" class=\"size-large wp-image-316\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/reasoning_devblog-1024x562.png\" alt=\"A graph showing improvements reasoning models demonstrate across challenging academic benchmarks such as GPQA Diamond, Codeforces, and AIME 2024.\" width=\"1024\" height=\"562\" srcset=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/reasoning_devblog-1024x562.png 1024w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/reasoning_devblog-300x165.png 300w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/reasoning_devblog-768x421.png 768w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/reasoning_devblog-1536x843.png 1536w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/reasoning_devblog-2048x1123.png 2048w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/a><figcaption id=\"figcaption_attachment_316\" class=\"wp-caption-text\">Source: https:\/\/openai.com\/index\/introducing-o3-and-o4-mini\/<\/figcaption><\/figure><\/p>\n<div><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/techcommunity.microsoft.com\/blog\/azure-ai-services-blog\/everything-you-need-to-know-about-reasoning-models-o1-o3-o4-mini-and-beyond\/4406846\" target=\"_blank\">Learn more<\/a><\/div><\/div>\n<h3>GPT-4o Audio (Transcribe &amp; TTS)<\/h3>\n<p><code>gpt-4o-transcribe<\/code>, <code>gpt-4o-mini-transcribe<\/code>, and <code>gpt-4o-mini-tts<\/code> bring high-quality speech-to-text and controllable text-to-speech to Azure. Stream captions, build multilingual voice bots, or generate audio replies\u2014all via familiar <code>\/audio<\/code> and <code>\/realtime<\/code> endpoints.<\/p>\n<p><figure id=\"attachment_318\" aria-labelledby=\"figcaption_attachment_318\" class=\"wp-caption aligncenter\" ><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/aoai-tts-soundboard.gif\"><img decoding=\"async\" class=\"size-full wp-image-318\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/aoai-tts-soundboard.gif\" alt=\"Demonstration of the Azure OpenAI TTS Soundboard\" width=\"1280\" height=\"720\" \/><\/a><figcaption id=\"figcaption_attachment_318\" class=\"wp-caption-text\">Demonstration of the Azure OpenAI TTS Soundboard<\/figcaption><\/figure><\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/devblogs.microsoft.com\/foundry\/get-started-azure-openai-advanced-audio-models\/\" target=\"_blank\">Get started<\/a><\/div><\/p>\n<hr \/>\n<h2>Agents<\/h2>\n<h3>AI Red Teaming Agent (Preview)<\/h3>\n<p>Built atop Microsoft\u2019s PyRIT toolkit, this agent fires automated jailbreak and prompt-injection probes at your models, scores Attack Success Rate, and logs findings into Foundry dashboards\u2014making <em>shift-left safety<\/em> a one-command reality.<\/p>\n<p><div style=\"width: 1620px;\" class=\"wp-video\"><video class=\"wp-video-shortcode\" id=\"video-312-1\" width=\"1620\" height=\"1080\" preload=\"metadata\" controls=\"controls\"><source type=\"video\/mp4\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/AI-red-teaming-agent_final_blog_asset-1.mp4?_=1\" \/><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/AI-red-teaming-agent_final_blog_asset-1.mp4\">https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/AI-red-teaming-agent_final_blog_asset-1.mp4<\/a><\/video><\/div><\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/devblogs.microsoft.com\/foundry\/ai-red-teaming-agent-preview\/\" target=\"_blank\">Learn more<\/a><\/div><\/p>\n<h3>Semantic Kernel + A2A Interop<\/h3>\n<p>A new plug-in teaches Semantic Kernel to speak Google\u2019s Agent-to-Agent JSON-RPC protocol, enabling secure cross-cloud agent collaboration\u2014exchange context, not credentials, and orchestrate multi-modal workflows spanning Azure, GCP, and OSS runtimes.<\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/devblogs.microsoft.com\/foundry\/semantic-kernel-a2a-integration\/\" target=\"_blank\">Learn more<\/a><\/div><\/p>\n<h3>MCP Server Starter (Typescript)<\/h3>\n<p>Spin up an <strong>MCP-compliant<\/strong> server in minutes; the template wires Azure AI Agents to Claude Desktop (or any MCP client) via standard JSON messages\u2014no bespoke glue code required.<\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/devblogs.microsoft.com\/foundry\/integrating-azure-ai-agents-mcp-typescript\/\" target=\"_blank\">Get started<\/a><\/div><\/p>\n<h3>BYO Thread Storage + Monitor<\/h3>\n<p>Agent Service now lets you store conversation threads in <strong>your own Cosmos DB<\/strong> and surfaces run metrics in Azure Monitor\u2014boosting data residency compliance and giving SREs first-class observability out of the box.<\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/ai-services\/agents\/quickstart?pivots=programming-language-python-azure\" target=\"_blank\">Quickstart<\/a><\/div><\/p>\n<hr \/>\n<h2>Tools<\/h2>\n<h3>VS Code Foundry Extension<\/h3>\n<p>Test models, deploy agents, and copy sample code without leaving VS Code\u2014goodbye portal context-switching, hello faster inner loops.<\/p>\n<p><div style=\"width: 1920px;\" class=\"wp-video\"><video class=\"wp-video-shortcode\" id=\"video-312-2\" width=\"1920\" height=\"1080\" preload=\"metadata\" controls=\"controls\"><source type=\"video\/mp4\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/25_66_Agent_v2.mp4?_=2\" \/><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/25_66_Agent_v2.mp4\">https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/25_66_Agent_v2.mp4<\/a><\/video><\/div><\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/devblogs.microsoft.com\/foundry\/azure-ai-foundry-vscode-extension-preview\/\" target=\"_blank\">Learn more<\/a><\/div><\/p>\n<h3>Quality &amp; Safety Evaluators<\/h3>\n<p>Four new quality metrics (intent-resolution, tool-call accuracy, task adherence, completeness) plus code-vulnerability and ungrounded-attribute safety checks plug straight into CI\/CD so every build ships with score gates.<\/p>\n<p><div style=\"width: 2160px;\" class=\"wp-video\"><video class=\"wp-video-shortcode\" id=\"video-312-3\" width=\"2160\" height=\"1440\" preload=\"metadata\" controls=\"controls\"><source type=\"video\/mp4\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2026\/04\/eval-metrics-for-agents-2.mp4?_=3\" \/><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2026\/04\/eval-metrics-for-agents-2.mp4\">https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2026\/04\/eval-metrics-for-agents-2.mp4<\/a><\/video><\/div><\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/devblogs.microsoft.com\/foundry\/evaluation-metrics-azure-ai-foundry\/?utm_source=chatgpt.com\" target=\"_blank\">Learn more<\/a><\/div><\/p>\n<h3>Fine-tuning support for GPT-4.1, GPT-4.1-mini, Phi-4 and Mistral models and more<\/h3>\n<p>The new\u00a0<strong>GPT-4.1 and GPT-4.1-mini<\/strong> models now support fine-tuning, offering enhanced reasoning and instruction-following capabilities, making them ideal for complex enterprise applications. Additionally, expanded serverless fine-tuning support for models like\u00a0<strong>Mistral<\/strong>,\u00a0<strong>Phi<\/strong>, and\u00a0<strong>NTT<\/strong> across all U.S. regions where base model inferencing is available,\u00a0improving latency and compliance with data residency requirements. The\u00a0<strong>Evaluation API<\/strong>\u00a0now supports code-first grading, allowing developers to score model outputs using built-in or custom graders, which simplifies A\/B testing, regression validation, and iterative model refinement.<\/p>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/gpt-4.1-ft.gif\"><img decoding=\"async\" class=\"aligncenter size-full wp-image-329\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/04\/gpt-4.1-ft.gif\" alt=\"gpt 4 1 ft image\" width=\"3828\" height=\"1971\" \/><\/a><\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/techcommunity.microsoft.com\/blog\/azure-ai-services-blog\/advancing-fine-tuning-in-azure-ai-foundry-april-2025-updates\/4408745\" target=\"_blank\">Learn more<\/a><\/div><\/p>\n<h3>Mem0 Persistent Memory Layer<\/h3>\n<p>Mem0 + Azure AI Search lets assistants <strong>remember<\/strong> user details across sessions via semantic retrieval\u2014boosting personalization without extra infra.<\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/devblogs.microsoft.com\/foundry\/azure-ai-mem0-integration\/?utm_source=chatgpt.com\" target=\"_blank\">Get started<\/a><\/div><\/p>\n<h3>Content Understanding 2024-12-01 Preview<\/h3>\n<p>The new API adds <strong>generative &amp; classification fields<\/strong>, faster video segmentation, and multi-analyzer docs, producing structured JSON ready for LLM ingestion across docs, audio, and video.<\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/ai-services\/content-understanding\/quickstart\/use-rest-api\" target=\"_blank\">Read the docs<\/a><\/div><\/p>\n<h3>Document Intelligence v4.0 Container<\/h3>\n<p>Run the Layout model on-prem or at the edge via new v4.0 containers\u2014perfect for air-gapped PDF\/OCR scenarios that need local processing yet Azure-compatible APIs.<\/p>\n<p><div  class=\"d-flex justify-content-left\"><a class=\"cta_button_link btn-secondary mb-24\" href=\"https:\/\/learn.microsoft.com\/en-us\/azure\/ai-services\/document-intelligence\/containers\/install-run?view=doc-intel-4.0.0\" target=\"_blank\">Read the docs<\/a><\/div><\/p>\n<hr \/>\n<p>Happy building\u2014and let us know what you ship with #AzureAIFoundry!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>This post highlights the latest features and updates in Azure AI Foundry for April 2025, focusing on new models, agents, and tools that enhance AI development capabilities.<\/p>\n","protected":false},"author":185793,"featured_media":1563,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[1,27],"tags":[3,29,12,4,2,28],"class_list":["post-312","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-microsoft-foundry","category-whats-new","tag-ai-development","tag-april-2025","tag-azure-openai","tag-generative-ai","tag-microsoft-foundry","tag-whats-new"],"acf":[],"blog_post_summary":"<p>This post highlights the latest features and updates in Azure AI Foundry for April 2025, focusing on new models, agents, and tools that enhance AI development capabilities.<\/p>\n","_links":{"self":[{"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/posts\/312","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/users\/185793"}],"replies":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/comments?post=312"}],"version-history":[{"count":0,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/posts\/312\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/media\/1563"}],"wp:attachment":[{"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/media?parent=312"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/categories?post=312"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/tags?post=312"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}