{"id":1482,"date":"2025-11-20T08:30:12","date_gmt":"2025-11-20T16:30:12","guid":{"rendered":"https:\/\/devblogs.microsoft.com\/foundry\/?p=1482"},"modified":"2026-01-21T10:53:46","modified_gmt":"2026-01-21T18:53:46","slug":"foundry-local-comes-to-android","status":"publish","type":"post","link":"https:\/\/devblogs.microsoft.com\/foundry\/foundry-local-comes-to-android\/","title":{"rendered":"Foundry Local comes to Android\u2014plus on-device speech, on-prem support, and a simpler SDK"},"content":{"rendered":"<p style=\"font-weight: 400;\">Today, we\u2019re introducing a powerful new way to bring AI directly to your mobile apps: Foundry Local on Android. Starting now, you can build Android applications that run AI on-device\u2014no cloud round trips required. Sign up for the gated preview here: <a href=\"https:\/\/aka.ms\/foundrylocal-androidprp\">https:\/\/aka.ms\/foundrylocal-androidprp<\/a>\u00a0<\/p>\n<p style=\"font-weight: 400;\">Most of us now carry a computer in our pocket. These devices are now powerful enough to run AI models locally, unlocking speed, privacy, and flexibility. With Foundry Local, you can easily deploy optimized open-source models from Microsoft Foundry directly on mobile devices.<\/p>\n<h2 style=\"font-weight: 400;\"><strong>Why on-device AI matters:<\/strong><\/h2>\n<ul>\n<li>Adds an extra layer of\u00a0privacy\u00a0for sensitive scenarios like healthcare or financial data<\/li>\n<li>Minimizes costs\u00a0by reducing cloud calls<\/li>\n<li>Enables\u00a0offline or low connectivity operations<\/li>\n<li>Avoids unnecessary round trips over the internet<\/li>\n<\/ul>\n<p style=\"font-weight: 400;\">Foundry Local on Android opens up massive reach for developers, making AI even more accessible to everyone. We\u2019ve been working closely with select customers in preview, including PhonePe, who have integrated Foundry Local into their mobile app\u2014serving more than 618 million users\u2014to power an upcoming AI-driven experience inside their digital payments platform.<\/p>\n<div style=\"max-width: 360px; margin: 0 auto;\">\n  <video controls style=\"width: 100%; height: auto; border-radius: 8px;\"><source src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/11\/FoundryLocalAndroid.mp4\" type=\"video\/mp4\">Your browser does not support the video tag.<\/video>\n<\/div>\n<blockquote>\n<p><span data-teams=\"true\">&#8220;The [Foundry Local] SDK allows us to leverage models on device without having to go to the server or the internet&#8221; &#8212; Gautham Krishnamurthy, Head of Product, Consumer Platform, PhonePe<\/span><\/p>\n<\/blockquote>\n<h2>On-Device Speech-to-Text (Powered by Whisper)<\/h2>\n<p style=\"font-weight: 400;\">We\u2019re announcing a new Speech API in Foundry Local. Apps can transcribe and understand speech with low latency and no audio leaves the device by default.<\/p>\n<p style=\"font-weight: 400;\">We\u2019re excited about this powerful new capability because it unlocks so many customer scenarios\u2014where voice prompting in offline environments is critical for a seamless, intuitive UX. Imagine filling out forms or capturing notes \u2013 instantly and assisted by AI &#8211; with voice, even when connectivity is poor or intermittent.<\/p>\n<p style=\"font-weight: 400;\">Here\u2019s a quick snippet showing how to acquire a Whisper model from the Foundry Catalog, load it into memory, and transcribe audio:<\/p>\n<pre class=\"prettyprint language-cs language-csharp\"><code class=\"language-cs language-csharp\">var model = await catalog.GetModelAsync(\"whisper-tiny\")\r\n\r\n\/\/ Download the model (the method skips download if already cached)\r\nawait model.DownloadAsync(progress =&gt;\r\n{\r\n    Console.Write($\"\\rDownloading model: {progress:F2}%\");\r\n    if (progress &gt;= 100f)\r\n    {\r\n        Console.WriteLine();\r\n    }\r\n});\r\n\r\n\/\/ Load the model\r\nawait model.LoadAsync();\r\n\r\n\/\/ Get an audio client\r\nvar audioClient = await model.GetAudioClientAsync();\r\n\r\n\/\/ Get a transcription with streaming outputs\r\nvar response = audioClient.TranscribeAudioStreamingAsync(\"Recording.mp3\", ct);\r\nawait foreach (var chunk in response)\r\n{\r\n    Console.Write(chunk.Text);\r\n    Console.Out.Flush();\r\n}\r\nConsole.WriteLine();\r\n<\/code><\/pre>\n<p style=\"font-weight: 400;\"><strong>Try speech-to-text on Foundry Local by following our documentation: <\/strong><a href=\"https:\/\/aka.ms\/foundrylocal-audiodocs\">https:\/\/aka.ms\/foundrylocal-audiodocs<\/a><\/p>\n<h2><strong>The new Foundry Local SDK<\/strong><\/h2>\n<p style=\"font-weight: 400;\">The new Foundry SDK delivers:<\/p>\n<ul>\n<li><strong>Self-contained packaging mechanism \u2013 no need to install separate executables for model serving.<\/strong><\/li>\n<li><strong>Smaller package and runtime footprint.<\/strong><\/li>\n<li><strong>Simple API \u2013 just a few lines to download and load models.<\/strong><\/li>\n<li><strong>APIs<\/strong> for chat completions and audio transcription that follow OpenAI request\/response<\/li>\n<li>Optional OpenAI compliant web server to integrate other tools and frameworks (for example, LangChain, OpenAI SDK, Web UI).<\/li>\n<\/ul>\n<ul style=\"font-weight: 400;\">\n<li><strong>Integration with <\/strong><a href=\"https:\/\/learn.microsoft.com\/en-us\/windows\/ai\/new-windows-ml\/overview\">Windows ML<\/a><strong> for smart device detection. <\/strong><\/li>\n<\/ul>\n<p style=\"font-weight: 400;\">These are exactly what our preview customers asked for and more.<\/p>\n<p style=\"font-weight: 400;\"><strong>Getting started is easy \u2013 see the sample snippet of code below that demonstrates how to acquire a Qwen model from the Foundry Catalog, load the model into memory and do chat completions. The Foundry Local SDK will automatically select the most performant model for the device hardware <\/strong><strong>and<\/strong><strong> ensure that all the hardware runtime and drivers are installed on the device. See Foundry Local in action in this episode of Microsoft Mechanics: <\/strong><a href=\"https:\/\/aka.ms\/FL_IGNITE_MSMechanics\">https:\/\/aka.ms\/FL_IGNITE_MSMechanics<\/a><\/p>\n<p>&nbsp;<\/p>\n<pre class=\"prettyprint language-cs language-csharp\"><code class=\"language-cs language-csharp\">\/\/ Get a model using an alias\r\nvar model = await catalog.GetModelAsync(\"qwen2.5-0.5b\");\r\n\r\n\/\/ Download the model (the method skips download if already cached)\r\nawait model.DownloadAsync(progress =&gt;\r\n{\r\n    Console.Write($\"\\rDownloading model: {progress:F2}%\");\r\n    if (progress &gt;= 100f)\r\n    {\r\n        Console.WriteLine();\r\n    }\r\n});\r\n\r\n\/\/ Load the model\r\nawait model.LoadAsync();\r\n\r\n\/\/ Get a chat client\r\nvar chatClient = await model.GetChatClientAsync();\r\n\r\n\/\/ Create a chat message\r\nList&lt;ChatMessage&gt; messages = new()\r\n{\r\n    new ChatMessage { Role = \"user\", Content = \"Why is the sky blue?\" }\r\n};\r\n\r\nvar streamingResponse = chatClient.CompleteChatStreamingAsync(messages, ct);\r\nawait foreach (var chunk in streamingResponse)\r\n{\r\n    Console.Write(chunk.Choices[0].Message.Content);\r\n    Console.Out.Flush();\r\n}\r\nConsole.WriteLine();\r\n\r\n\/\/ Tidy up - unload the model\r\nawait model.UnloadAsync();\r\n<\/code><\/pre>\n<p style=\"font-weight: 400;\"><strong>Get started here: <\/strong><a href=\"https:\/\/aka.ms\/foundrylocalSDK\">https:\/\/aka.ms\/foundrylocalSDK<\/a><\/p>\n<h2><strong>Foundry Local <\/strong><strong>with Arc<\/strong><strong>-enabled Kubernetes<\/strong><\/h2>\n<p style=\"font-weight: 400;\">On-device AI isn\u2019t limited to personal computing devices. Many of our customers need to run AI in containers on-premises, often in environments with limited connectivity or specialized hardware.<\/p>\n<p>Today, we\u2019re excited to share a sneak peek of Foundry Local\u00a0for edge, hybrid, sovereign, and disconnected scenarios, powered by Azure Arc.<\/p>\n<p>This capability extends the Foundry Local experience beyond dev\/test machines, enabling customers to seamlessly deploy what they\u2019ve validated to edge environments, such as manufacturing rigs or industrial machinery operating with intermittent connectivity. It is delivered by making Foundry Local available in a container orchestrated by Arc-enabled Kubernetes and running on Azure Local, providing a fully managed Microsoft stack for edge, hybrid, sovereign, and disconnected environments.<\/p>\n<p style=\"font-weight: 400;\">Ready to try it out? Join our gated preview list to get notified when it\u2019s available: <a href=\"https:\/\/aka.ms\/FL-K8s-Preview-Signup\">https:\/\/aka.ms\/FL-K8s-Preview-Signup<\/a><\/p>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/11\/ArcWithFoundry.webp\"><img decoding=\"async\" class=\"aligncenter  wp-image-1493\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/11\/ArcWithFoundry-300x167.webp\" alt=\"ArcWithFoundry image\" width=\"478\" height=\"266\" srcset=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/11\/ArcWithFoundry-300x167.webp 300w, https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/11\/ArcWithFoundry.webp 755w\" sizes=\"(max-width: 478px) 100vw, 478px\" \/><\/a><\/p>\n<h2>What\u2019s next<\/h2>\n<p style=\"font-weight: 400;\">On-device AI is an exciting and rapidly evolving space. Partners like NimbleEdge and early customers like PhonePe, Morgan Stanley, Dell, and Anything LLM have been instrumental in helping us deliver what we announced today. Our goal is to make Foundry Local the simplest, most trustworthy, and most powerful local AI platform for state-of-the-art models. We expect to continue to co-create with you all in delivering a robust experience for on-device AI. We have an ambitious roadmap ahead: bringing Foundry Local to General Availability, strengthening Android support, and continuing to advance Windows AI Foundry. We\u2019re also investing in richer capabilities like tool calling, support for Linux, multi-modality, and expanded support for on-prem servers.<\/p>\n<blockquote>\n<p><span class=\"TextRun SCXW107697210 BCX8\" lang=\"EN-US\" xml:lang=\"EN-US\" data-contrast=\"none\"><span class=\"NormalTextRun SCXW107697210 BCX8\">\u201cOur partnership with Microsoft on Foundry Local and the Dell AI Factory with ecosystem enablers for AI PCs gives developers broader model access and greater choice, with an expanded portfolio of AI models tailored to specific business needs and the flexibility to choose the right tools for specific goals across the Dell AI PC portfolio.\u201d &#8211; <\/span><\/span><span class=\"TextRun SCXW107697210 BCX8\" lang=\"EN-US\" xml:lang=\"EN-US\" data-contrast=\"none\"><span class=\"NormalTextRun SCXW107697210 BCX8\">Marc Hammons<\/span><\/span><span class=\"TextRun SCXW107697210 BCX8\" lang=\"EN-US\" xml:lang=\"EN-US\" data-contrast=\"none\"><span class=\"NormalTextRun SCXW107697210 BCX8\">, Dell Senior Director, CSG CTO<\/span><\/span><\/p>\n<p><span class=\"TextRun SCXW175987881 BCX8\" lang=\"EN-US\" xml:lang=\"EN-US\" data-contrast=\"none\"><span class=\"NormalTextRun SCXW175987881 BCX8\">&#8220;<\/span><span class=\"NormalTextRun SpellingErrorV2Themed SCXW175987881 BCX8\">AnythingLLM<\/span><span class=\"NormalTextRun SCXW175987881 BCX8\">, powered by Foundry Local, allows us to run lightning-fast models such as <\/span><span class=\"NormalTextRun SpellingErrorV2Themed SCXW175987881 BCX8\">Deepseek<\/span><span class=\"NormalTextRun SCXW175987881 BCX8\">, Mistral, Phi <\/span><span class=\"NormalTextRun ContextualSpellingAndGrammarErrorV2Themed SCXW175987881 BCX8\">and <span class=\"NormalTextRun SCXW175987881 BCX8\">Qwen on CPU, GPU, and NPU without needing to build our own local LLM engine. This allows us to focus on building the features and integrations enterprises use <\/span>us for<span class=\"NormalTextRun SCXW175987881 BCX8\"> to <\/span><span class=\"NormalTextRun SCXW175987881 BCX8\">leverage<\/span> <span class=\"NormalTextRun SCXW175987881 BCX8\">state-of-the-art<\/span><span class=\"NormalTextRun SCXW175987881 BCX8\"> models entirely on-device in the most powerful ways possible.&#8221; &#8211; <\/span><span class=\"NormalTextRun SCXW175987881 BCX8\">Timothy <\/span><span class=\"NormalTextRun SpellingErrorV2Themed SCXW175987881 BCX8\">Carambat<\/span><span class=\"NormalTextRun SCXW175987881 BCX8\">, CEO <\/span><span class=\"NormalTextRun SCXW175987881 BCX8\">at <\/span><span class=\"NormalTextRun SpellingErrorV2Themed SCXW175987881 BCX8\">AnythingLLM<\/span><\/span><\/span><\/p>\n<\/blockquote>\n<p>Hear from some of our customers below.<\/p>\n<p><div style=\"width: 1920px;\" class=\"wp-video\"><video class=\"wp-video-shortcode\" id=\"video-1482-1\" width=\"1920\" height=\"1072\" preload=\"metadata\" controls=\"controls\"><source type=\"video\/mp4\" src=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/11\/Customers-with-Foundry-Local-short_updated.mp4?_=1\" \/><a href=\"https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/11\/Customers-with-Foundry-Local-short_updated.mp4\">https:\/\/devblogs.microsoft.com\/foundry\/wp-content\/uploads\/sites\/89\/2025\/11\/Customers-with-Foundry-Local-short_updated.mp4<\/a><\/video><\/div><\/p>\n<p>&nbsp;<\/p>\n<p style=\"font-weight: 400;\">Come with us on this journey:<\/p>\n<ul>\n<li style=\"font-weight: 400;\">Try Foundry Local today: <a href=\"https:\/\/aka.ms\/foundrylocal\">https:\/\/aka.ms\/foundrylocal<\/a>\u00a0<\/li>\n<li aria-setsize=\"-1\" data-leveltext=\"\uf0b7\" data-font=\"Symbol\" data-listid=\"20\" data-list-defn-props=\"{&quot;335552541&quot;:1,&quot;335559685&quot;:720,&quot;335559991&quot;:360,&quot;469769226&quot;:&quot;Symbol&quot;,&quot;469769242&quot;:[8226],&quot;469777803&quot;:&quot;left&quot;,&quot;469777804&quot;:&quot;\uf0b7&quot;,&quot;469777815&quot;:&quot;hybridMultilevel&quot;}\" data-aria-posinset=\"2\" data-aria-level=\"1\"><span data-contrast=\"none\">Join us at Ignite: <\/span><a href=\"https:\/\/ignite.microsoft.com\/en-US\/sessions\/BRK199?source=sessions\"><span data-contrast=\"none\">From cloud to edge: Building and shipping Edge AI apps with Foundry<\/span><\/a><span data-ccp-props=\"{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/li>\n<li aria-setsize=\"-1\" data-leveltext=\"\uf0b7\" data-font=\"Symbol\" data-listid=\"20\" data-list-defn-props=\"{&quot;335552541&quot;:1,&quot;335559685&quot;:720,&quot;335559991&quot;:360,&quot;469769226&quot;:&quot;Symbol&quot;,&quot;469769242&quot;:[8226],&quot;469777803&quot;:&quot;left&quot;,&quot;469777804&quot;:&quot;\uf0b7&quot;,&quot;469777815&quot;:&quot;hybridMultilevel&quot;}\" data-aria-posinset=\"2\" data-aria-level=\"1\"><span data-contrast=\"none\">See Foundry Local in action: <\/span><a href=\"https:\/\/aka.ms\/FL_IGNITE_MSMechanics\"><span data-contrast=\"none\">https:\/\/aka.ms\/FL_IGNITE_MSMechanics<\/span><\/a><span data-ccp-props=\"{&quot;134233117&quot;:true,&quot;134233118&quot;:true,&quot;201341983&quot;:0,&quot;335559740&quot;:240}\">\u00a0<\/span><\/li>\n<\/ul>\n<p style=\"font-weight: 400;\">\u00a0<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Today, we\u2019re introducing a powerful new way to bring AI directly to your mobile apps: Foundry Local on Android. Starting now, you can build Android applications that run AI on-device\u2014no cloud round trips required. Sign up for the gated preview here: https:\/\/aka.ms\/foundrylocal-androidprp\u00a0 Most of us now carry a computer in our pocket. These devices are [&hellip;]<\/p>\n","protected":false},"author":189733,"featured_media":1563,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[37,1,76],"tags":[3,5,38,60,77,28],"class_list":["post-1482","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-foundrylocal","category-microsoft-foundry","category-msignite","tag-ai-development","tag-ai-tools","tag-foundry-local","tag-gpt-oss","tag-msignite","tag-whats-new"],"acf":[],"blog_post_summary":"<p>Today, we\u2019re introducing a powerful new way to bring AI directly to your mobile apps: Foundry Local on Android. Starting now, you can build Android applications that run AI on-device\u2014no cloud round trips required. Sign up for the gated preview here: https:\/\/aka.ms\/foundrylocal-androidprp\u00a0 Most of us now carry a computer in our pocket. These devices are [&hellip;]<\/p>\n","_links":{"self":[{"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/posts\/1482","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/users\/189733"}],"replies":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/comments?post=1482"}],"version-history":[{"count":0,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/posts\/1482\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/media\/1563"}],"wp:attachment":[{"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/media?parent=1482"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/categories?post=1482"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/foundry\/wp-json\/wp\/v2\/tags?post=1482"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}