{"id":3257,"date":"2024-09-11T09:07:13","date_gmt":"2024-09-11T16:07:13","guid":{"rendered":"https:\/\/devblogs.microsoft.com\/semantic-kernel\/?p=3257"},"modified":"2024-09-11T09:07:33","modified_gmt":"2024-09-11T16:07:33","slug":"introducing-azure-ai-inference-connector-for-azure-ai-studio-models","status":"publish","type":"post","link":"https:\/\/devblogs.microsoft.com\/agent-framework\/introducing-azure-ai-inference-connector-for-azure-ai-studio-models\/","title":{"rendered":"Introducing Azure AI Inference Connector for Azure AI Studio Models"},"content":{"rendered":"<h1 aria-level=\"1\"><b><span data-contrast=\"none\">Introduction<\/span><\/b><span data-ccp-props=\"{&quot;134245418&quot;:true,&quot;134245529&quot;:true,&quot;201341983&quot;:0,&quot;335559738&quot;:480,&quot;335559739&quot;:0,&quot;335559740&quot;:279}\">\u00a0<\/span><\/h1>\n<p><span data-contrast=\"none\">With the recent launch of the <\/span><a href=\"https:\/\/devblogs.microsoft.com\/dotnet\/azure-ai-model-catalog-dotnet-inference-sdk\/\"><span data-contrast=\"none\">Azure AI Inference SDK<\/span><\/a><span data-contrast=\"none\"> for <\/span><a href=\"https:\/\/ai.azure.com\/explore\/models\"><span data-contrast=\"none\">Azure AI Studio Model Catalog<\/span><\/a><span data-contrast=\"none\">, we are happy to announce that our dedicated Azure AI Inference connector for Semantic Kernel is now available.<\/span><span data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559738&quot;:180,&quot;335559739&quot;:180,&quot;335559740&quot;:279}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">This connector is built on top of the Azure AI Inference SDK and gives your Semantic Kernel projects easy access to the comprehensive suite of AI models in the Azure AI Model Catalog for inference.<\/span><span 
data-ccp-props=\"{&quot;201341983&quot;:0,&quot;335559738&quot;:180,&quot;335559739&quot;:180,&quot;335559740&quot;:279}\">\u00a0<\/span><\/p>\n<h2><strong><span class=\"TextRun SCXW77940664 BCX8\" lang=\"EN-US\" xml:lang=\"EN-US\" data-contrast=\"none\"><span class=\"NormalTextRun SCXW77940664 BCX8\" data-ccp-parastyle=\"heading 1\">What is the Azure AI Model Catalog?<\/span><\/span><\/strong><span class=\"EOP SCXW77940664 BCX8\" data-ccp-props=\"{&quot;134245418&quot;:true,&quot;134245529&quot;:true,&quot;201341983&quot;:0,&quot;335559738&quot;:480,&quot;335559739&quot;:0,&quot;335559740&quot;:279}\">\u00a0<\/span><\/h2>\n<p><a href=\"https:\/\/devblogs.microsoft.com\/semantic-kernel\/wp-content\/uploads\/sites\/78\/2024\/08\/blog.png\"><img decoding=\"async\" class=\"alignnone wp-image-3258 size-full\" src=\"https:\/\/devblogs.microsoft.com\/semantic-kernel\/wp-content\/uploads\/sites\/78\/2024\/08\/blog.png\" alt=\"Image blog\" width=\"624\" height=\"359\" srcset=\"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-content\/uploads\/sites\/78\/2024\/08\/blog.png 624w, https:\/\/devblogs.microsoft.com\/agent-framework\/wp-content\/uploads\/sites\/78\/2024\/08\/blog-300x173.png 300w\" sizes=\"(max-width: 624px) 100vw, 624px\" \/><\/a><\/p>\n<p>The Model Catalog in Azure AI Studio is a one-stop shop for exploring and deploying AI models.<\/p>\n<p>Models from the catalog can be deployed to Managed Compute or as a Serverless API.<\/p>\n<p>Some key features include:<\/p>\n<ul>\n<li><strong>Model Availability<\/strong>: The model catalog features a diverse collection of models from providers such as Microsoft, Azure OpenAI, Mistral, Meta, and Cohere. This ensures you can find the right model to satisfy your requirements.<\/li>\n<li><strong>Easy to Deploy<\/strong>: Serverless API deployments remove the complexity of hosting and provisioning the hardware to run cutting-edge models. 
When you deploy models as a Serverless API, you don&#8217;t need quota to host them, and you are billed per token.<\/li>\n<li><strong>Responsible AI Built-In<\/strong>: Safety is a priority. Language models from the catalog come with default configurations of Azure AI Content Safety moderation filters that detect harmful content.<\/li>\n<\/ul>\n<p>For more details, see the <a href=\"https:\/\/learn.microsoft.com\/azure\/ai-studio\/how-to\/model-catalog-overview\">Azure AI Model Catalog documentation<\/a>.<\/p>\n<h2><strong>Get Started<\/strong><\/h2>\n<ol>\n<li>Deploy a model like <a href=\"https:\/\/ai.azure.com\/explore\/models?selectedCollection=phi&amp;tid=72f988bf-86f1-41af-91ab-2d7cd011db47\">Phi-3<\/a>. For more details, see the <a href=\"https:\/\/learn.microsoft.com\/azure\/ai-studio\/how-to\/deploy-models-serverless?tabs=azure-ai-studio\">Azure AI Model Catalog deployment documentation<\/a>.<\/li>\n<li>Install the <a href=\"https:\/\/www.nuget.org\/packages\/Microsoft.SemanticKernel.Connectors.AzureAIInference\">Connectors.AzureAIInference<\/a> package from NuGet in your existing project.<\/li>\n<\/ol>\n<p>Add the following code to your application to start making requests to your model service. 
Make sure to replace the placeholder endpoint and API key with the values provided with your deployment.<\/p>\n<pre class=\"prettyprint language-cs language-csharp\"><code class=\"language-cs language-csharp\">using Microsoft.SemanticKernel.ChatCompletion;\r\nusing Microsoft.SemanticKernel.Connectors.AzureAIInference;\r\n\r\n\/\/ Create the chat completion service for your deployed model.\r\nvar chatService = new AzureAIInferenceChatCompletionService(\r\n     endpoint: new Uri(\"YOUR-MODEL-ENDPOINT\"),\r\n     apiKey: \"YOUR-MODEL-API-KEY\");\r\n\r\n\/\/ Start the conversation with a system message, then add the user's request.\r\nvar chatHistory = new ChatHistory(\"You are a helpful assistant that knows about AI.\");\r\nchatHistory.AddUserMessage(\"Hi, I'm looking for book suggestions\");\r\n\r\n\/\/ Send the conversation to the model and await its reply.\r\nvar reply = await chatService.GetChatMessageContentAsync(chatHistory);\r\n<\/code><\/pre>\n<p>For more details, see the <a href=\"https:\/\/github.com\/microsoft\/semantic-kernel\/blob\/main\/dotnet\/samples\/Concepts\/ChatCompletion\/AzureAIInference_ChatCompletion.cs\">Azure AI Inference Samples<\/a>.<\/p>\n<h2><strong>Conclusion<\/strong><\/h2>\n<p>We&#8217;re excited to see what you build! Try out the new Azure AI Inference Connector and let us know what you think. If you have any questions or feedback, please reach out through our\u00a0<a href=\"https:\/\/github.com\/microsoft\/semantic-kernel\/discussions\/categories\/general\" target=\"_blank\" rel=\"noopener\">Semantic Kernel GitHub Discussion Channel<\/a>. 
We look forward to hearing from you!\u00a0We would also love your support: if you\u2019ve enjoyed using Semantic Kernel, give us a star on\u00a0<a href=\"https:\/\/github.com\/microsoft\/semantic-kernel\" target=\"_blank\" rel=\"noopener\">GitHub<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction\u00a0 With the latest launch of the Azure AI Inference SDK for Azure AI Studio Models Catalog, we are happy to announce that we have also made available our dedicated Azure AI Inference Semantic Kernel Connector.\u00a0 This connector is specially designed on top of the published Azure AI Inference SDK and is targeted to allow [&hellip;]<\/p>\n","protected":false},"author":149071,"featured_media":2364,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-3257","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-semantic-kernel"],"acf":[],"blog_post_summary":"<p>Introduction\u00a0 With the latest launch of the Azure AI Inference SDK for Azure AI Studio Models Catalog, we are happy to announce that we have also made available our dedicated Azure AI Inference Semantic Kernel Connector.\u00a0 This connector is specially designed on top of the published Azure AI Inference SDK and is targeted to allow 
[&hellip;]<\/p>\n","_links":{"self":[{"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/posts\/3257","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/users\/149071"}],"replies":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/comments?post=3257"}],"version-history":[{"count":0,"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/posts\/3257\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/media\/2364"}],"wp:attachment":[{"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/media?parent=3257"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/categories?post=3257"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/agent-framework\/wp-json\/wp\/v2\/tags?post=3257"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}