Showing category results for CSE

Nov 29, 2024
Post likes count1

Semantic Kernel Learnings

Alex,
Dan

A synopsis of our key learnings and findings from using Semantic Kernel.

CSEFrameworksCognitive Services
Nov 20, 2024
Post likes count3

Building AI Search for Production in Corporate Environments: Key Lessons

Daniel,
Alessia,
Dina

In the dynamic world of AI and data science developing production-level solutions for corporate environments comes with its own set of challenges and lessons. As a data science team working within Microsoft, we recently completed an engagement for a large company where we leveraged cutting-edge technologies, including OpenAI tools, GPT-4o for gener...

CSE
Nov 15, 2024
Post likes count2

Accessing Private Resources in Azure

Juan Burckhardt

A practical guide for developers on connecting to Azure resources with private endpoints through a VPN.

CSEISE
Oct 31, 2024
Post likes count3

Building a Multi-tenant GenAI gateway using APIM

Sushant Bhalla

This blog shares insights on developing a GenAI gateway with multi-tenancy and quota management capabilities implemented using Azure APIM where customers can access the GenAI resources across different service tiers like Freemium, Basic, and Premium with each tier having it's own quota and rate limits. The solution used the concept of "Products" to...

CSEISE
Oct 11, 2024
Post likes count8

Multimodal RAG with Vision: From Experimentation to Implementation

Pratima,
Maitreyi,
Oscar

This blog post delves into the experimentation journey of fine-tuning a multimodal RAG pipeline to best answer user queries that require both textual and image context. We ran our experiments by systematically testing various approaches, adjusting one configuration setting at a time and using clearly defined evaluation metrics to validate the perfo...

CSEISE
Sep 27, 2024
Post likes count2

PromptFlow Serve – Benchmark Result Analysis

Dasith Wijesiriwardena

In this post we discuss how to test the throughput of PromptFlow pf-serve module and key learnings doing so. We explore the impact on throughput and performance the different WSGI and ASGI hosting methods have and the importance of engineering your Python nodes with the async await pattern for I/O.

CSEISE
Sep 9, 2024
Post likes count8

Earning Agentic (and LangChain) Complexity

Michael,
Juan

This post examines the challenges of adopting complex technologies like LangChain and agentic solutions in production environments, emphasizing the importance of understanding the necessity of such complexity. It provides insights on how to evaluate these technologies carefully, manage dependencies, and adhere to best practices for secure and stabl...

CSEISE
Aug 16, 2024
Post likes count8

Semantic Router using Azure AI Search

Christopher Buckley

Potential semantic routing solution using Azure AI Search for scalable query handling.

CSEMachine LearningCognitive Services