Showing results for CSE - ISE Developer Blog

Oct 31, 2024

Building a Multi-tenant GenAI gateway using APIM

Sushant Bhalla

This blog shares insights on developing a GenAI gateway with multi-tenancy and quota management capabilities, implemented using Azure APIM. Customers can access GenAI resources across different service tiers, such as Freemium, Basic, and Premium, with each tier having its own quota and rate limits. The solution used the concept of "Products" to...

CSE, ISE
Oct 11, 2024

Multimodal RAG with Vision: From Experimentation to Implementation

Pratima Sayyaparaju, Maitreyi Nair, Oscar Fimbres

This blog post delves into the experimentation journey of fine-tuning a multimodal RAG pipeline to best answer user queries that require both textual and image context. We ran our experiments by systematically testing various approaches, adjusting one configuration setting at a time and using clearly defined evaluation metrics to validate the perfo...

CSE, ISE
Sep 27, 2024

PromptFlow Serve – Benchmark Result Analysis

Dasith Wijesiriwardena

In this post, we discuss how to test the throughput of the PromptFlow pf-serve module and share key learnings from doing so. We explore how the different WSGI and ASGI hosting methods affect throughput and performance, and why it is important to engineer your Python nodes with the async/await pattern for I/O.

CSE, ISE
Sep 9, 2024

Earning Agentic (and LangChain) Complexity

Michael Lanzetta, Juan Burckhardt

This post examines the challenges of adopting complex technologies like LangChain and agentic solutions in production environments, emphasizing the importance of understanding whether such complexity is actually necessary. It provides insights on how to evaluate these technologies carefully, manage dependencies, and adhere to best practices for secure and stabl...

CSE, ISE
Aug 16, 2024

Semantic Router using Azure AI Search

Christopher Buckley

A potential semantic routing solution that uses Azure AI Search for scalable query handling.

CSE, Machine Learning, Cognitive Services
Jul 26, 2024

LLM Prompt Injection Considerations With Tool Use

Dasith Wijesiriwardena

This post discusses a pattern to prevent a class of prompt injection attacks in LLM-based solutions. It emphasizes the importance of building strong foundational patterns to mitigate risks and avoid potential pitfalls. By implementing this pattern, teams can enhance the security of their tool-based solutions.

CSE, ISE