Showing results for October 2024 - ISE Developer Blog

Oct 31, 2024

Building a Multi-tenant GenAI gateway using APIM

Sushant Bhalla

This blog shares insights on developing a GenAI gateway with multi-tenancy and quota management capabilities, implemented using Azure APIM. Customers can access GenAI resources across service tiers such as Freemium, Basic, and Premium, each with its own quota and rate limits. The solution used the concept of "Products" to...

CSE, ISE
Oct 11, 2024

Multimodal RAG with Vision: From Experimentation to Implementation

Pratima Sayyaparaju, Maitreyi Nair, Oscar Fimbres

This blog post delves into the experimentation journey of fine-tuning a multimodal RAG pipeline to best answer user queries that require both textual and image context. We ran our experiments by systematically testing various approaches, adjusting one configuration setting at a time and using clearly defined evaluation metrics to validate the perfo...

CSE, ISE