ISE Developer Blog
Solving global tech challenges, sharing insights, and empowering developers
Latest posts
Building a Multi-tenant GenAI gateway using APIM
This blog shares insights on developing a GenAI gateway with multi-tenancy and quota management capabilities implemented using Azure APIM where customers can access the GenAI resources across different service tiers like Freemium, Basic, and Premium with each tier having it's own quota and rate limits. The solution used the concept of "Products" to group APIs related to specific entitlements and using Product Policies, simplifying the overall design and ensuring scalability.
Multimodal RAG with Vision: From Experimentation to Implementation
This blog post delves into the experimentation journey of fine-tuning a multimodal RAG pipeline to best answer user queries that require both textual and image context. We ran our experiments by systematically testing various approaches, adjusting one configuration setting at a time and using clearly defined evaluation metrics to validate the performance of each component of the RAG pipeline in isolation, as well as the end-to-end inference flow.
Streamlining Error Handling in Power Automate for Better Workflow Performance
This post explores effective error handling strategies in Power Automate to enhance workflow reliability and maintainability through practical techniques and integrations.
PromptFlow Serve – Benchmark Result Analysis
In this post we discuss how to test the throughput of PromptFlow pf-serve module and key learnings doing so. We explore the impact on throughput and performance the different WSGI and ASGI hosting methods have and the importance of engineering your Python nodes with the async await pattern for I/O.
Earning Agentic (and LangChain) Complexity
This post examines the challenges of adopting complex technologies like LangChain and agentic solutions in production environments, emphasizing the importance of understanding the necessity of such complexity. It provides insights on how to evaluate these technologies carefully, manage dependencies, and adhere to best practices for secure and stable AI applications.
Semantic Router using Azure AI Search
Potential semantic routing solution using Azure AI Search for scalable query handling.
Measuring Differentials of Product Images in AI-generated Ads
Various methodologies of measuring differentials between a product image and an AI-generated ad for the purpose of product representation integrity in serving AI-generated ads.
Converting multiple sources to FHIR using Liquid
A brief summary of how Microsoft ISE teams creation a solution to integrate patients data from multiple fragmented sources into a standard FHIR data model
LLM Prompt Injection Considerations With Tool Use
This post discusses a pattern to prevent a class of prompt injection attacks in LLM-based solutions. It emphasizes the importance of building strong foundational patterns to mitigate risks and avoid potential pitfalls. By implementing this pattern, teams can enhance the security of their tool-based solutions.