ISE Developer Blog

Solving global tech challenges, sharing insights, and empowering developers

Latest posts

Jun 4, 2026
Post likes count 1

How we Decide Between Keyword and Hybrid Search: 5 Enterprise Evaluation Criteria

Beijie Zhang

A data-driven framework we use in enterprise deployments to decide between vector-only keyword and hybrid search, based on five measurable evaluation criteria.

May 28, 2026
Post likes count 0

Verification-driven tooling prompts for fast-moving codebases

Dexter Williams

A practical pattern for using GenAI research agents to produce reliable internal tooling by verifying assumptions with detection classes and a final checklist.

May 21, 2026
Post likes count 4

Coordinating AI-Assisted Development with AGENTS.md and Skills

Jimmy,
Destiny

How to move AI-assisted development from ad-hoc experimentation to a coordinated team-wide capability using AGENTS.md and reusable skills with GitHub Copilot CLI

May 14, 2026
Post likes count 0

WebAssembly Data Processing at the Edge with Azure IoT Operations

Marcel Bindseil

Safe, composable dataflow operators for Azure IoT Operations, built as WASM modules using the Component Model, WIT interfaces, and WASI Preview 2.

May 7, 2026
Post likes count 2

SQL query generation from natural language

Ashley Costigane

Evaluating AI agents for NL-to-SQL generation across Azure Databricks AI/BI Genie, GitHub Copilot CLI, and Microsoft Agent Framework. We achieved ~75% accuracy with schema documentation and runtime validation, while discovering that business logic errors represent a fundamental limitation requiring domain expertise.

Apr 30, 2026
Post likes count 2

Propagating SharePoint Document Permissions to AI Search and RAG Pipelines

Elena Vavilova

A security-first architecture that propagates SharePoint document permissions into downstream AI systems, using Microsoft Entra ID object IDs (GUIDs) for safe, query-time filtering in Azure AI Search, RAG pipelines, and Copilot extensions.

Apr 23, 2026
Post likes count 1

Lessons Learned from Building a Well-Matching Intelligence Layer

Emmeline,
Oscar,
Kavitha

Key insights from the EDA and Ground Truth journey for solving well identity resolution in the Energy industry—challenges that extend to any entity matching problem across distributed systems.

Apr 20, 2026
Post likes count 0

Discoverable – Observable MCP Server

Colin Desmond

How to build an observable MCP Server with discoverable tools and agents.

Mar 26, 2026
Post likes count 1

Building Search-Enabled Agents with Azure AI Foundry and Semantic Kernel and A2A

Munish Malhotra

A step-by-step guide to enable search in the semantic kernel agents using bing grounding in Azure Foundry AI Agent using A2A.

Mar 19, 2026
Post likes count 2

From Azure IoT Operations Data Processor Pipelines to Dataflows

Maho Pacheco

In this post we explore the evolution from Azure IoT Operations Data Processor Pipelines to Dataflows, why we adopted a hybrid strategy with custom Rust pods, and the architectural lessons we learned building event detection systems at the edge.

Mar 6, 2026
Post likes count 0

Using Agents to Setup Experiments

Peter Lasne

When setting up an experiment is complex, agents can automate the process to make it faster and more reliable.

Feb 26, 2026
Post likes count 6

Building a Secure MCP Server with OAuth 2.1 and Azure AD: Lessons from the Field

Juan Burckhardt

How we built a production-ready MCP server with OAuth 2.1 authentication and On-Behalf-Of flow for Microsoft Graph, navigating a rapidly evolving specification.

Feb 20, 2026
Post likes count 1

Using Codes to Increase Adherence to Prompts

Peter Lasne

Agentic systems have some discretion in the parameters they sent to tooling, but there are cases, such as experimentation, when you need 100% adherence to a set of parameters.

Feb 5, 2026
Post likes count 0

Minimal GitOps for Edge Applications with Azure IoT Operations and Azure DevOps

Maho Pacheco

How we built a minimal, scalable GitOps workflow for edge applications using Azure IoT Operations, Azure DevOps, and a multi-repo strategy. Includes pipeline templates and scripts for real-world deployments.

Jan 23, 2026
Post likes count 1

Bridging Local Development and Cloud Evaluation: Using Microsoft Devtunnels with Azure Machine Learning

Nasryn El-Hinnawi

Learn how to streamline AI development by using Microsoft Devtunnels to connect local services with Azure Machine Learning evaluation pipelines, eliminating deployment delays while maintaining comprehensive cloud-based validation.

Jan 16, 2026
Post likes count 1

Evaluate Small Language Model Function Calling using the Azure AI Evaluation SDK

Chinedum Echeta

This blog details how the Azure AI Evaluation SDK can be used to assess the performance of a small language model for function calling, such as Phi-4-mini-instruct, and view the results in Microsoft Foundry.

Dec 18, 2025
Post likes count 2

Introducing the Copilot Studio + Azure AI Search Solution

Ian Jensen

Introduction to a scaleable and secure turnkey architecture for deploying Copilots connected to Azure AI Search.

Dec 10, 2025
Post likes count 1

Agent Onboarding Process for Agentic Systems: Maintain accuracy at scale

Vikesh,
Vidhya

A robust agent onboarding process is the backbone of scalable, reliable multi-agent AI systems—enforcing uniqueness, rigorous evaluation, and responsible AI practices at every step.

Dec 1, 2025
Post likes count 1

Accelerating AI Development with GitHub Copilot: Real-World Use-Cases

isedevblog

This blog post shares real-world experiences from Microsoft's ISE team using GitHub Copilot to accelerate AI development workflows, demonstrating how AI tools can significantly speed up tasks like creating documentation, class diagrams, data visualizations, and demo scripts when used by experienced developers.

Nov 13, 2025
Post likes count 1

Ensuring Seamless User Experience in Modernization Projects with AI‑Generated Stagehand and Playwright UI Tests

Suganya Srinivasan

Using AI‑generated tools like Stagehand with Microsoft's Playwright helps ensure a smooth user experience during modernization efforts such as Hyper Velocity Engineering and monolith-to-microservices migrations.

Nov 7, 2025
Post likes count 10

Patterns for Building a Scalable Multi-Agent System

Sushant,
Vikesh

Practical patterns for designing scalable, high-performing multi-agent systems—grounded in real implementation experience.

Oct 31, 2025
Post likes count 1

Tuning and Optimization of Speech-to-Text (STT), Text-to-Speech (TTS), and Custom Keyword Recognition in Azure Speech Services

Ayaka Hara

This blog outlines best practices for optimizing Speech-to-Text (STT), Text-to-Speech (TTS), and Custom Keyword Recognition in Azure Speech Services, helping developers build more accurate and responsive voice-enabled applications.

Oct 23, 2025
Post likes count 4

Integration Testing with Testcontainers

Jimmy Kurian

How to leverage Testcontainers for integration testing

Oct 16, 2025
Post likes count 3

Managing secrets on Azure KeyVault with a Tagging strategy to perform automations

David Lee

Cloud and Infrastructure teams can manage secrets on Azure KeyVault with a Tagging strategy to perform automations.

Oct 9, 2025
Post likes count 3

Working with Git Submodules: Managing Dependencies Across Repositories

Liliam Leme

Working with Git Submodules and creating actions.

Oct 2, 2025
Post likes count 3

Leveraging Fabric Eventhouse to Store Great Expectations Validation Results

Dipika Varpe

A step-by-step guide to storing data validation results into the Eventhouse & visualize data using queries and dashboard.

Sep 25, 2025
Post likes count 4

Taming Complexity: Intuitive Evaluation Framework for Agentic Chatbots in Business-Critical Environments

Karol,
Marc

This blog post introduces a comprehensive evaluation framework for enterprise chatbots powered by large language models (LLMs), specifically addressing the challenges of assessing Line of Business (LOB) agents in business-critical environments. The authors tackle the fundamental problem that traditional chatbot evaluation metrics fail to capture the nuanced, non-deterministic performance of modern LLM-based systems, proposing a solution that combines realistic chat simulation using an LLM-powered User Agent, automated ground truth generation at scale, and comprehensive metrics including function call precision, r...

Sep 3, 2025
Post likes count 1

Unlocking Vector Search with OneLake Indexer and OpenAI Integration in Microsoft Fabric

Liliam Leme

Exploring how Microsoft Fabric OneLake indexer integrates with OpenAI

Aug 20, 2025
Post likes count 4

Ground Truth Curation Process for AI Systems

Laura,
Uffaz

Steps to Produce High Quality Ground Truth Pairs for AI Systems

Aug 14, 2025
Post likes count 6

Customizable and Extensible Evaluation of a GenAI Application

Alex Fee

Suggestions on how to evaluate a generative AI application in an adaptable and extensible manner.

Aug 7, 2025
Post likes count 2

Learnings from External Data Handling

Ashley Costigane

This blog post discusses the challenges and solutions encountered by the ISE team at Microsoft while making a distributed system production-ready. It focuses on issues including slow processing speeds and out-of-memory exceptions, and provides insights into the methods used to address these problems.

Jul 31, 2025
Post likes count 1

Enhancing Analytical Queries with Semantic Kernel and In-Memory SQL Processing

Alexandre Gattiker

Integrating Semantic Kernel with an ephemeral in-memory SQL plugin, enabling secure and efficient analytical queries over structured data.

Jul 18, 2025
Post likes count 1

AI Model Promotion with dstoolkit-mlops-v2

Malcolm,
Daniel

Evaluates various repository structures and designs for maximizing the efficiency of Data Scientists and Software Engineers developing, promoting and deploying AI models on the same project.

Jul 10, 2025
Post likes count 1

Use Agent to Update Dataverse Table Content

Flavio Furlan

Create an agent using Copilot Studio to update the content of a Dataverse table.

Jun 26, 2025
Post likes count 3

Fleet Configuration Management

Cameron Taylor

This post explains the challenge of fleet configuration management, the role of an automated fleet configuration management system, and describes key considerations for building such a system.

Jun 19, 2025
Post likes count 1

PACT Contract Testing – Because Not Everything Needs Full Integration Tests

Adam Dougal

PACT is a contract testing tool that allows you to define a contract between two applications, enabling

Jun 12, 2025
Post likes count 6

Dockerizing UV

Jimmy Kurian

How to Dockerize Python Package Management Tool UV

Jun 5, 2025
Post likes count 4

Hierarchical Waterfall Evaluation of Query Classification, Retrieval & Generation in Multi-Agent LLM Systems

Abdul,
Wenwen

This post presents a hierarchical waterfall framework for evaluating query classification, retrieval, and generation in multi-agent LLM systems.

May 30, 2025
Post likes count 3

Efficient Ground Truth Generation for Search Evaluation

Geisa,
Phong

This article discusses the challenges and methodologies for creating a reliable groundtruth dataset for search evaluation. It highlights the use of TREC Pooling and GPT-4o assisted ranking to overcome the labor-intensive process of manual labeling, ensuring high-quality search evaluations even with limited data.

May 23, 2025
Post likes count 0

Durable Functions for Indexing in RAG: A Practical Python Approach

Felix Mönckemeyer

Azure Durable Functions streamline RAG indexing by combining push flexibility with pull reliability for scalable,

May 15, 2025
Post likes count 1

Cloud infrastructure provisioning: best practices for IaC

Hannes,
Brent

Infrastructure-as-Code (IaC) is a common DevOps practice that enterprises use to provision and deploy IT infrastructure. Enterprises applying IaC and continuous integration/continuous delivery (CI/CD) pipelines can maintain high availability and manage risk for their cloud environments at scale. However, multi-environment challenges, manual processes, and fragmented guidance still lead to configuration drift, errors, and inconsistencies. These issues are likely to result in downtime, security vulnerabilities, and inefficient resource utilization.

May 8, 2025
Post likes count 0

Integration testing with Dapr and Testcontainers

Jose Santano Gonzalez

This blog post discusses setting up a Pytest integration test framework for a system using Dapr and Testcontainers. This framework was initially setup for a customer to suit their needs and it has been extracted into a sample project to provide a starting point.

May 2, 2025
Post likes count 2

Running RAG with ONNX Runtime GenAI for On-Prem Windows

Omer Demir

Exploring how to efficiently run a RAG pipeline with structured language models (SLMs) and guardrails on Windows, achieving inference under 5 seconds with ONNX Runtime GenAI.

Apr 3, 2025
Post likes count 3

Data Validations with Great Expectations in MS Fabric

Jarang Kim

In this blog post, we will explore how to integrate GX within the Microsoft Fabric environment and use it to validate data programmatically.

Mar 27, 2025
Post likes count 1

Using OpenTelemetry for flexible observability

Adele

OpenTelemetry offers flexible observability in distributed systems, this post looks at how to integrate multiple data sources when connected and disconnected from the internet.

Mar 20, 2025
Post likes count 2

The Hidden Risks of Docker Build Time Arguments and How to Secure Your Secrets

Jason

This post discusses the security risks of using Docker build-time arguments to pass secrets, and suggests alternative methods for handling secrets securely.

Mar 13, 2025
Post likes count 1

Flexible Tool Selection for ML Model Production

Rob,
Daniel,
Swetha

data science and software engineering teams can choose the best tools for their respective roles in delivering machine learning models to production.

Mar 6, 2025
Post likes count 0

Multi Root Workspaces in Visual Studio Code

Colin Desmond

How to manage multiple independent python projects, with different dependencies, inside Visual Studio Code.

Feb 27, 2025
Post likes count 2

Multi-Provider Strategy for App Configuration in Python

Abeeb Amoo

This post discusses a strategy for creating a custom extensible configuration module in python projects.

Feb 20, 2025
Post likes count 0

Instrumenting Apache Spark Structured Streaming jobs using OpenTelemetry

Irene Stanely

Apache spark monitoring using OpenTelemetry

Feb 13, 2025
Post likes count 1

Unlock Generative AI for Enterprise Scalability

Shiran Rubin Manaev

Reusable templates, pipelines, and tools to streamline the setup of GenAI projects, saving time and effort, and ensuring consistency and reliability across projects.

Feb 6, 2025
Post likes count 2

Azure Functions vs. Indexers: AI Data Ingestion

Ivan Sholokh

This article compares Azure Functions with pre-built indexers for data ingestion in Azure AI Search, with a focus on using Azure Functions for a flexible, scalable approach. It explores key steps like data migration, index creation, and deployment automation.

Jan 30, 2025
Post likes count 0

Using Managed Identity on Logic App consumption

Omer Demir

Deploy Azure Logic App Consumption with Managed Identity using Terraform by leveraging azapi_resource and ARM templates. This setup enables secure authentication without manual credential management.

Jan 24, 2025
Post likes count 2

Revolutionising Customer Feedback: Harnessing Large Language Models for Retail Insights and Intelligence

Shinoj Zacharias

In this article, we delve into techniques for extracting valuable insights from customer feedback using Large Language Models (LLMs). By identifying themes, sentiment, and competitor comparisons from feedback, businesses can gain a competitive edge.

Jan 10, 2025
Post likes count 3

LLMOps in restricted networks and addressing continuous evaluation long run constraints

Suganya Srinivasan

In this blog post, we'll explore the challenges we faced in establishing LLMOps and continuous evaluation within a pipeline using Azure Machine Learning, particularly when dealing with long runs and a restricted Bring-Your-Own (BYO) network.

Dec 16, 2024
Post likes count 4

Transforming Language into Code: Building and Evaluating a Robotic Code Generation Copilot

Ihar Shulhan

This blog post explores the development and evaluation of an AI assistant that converts natural language into robotic code.

Dec 6, 2024
Post likes count 2

Implementing Evaluation platform on Azure Machine Learning

David Lee

Azure Machine Learning is known for training and deploying models, but can also be used for running experiments. This blog post will show us how we have implemented our Evaluation platform on Azure Machine Learning.

Nov 29, 2024
Post likes count 1

Semantic Kernel Learnings

Alex,
Dan

A synopsis of our key learnings and findings from using Semantic Kernel.

Nov 20, 2024
Post likes count 3

Building AI Search for Production in Corporate Environments: Key Lessons

Daniel,
Alessia,
Dina

In the dynamic world of AI and data science developing production-level solutions for corporate environments comes with its own set of challenges and lessons. As a data science team working within Microsoft, we recently completed an engagement for a large company where we leveraged cutting-edge technologies, including OpenAI tools, GPT-4o for generating syntactic datasets, embedding models like text_embedding_3, and Azure AI Search for implementing both text and hybrid search solutions. Here are 10 key lessons we learned along the way.

Nov 15, 2024
Post likes count 2

Accessing Private Resources in Azure

Juan Burckhardt

A practical guide for developers on connecting to Azure resources with private endpoints through a VPN.

Nov 8, 2024
Post likes count 1

Three Ways to Simplify CI/CD Pipelines on GitHub Actions

Neeraj,
Stewart

This post focuses on three ways to simplify CI/CD pipelines deploying to Azure with GitHub Actions.

Oct 31, 2024
Post likes count 3

Building a Multi-tenant GenAI gateway using APIM

Sushant Bhalla

This blog shares insights on developing a GenAI gateway with multi-tenancy and quota management capabilities implemented using Azure APIM where customers can access the GenAI resources across different service tiers like Freemium, Basic, and Premium with each tier having it's own quota and rate limits. The solution used the concept of "Products" to group APIs related to specific entitlements and using Product Policies, simplifying the overall design and ensuring scalability.

Oct 11, 2024
Post likes count 8

Multimodal RAG with Vision: From Experimentation to Implementation

Pratima,
Maitreyi,
Oscar

This blog post delves into the experimentation journey of fine-tuning a multimodal RAG pipeline to best answer user queries that require both textual and image context. We ran our experiments by systematically testing various approaches, adjusting one configuration setting at a time and using clearly defined evaluation metrics to validate the performance of each component of the RAG pipeline in isolation, as well as the end-to-end inference flow.

Oct 4, 2024
Post likes count 9

Streamlining Error Handling in Power Automate for Better Workflow Performance

Peter Maynard

This post explores effective error handling strategies in Power Automate to enhance workflow reliability and maintainability through practical techniques and integrations.

Sep 27, 2024
Post likes count 2

PromptFlow Serve – Benchmark Result Analysis

Dasith Wijesiriwardena

In this post we discuss how to test the throughput of PromptFlow pf-serve module and key learnings doing so. We explore the impact on throughput and performance the different WSGI and ASGI hosting methods have and the importance of engineering your Python nodes with the async await pattern for I/O.

Sep 9, 2024
Post likes count 8

Earning Agentic (and LangChain) Complexity

Michael,
Juan

This post examines the challenges of adopting complex technologies like LangChain and agentic solutions in production environments, emphasizing the importance of understanding the necessity of such complexity. It provides insights on how to evaluate these technologies carefully, manage dependencies, and adhere to best practices for secure and stable AI applications.

Aug 16, 2024
Post likes count 8

Semantic Router using Azure AI Search

Christopher Buckley

Potential semantic routing solution using Azure AI Search for scalable query handling.

Aug 9, 2024
Post likes count 5

Measuring Differentials of Product Images in AI-generated Ads

James,
Anna,
Sean

Various methodologies of measuring differentials between a product image and an AI-generated ad for the purpose of product representation integrity in serving AI-generated ads.

Aug 3, 2024
Post likes count 0

Converting multiple sources to FHIR using Liquid

Arpit Gaur

A brief summary of how Microsoft ISE teams creation a solution to integrate patients data from multiple fragmented sources into a standard FHIR data model

Aug 2, 2024
Post likes count 8

Profiling Docker Compose services through Visual Studio

Andrew Cardy

A brief summary of how Crew Clark were able to profile performance using Visual Studio and within a Docker Compose environment.

Jul 26, 2024
Post likes count 1

LLM Prompt Injection Considerations With Tool Use

Dasith Wijesiriwardena

This post discusses a pattern to prevent a class of prompt injection attacks in LLM-based solutions. It emphasizes the importance of building strong foundational patterns to mitigate risks and avoid potential pitfalls. By implementing this pattern, teams can enhance the security of their tool-based solutions.

Jul 12, 2024
Post likes count 4

Document Summarization Solution Patterns using Azure Open AI & Langchain

Komal Grover

This post lists the various solution patterns that can be applied for document summarization. Document summarization comes with its challenges related to token limitation and chunk sizes. This blog post discusses about the solutions to tackle those challenges.

Jul 5, 2024
Post likes count 0

Azure IoT Operations Data Processor Pipelines: Unlocking Efficiency in the Industrial Metaverse

Maho Pacheco

In this post we explore the implementation of Azure IoT Operations Data Processor pipelines in a manufacturing context, focusing on data processing at the edge.

Jun 28, 2024
Post likes count 2

Learnings around developing with Microsoft Graph Toolkit

David Lee

Learnings of Microsoft Graph Toolkit from our recent project, including data leaking and privacy concerns that can help developers get a jumpstart with Microsoft Graph Toolkit.

Jun 21, 2024
Post likes count 3

Empowering Collaboration with a Tech Savvy Customer

Liza Shakury

Empowering Collaboration with a Tech Savvy Customer

Jun 14, 2024
Post likes count 19

Next-Level Boilerplate: An Inside Look Into Our .Net Clean Architecture Repo

Dor Lugasi-Gal

In this article, we will take an inner look at the clean architecture boilerplate and how it helps kickstart our projects rapidly.

Jun 7, 2024
Post likes count 2

7 Essential Tips: Accelerate Prompt Flow Development

Neeraj Mandal

This post focusses on tips and tricks to accelerate from flow development through use of dev container & shell scripts.

May 24, 2024
Post likes count 1

Start Simple, Scale Fast: Azure Storage for Distributed Computing

Andrew Vineyard

Azure Storage Accounts are one of the simplest resources on Azure, but simultaneously one of the most useful and reused building blocks across the Microsoft Cloud ecosystem. Blobs, leases, and queues are fundamental components that enable powerful patterns. Let's dig into why Storage Accounts should be in your Azure toolkit.

May 17, 2024
Post likes count 2

Simplified Connection State Management

Peter Lasne

This post advocates for a simplifying a complex network of states into simple states that correlate with user expectations.

May 10, 2024
Post likes count 4

Synchronizing multiple remote Git Repositories

Liam Moat

summary of the post

May 3, 2024
Post likes count 2

Making an Application More Robust With Data Mapping

Julia Valder

How to use data mapping to make your application more robust against unexpected third-party data schema changes.