November 15th, 2023

Announcing Azure Cosmos DB Mirroring in Microsoft Fabric – Private Preview

We are excited to announce private preview availability of mirroring your Azure Cosmos DB databases in Microsoft Fabric. Mirroring provides a modern way of accessing your Azure Cosmos DB databases in Microsoft Fabric, helping unify your data estate, thereby eliminating data silos. 

With just a few clicks, you can now bring your Azure Cosmos DB databases into Microsoft Fabric, for rich business insights using Fabric’s built-in advanced analytics and AI capabilities. Your Cosmos DB operational data is seamlessly and incrementally replicated into Fabric OneLake in near real-time. Data in OneLake is stored in open-source Delta Parquet format and made available to all analytical engines in Fabric without further data movement.

You can use T-SQL to run complex aggregate queries across Azure Cosmos DB collections or databases and Spark for data exploration. You can seamlessly join this data with all other data in your Fabric workspace and build BI reports using Direct Lake, accessing data directly from OneLake.

In addition to continuous replication for advanced analytics, you can access Azure Cosmos DB Data Explorer directly from your mirrored database in Fabric. This gives you flexibility to access and query your Azure Cosmos DB database, as you would from Azure portal, without leaving Fabric. You can view collections, add/edit/delete items, and run simple SQL queries on your collections.

Key features of mirroring

  • Inserts/updates/deletes on your Azure Cosmos DB data are incrementally replicated into Fabric OneLake, with no-ETL. There is no performance or RU impact on your transactional workload.
  • You can directly query data in OneLake from SQL analytics endpoint, automatically created in mirrored database.
  • SQL analytics endpoint experience:
    • Run complex aggregate queries using T-SQL, within or across Azure Cosmos DB collections in the mirrored database, build views and visual queries, cross-join data with other mirrored artifacts or Lakehouses or Warehouses in Fabric.
    • Use the endpoint directly from SSMS or any other 3rd party tools to access views and queries.
  • Azure Cosmos DB Data Explorer – visual editor experience:
    • Any changes made to the source database, from Azure Cosmos DB Data Explorer in mirrored database, are reflected immediately in Azure portal/other clients and vice-versa.
  • You can add mirrored databases as shortcuts in Fabric Lakehouse. Data engineers can create notebooks and use Spark to analyze the data. Mirrored database data can be joined with any other data in Lakehouse, including shortcuts from Azure Data Lake Storage Gen2 (ADLS Gen2), Amazon S3, other OneLake sources etc.
  • Visualize your Azure Cosmos DB data and build quick BI reports using Direct Lake and leverage copilot to build rich content for your dashboards and reports. Image Picture1a jpg

Getting started with mirroring

  • Navigate to Fabric->Data warehouse and click on Mirrored Azure Cosmos DB database. Give your artifact a name.Image Picture2
  • Create or pick a connection to connect to your Azure Cosmos DB account and database.
  • Azure Cosmos DB Data Explorer – visual editor experience:
    • Access your source database directly using Azure Cosmos DB Data Explorer, integrated into Microsoft Fabric.

Image Picture5

  • Navigate to SQL analytics endpoint to access Azure Cosmos DB data replicated into Fabric OneLake.Image picture6
  • SQL analytics endpoint experience

Automatic continuous replication from Azure Cosmos DB to Fabric OneLake for analytics

    • Run T-SQL queries and views Here’s an example of a cross-collection query with aggregates. Optional: You can select the query below and click on “Save as view”.Image Picture8
    • Build visual queries with just a few clicksImage visualq
    • Cross-join Azure Cosmos DB data with other data in OneLake: Below example shows Azure Cosmos DB data queried together with mirrored Azure SQL Database data.Image Picture10
    • Build BI reports directly from SQL queriesImage Picture11
  • Data science: Access Cosmos DB data from Lakehouse/notebooks

Image Picture12

When to use mirroring

If you are an existing Azure Synapse Link customer or looking for analytics on your operational data in Azure Cosmos DB, you may benefit from below with mirroring:

  • Ease of bringing data across various sources into Microsoft Fabric OneLake
  • Open-source Delta Parquet format and delta features such as time-travel
  • Delta table optimizations with v-order for lightning-fast reads
  • One-click integration with Power BI with Direct Lake and co-pilot
  • Rich business insights by joining data across various sources
  • Richer app integration to access queries and views

If you are an existing Fabric user, you may benefit from having Azure Cosmos DB data with rest of your organizational data in OneLake, unifying your data estate.

How to sign-up for private preview

If you are interested in trying the product in preview, please fill this form. Mirroring is currently available for NoSQL API only.

For any questions or feedback, please reach out to our team at fabriccosmosdbmirror@microsoft.com.

About Azure Cosmos DB

Azure Cosmos DB is a fully managed and serverless distributed database for modern app development, with SLA-backed speed and availability, automatic and instant scalability, and support for open-source PostgreSQL, MongoDB, and Apache Cassandra. Try Azure Cosmos DB for free here. To stay in the loop on Azure Cosmos DB updates, follow us on TwitterYouTube, and LinkedIn.

Author

Anitha Adusumilli
Principal PM Manager

Principal PM Manager, Azure Cosmos DB

Revin Chalil
Principal Manager, Engineering and Product

0 comments

Discussion are closed.

Feedback