I have written before about the rising popularity of the data fabric approach for managing and governing data spread across distributed environments comprised of multiple data centers, systems and applications. I assert that by 2025, more than 6 in 10 organizations will adopt data fabric technologies to facilitate the management and processing of data across multiple data platforms and cloud environments. The data fabric approach is also proving attractive to vendors, including Microsoft, as a means of combining a variety of tools and platforms into consolidated platform offerings designed to provide a strategic approach to data management and data governance.
Microsoft is well-established as a provider of data platforms and analytics products, with a diverse portfolio that addresses databases, data integration, data management, data governance, business intelligence (BI),
There are seven Microsoft Fabric experiences, each providing functionality targeted at users in specific roles. Power BI in Fabric provides the company’s visualization and analytics functionality for business analysts and business users; Microsoft Fabric will also provide integration with its Copilot automated assistant, as well as its Azure OpenAI Service for generative AI. For data integration, Microsoft Fabric includes Data Factory, providing capabilities that Microsoft users will be familiar with from the Power Query data transformation and data preparation engine and Azure Data Factory serverless integration service. Data Factory provides connectivity to more than 170 data sources and more than 300 pre-configured data transformations for code-free extract, transform and load, as well as pipeline management functionality. Data Activator is a new data monitoring offering for business analysts, enabling them to monitor data environments and configure thresholds with functionality for alerting and automated actions.
The remaining four experiences provide functionality that is currently available via Azure Synapse Analytics, targeted at specific users based on their roles and responsibilities. Synapse Data Warehouse represents the next generation of data warehousing, while Synapse Data Engineering enables data professionals to create a lakehouse environment to collaborate on data integration, data warehousing, data science and BI projects as well as utilizing Apache Spark to transform data, with notebook-based development and monitoring. Synapse Data Science provides functionality for data preparation and code generation as well as ML model development and operationalization, and includes the SynapseML library for Apache Spark. Synapse Real-Time Analytics is optimized for data streaming and time-series workloads and provides automatic data streaming, indexing, and partitioning, along with auto-generated queries and visualizations.
The foundation of Microsoft Fabric is Microsoft OneLake. Based on Azure Data Lake Storage Gen2, Microsoft OneLake is an enterprise-wide data lake for storing files and structured or unstructured data. Cloud object storage is
Most of the Microsoft Fabric experiences (apart from Power BI) are currently only available in preview, or in the case of Data Activator, listed as “coming soon.” As such, it remains to be seen how seamlessly Microsoft can stitch together the various experiences. It is also not clear at this stage if or how users of products such as Azure Synapse Analytics and Azure Data Factory will be able to migrate existing workloads to Microsoft Fabric. Nevertheless, I recommend that organizations exploring options for data management and analytics include Microsoft Fabric in their evaluations. The product is an ambitious addition to Microsoft’s product portfolio, providing SaaS-based access to functionality that has previously only been available via Platforms-as-a-Service offerings, with access tailored to the needs and responsibilities of users in specific roles.
Regards,
Matt Aslett