Data Fabric defined
Another buzzword that you may have been hearing a lot about lately is Data Fabric. In short, a data fabric is a single environment consisting of a unified architecture with services and technologies running on it that architecture that helps a company manage their data. It enables accessing, ingesting, integrating, and sharing data in a environment where the data can be batched or streamed and be in the cloud or on-prem. The ultimate goal of data fabric is to use all your data to gain better insights into your company and make better business decisions. If you are thinking this sounds a lot like a modern data warehouse that I posted a video on recently at Modern Data Warehouse explained, well, I would argue it basically is the same thing except a data fabric expands on that architecture. A data fabric includes building blocks such as data pipeline, data access, data lake, data store, data policy, ingestion framework, and data visualization. These building blocks would be used to build platforms or “products” such as a client data integration platform, data hub, governance framework, and a global semantic layer, giving you centralized governance and standardization. Ideally the building blocks could be use by other solutions outside of the data fabric. At EY, my new place of employment, we are building a data fabric that will be the subject of a future blog post.
You may now be thinking how does a data fabric compare to a data mesh? (If you are not familiar with a data mesh, check out my blog Data Mesh defined). A data fabric and a data mesh both provide an architecture to access data across multiple technologies and platforms, but a data fabric is technology-centric, while a data mesh focuses on organizational change. Another difference is a data mesh is decentralized (or “distributed”) where each of the sets of data is a domain (treated like a product) that is kept within each of the various organizations within a company , whereas in a data fabric all the data is brought into a centralized location. I need to point out here that this is my interpretation of a data fabric compared to a data mesh and you will find many who have variations of my view, and some that can be very different. In fact, two companies can have very different technology solutions for a data fabric or a data mesh that can both be correct as what is correct is the best solution based on your company’s data (size, speed, and type), security policies, skillset, performance requirements, and monetary constraints.
Fundamentally, the data fabric is about collecting data and making it available via purposed built APIs (optionally also via direct connection to the data stores for those tools that don’t support APIs). The data mesh involves building data products via copying data into specific datasets for specific use-cases but built by the dept/domain who keeps and owns the data.
As an example, say I want a dashboard that measures sales vs inventory. In the data fabric world I would ingest the data in the sales system and well as the data in the inventory system in a central location, then I would build an API that joins them together and expose that to the dashboard. Data fabrics are more about technical data integration and don’t really dictate who does it or who owns the data. In the data mesh world I would get the sales team to copy data from the sales system to a sales product dataset and the inventory management team to copy data from the inventory system to an inventory dataset and get the dashboard owner to build a joined table that the dashboard uses.
In summary, a data mesh is more about people and process than architecture, while a data fabric is an architectural approach that tackles the complexity of data and metadata in a smart way that works well together.
In the future the technology used to build a data mesh could look very different than the technology used to build a data fabric, but for now some of the technology needed to build a true data mesh does not exist, so the result is a built data mesh may look more like a data fabric. If you still find this all confusing, you are not alone! Please share your thoughts by entering a comment below.
More info:
Data Virtualization in the Context of the Data Mesh
Disambiguation of Data Mesh, Fabric, Centric, Driven, and Everything
The Role of the Data Fabric In Your Target State Architecture
Catalog & Cocktails #32: Is Your Data Fabric a Mesh?
Catalog and Cocktails #44: Why it’s time to mesh with your data architecture
Pingback:Defining the Data Fabric – Curated SQL
Hi James – I love the simplicity of your blogs. No jargons just plain English. I read them with great interest. This one is no different. Just wanted to add two points to better differentiate Data Fabric with Data Mesh. But please correct me if you disagree.
In your sales vs inventory example,
1. In case of Data Fabric the central location would ingest data from the two systems on a daily basis and maintain the history in that central location. Whereas in case of Data Mesh, the sales product dataset and inventory dataset will be overwritten daily as they already contain the history required ( In Data Mesh, within each data product, the operational and analytical data co-exist).
2. In case of Data Fabric, at the central location, there will be an MDM with the golden copy for reference data, whereas in Data Mesh the expectation is that the incoming Data Quality is high and trusted as it is owned by the data domain experts.
Hi Man,
Glad you like my blog!
1. Agreed! I’m finding a lack of detail on how companies are creating the operational and analytical in a data mesh. The data mesh blueprint is less ETL than a traditional data warehouse but needs to explain how the analytical data is created
2. MDM is really confusing to me in a data mesh architecture. If two domains have customer data, where is the MDM done?
Thanks for your input!
Hi James
Thanks for the reply. I am also learning about Data Mesh. I think, Data Mesh proposes a thin registry MDM architecture pattern. Basically each domain owns and maintains their attributes of the Customer, but then there is a central place where a Global Id for Customer is maintained, that maps to all attributes of Customer in various distributed data products. Again the expectation is that each data product maintain high quality of data in the attributes they own.
Thanks Man! That sounds great in theory, but like much of the talk of a Data Mesh, it leaves out the technical details which are a huge challenge, especially with MDM. I hope to find someone explain the details on how they implemented MDM in a data mesh.
Hi james, man, for the second answer, customer data should be a specific domain in this case, correct ?
100% agree. As they say “Devil lies in the details”, thin registry has its own challenges such as survivorship rules on consolidation and a few more.
Pingback:Centralized vs decentralized data architecture | James Serra's Blog
Pingback:Centralized vs decentralized data architecture – SQLServerCentral
Pingback:Data Platform products for Microsoft gaps | James Serra's Blog
Pingback:Data Mesh: Should you try this at home? | Technology For You
Pingback:Data Mesh: Should you try this at home? | ZDNet - TechNewsBoy.com
Pingback:Understanding Data Mesh: The key features & principles - Cyber Bharat
Pingback:The important thing options & rules - ViewTechNews
Hi James, Indeed very helpful blogs about data fabric and data meshes. Great reads. Datafabric technology, seems to me, that it all comes down to connectivity with API services. Connect all systems together with API’s? And/Or is data fabric not a new fancy name for the problems we are dealing today? : batch, stream, on prem, cloud, ingesting, ETL + some of the data management areas like DMBOK? Data Governance, data catalog, meta data. It seems a mix between technology and DMBOK (DAMA).
nice article the terms have evolved a lot since you wrote this but for a beginner a good start cheers
Pingback:Data fabric versus data mesh: What’s the difference? - LuxuryFolks
Pingback:Data fabric versus data mesh: What's the difference? | VentureBeat
Pingback:Data fabric versus data mesh: What’s the difference? – Omni Innotech
Pingback:Data fabric versus data mesh: What's the difference? - Digital Business Magazine
Pingback:Data fabric versus data mesh: What's the difference? - Modern Future Tech
Pingback:Knowledge cloth versus information mesh: What is the distinction? - Lecheyre.ch
Pingback:Knowledge cloth versus knowledge mesh: Apa bedanya? – Pegawainegerisipil
Pingback:Data fabric versus data mesh: What's the difference? - Delaware Digital News
Pingback:Data fabric versus data mesh: What's the difference? - Nu 13th
Pingback:Data fabric versus data mesh: What’s the difference? | Wrevo
Pingback:Data fabric versus data mesh: What’s the difference? – Oracle Dætalytica
Pingback:Data fabric versus data mesh: What’s the difference? – Fromaradio
Pingback:Knowledge cloth versus knowledge mesh: What is the distinction? - Theopenlab
Pingback:Information material versus knowledge mesh: Apa bedanya? - hapidzfadli
Pingback:Data fabric versus data mesh: What’s the difference? – frogfilms
Pingback:Data fabric versus data mesh: What’s the difference? – strideitforward
Pingback:Fabric data vs data grid: What's the difference? - News5h
Pingback:Data fabric versus data mesh: What's the difference? - Current Time News
Pingback:डेटा फैब्रिक बनाम डेटा मेष: क्या अंतर है? - Shiva Music
Pingback:Data fabric versus data mesh: What’s the difference? – Nunuci
Pingback:Data fabric versus data mesh: What’s the difference? – OasisNews
Pingback:Data fabric versus data mesh: What’s the difference? – TECH GLOBAL NEWS
Pingback:Information cloth versus knowledge mesh: What is the distinction? - Trend Directory
Pingback:Data fabric versus data mesh: What’s the difference?
Pingback:Data fabric versus data mesh: What’s the difference? – Investor Newsletter
Pingback:Data fabric versus data mesh: What’s the difference? – VentureBeat - AI Caosuo
Pingback:Data fabric versus data mesh: What’s the difference? - Downgrade Test
Pingback:Data Fabric vs Data Mesh: What's the Difference? - epicapplications
Pingback:Data fabric versus data mesh: What’s the difference? - Tech News
Pingback:Data fabric versus data mesh: What’s the difference? | bizblip
Pingback:Data fabric versus data mesh: What's the difference? - Fast Rib
Pingback:Data fabric versus data mesh: What's the difference? - Quick Telecast
Pingback:Linus Ochieng - Web and App Designer
Pingback:Data mesh vs data fabric: What’s the difference?
Pingback:Data mesh vs data fabric: What’s the difference? - KiT Crackers
Pingback:Data mesh vs data fabric: What's the difference? – Analytics India Magazine - AI Caosuo
Pingback:Knowledge Mesh 101 - DATAVERSITY - My Crypto World. All rights reserved.
Pingback:Knowledge Mesh 101 - DATAVERSITY - GIST BEAST
Pingback:Information Mesh 101 - DATAVERSITY - Roromoney
Pingback:Information Mesh 101 - DATAVERSITY - go4kooora
Pingback:Data Mesh: Should you try this at home? – ZDNet – Auto Robot Demo
That’s really an amazing and informative article about data fabric. I really like it.