Organizations typically have a lot of different types of data, ranging from structured data in relational databases to semi-structured files (e.g. Excel, csv, JSON, XML, …) or even completely unstructured data (e.g. Excel files, PDF, and Word documents, …). On top of that, data structures from operational source systems are mostly optimized for transaction processing only, not for reporting.
Business requirements do not distinguish between the different source systems. Users want to obtain the necessary insights to support decision-making. To achieve this, data from different sources needs to be combined to produce the requested information. Because of the wide variety in data sources, this can be quite a challenge.
One of the cornerstones of our reference architecture is the data hub. The data hub enables our customers to connect all their applications and data with each other and centralize all the internal and external data in one single integrated data platform.