Copy the page URI to the clipboard
De Leo, Vincenzo; Fenu, Gianni; Greco, David; Bidotti, Nicolò; Platter, Paolo; Motta, Enrico; Nuzzolese, Andrea Giovanni; Osborne, Francesco and Recupero, Diego Reforgiato
(2023).
DOI: https://doi.org/10.1109/bigdata59044.2023.10386910
Abstract
The design and management of modern big data platforms are extremely complex. It requires carefully integrating multiple storage and computational platforms as well as implementing approaches to protect and audit data access. Therefore, onboarding new data and implementing new data transformation processes is typically time-consuming and expensive. In many cases, enterprises construct their data platforms without a clear distinction between logical and technical concerns. Consequently, these platforms lack sufficient abstraction and are closely tied to particular technologies, making the adaptation to technological evolution very costly. This paper illustrates a novel approach to designing data platform models based on a formal ontology that structures various domain components into an accessible knowledge graph. We also describe the preliminary version of AGILE-DM, a novel ontology that we built for this purpose. Our solution is flexible, technologically agnostic, and more adaptable to changes and technical advancements.