Virtual Knowledge Graph for data integration in BPS-Statistics Indonesia
Conference
65th ISI World Statistics Congress 2025
Format: CPS Abstract - WSC 2025
Keywords: data_integration, official_statistics, virtual_knowledge_graph
Session: CPS 60 - Technology and Knowledge Integration in Official Statistics
Tuesday 7 October 4 p.m. - 5 p.m. (Europe/Amsterdam)
Abstract
Data integration is often one of the most challenging and time-consuming tasks, particularly within large and complex datasets. BPS-Statistics Indonesia gathers a wide range of data from the field to produce official statistics, which are then published through a data service portal. The main difficulty in this process is data integration, as users spend considerable time gathering and preprocessing the necessary datasets due to the portal's diverse data sources. To address this issue, we propose using a Virtual Knowledge Graph (VKG) approach for integrating official statistics which creates a unified view by linking and relating data from various databases, APIs, and other sources without requiring data conversion to a single format. The VKG relies on three key components: data sources, ontology to characterize data’s meaning, and mappings that specify relationships between ontology and data sources. We use statistical indicators to demonstrate the effectiveness of this approach and its ability to optimize data workflows and boost data analysis efficiency. By reducing the time and effort needed for data preprocessing, this approach frees data users to focus more on analysis and decision-making.
Figures/Tables
Fig 1. Conceptual Workflow