Coping with Data Inconsistencies in the Integration of Heterogenous Data Sources

Article ID

D48AE

Alt text: A digital illustration depicting data analysis with interconnected data points and graphs.

Coping with Data Inconsistencies in the Integration of Heterogenous Data Sources

Joshua Edem Agomor
Joshua Edem Agomor
Meda Saawah Appiah
Meda Saawah Appiah
DOI

Abstract

This research examines the problem of inconsistent data when integrating information from multiple sources into a unified view. Data inconsistencies undermine the ability to provide meaningful query responses based on the integrated data. The study reviews current techniques for handling inconsistent data including domain-specific data cleaning and declarative methods that provide answers despite integrity violations. A key challenge identified is modeling data consistency and ensuring clean integrated data. Data integration systems based on a global schema must carefully map heterogeneous sources to that schema. However, dependencies in the integrated data can prevent attaining consistency due to issues like conflicting facts from different sources. The research summarizes various proposed approaches for resolving inconsistencies through data cleaning, integrity constraints, and dependency mapping techniques. However, outstanding challenges remain regarding accuracy, availability, timeliness, and other data quality restrictions of autonomous sources. Additional research is needed to develop more automated ways of reconciling inconsistencies from source data with the requirements of the global schema. The ability to provide high-quality integrated data is crucial for organizations to maximize the value of their information assets. This research aims to promote further investigation into semi-automated remediation of inconsistencies and leveraging source data quality metrics to aid the integration process. Overcoming inconsistencies is critical to enabling unified views and meaningful analytics from merged cross-organizational data.

Coping with Data Inconsistencies in the Integration of Heterogenous Data Sources

This research examines the problem of inconsistent data when integrating information from multiple sources into a unified view. Data inconsistencies undermine the ability to provide meaningful query responses based on the integrated data. The study reviews current techniques for handling inconsistent data including domain-specific data cleaning and declarative methods that provide answers despite integrity violations. A key challenge identified is modeling data consistency and ensuring clean integrated data. Data integration systems based on a global schema must carefully map heterogeneous sources to that schema. However, dependencies in the integrated data can prevent attaining consistency due to issues like conflicting facts from different sources. The research summarizes various proposed approaches for resolving inconsistencies through data cleaning, integrity constraints, and dependency mapping techniques. However, outstanding challenges remain regarding accuracy, availability, timeliness, and other data quality restrictions of autonomous sources. Additional research is needed to develop more automated ways of reconciling inconsistencies from source data with the requirements of the global schema. The ability to provide high-quality integrated data is crucial for organizations to maximize the value of their information assets. This research aims to promote further investigation into semi-automated remediation of inconsistencies and leveraging source data quality metrics to aid the integration process. Overcoming inconsistencies is critical to enabling unified views and meaningful analytics from merged cross-organizational data.

Joshua Edem Agomor
Joshua Edem Agomor
Meda Saawah Appiah
Meda Saawah Appiah

No Figures found in article.

Joshua Edem Agomor. 2026. “. Global Journal of Computer Science and Technology – G: Interdisciplinary GJCST-G Volume 23 (GJCST Volume 23 Issue G2): .

Download Citation

Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Classification
GJCST-G Classification: FOR Code: 0806
Keywords
Article Matrices
Total Views: 2001
Total Downloads: 11
2026 Trends
Research Identity (RIN)
Related Research
Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

Coping with Data Inconsistencies in the Integration of Heterogenous Data Sources

Joshua Edem Agomor
Joshua Edem Agomor
Meda Saawah Appiah
Meda Saawah Appiah

Research Journals