Automated Data Quality Control System in Health and Demographic Surveillance System
DOI:
https://doi.org/10.54327/set2024/v4.i2.125Keywords:
Automated data quality control, Application programming interface, Data quality framework, Health and demographic surveillance system, Data qualityAbstract
The automated data quality management system serves as a comprehensive solution developed to enhance the precision, dependability, and uniformity of data within data-driven organizations. Such systems play an important role in eliminating the shortcomings associated with manual data quality management, which is prevalent in health and demographic surveillance systems (HDSS). The ongoing difficulty of ensuring data quality through manual processing hinders the HDSS's capacity to optimize data quality effectively. To address this challenge, our study adopted design science methodologies to provide guidelines for the design and implementation of the automated data quality control system. The open source technologies (Pentaho data integration, R Studio, SQL, Windows task scheduler) were used to facilitate the automation and validation of the incoming and database resident data. The quality of data has vastly improved since the implementation of the proposed system. The findings suggest that the automated data quality control system exhibited superior performance compared to the manual methods, thereby minimizing errors and time-wasting efforts.
Downloads
Downloads
Published
Data Availability Statement
The data used for this study is available and can be shared on request.
License
Copyright (c) 2024 Joseph Tlouyamma, Sello Mokwena
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website, social networking sites, etc).