TY - JOUR
T1 - Understanding the differences across data quality classifications
T2 - a literature review and guidelines for future research
AU - Haug, Anders
N1 - Publisher Copyright:
© 2021, Emerald Publishing Limited.
PY - 2021/11/10
Y1 - 2021/11/10
N2 - Purpose: Numerous data quality (DQ) definitions in the form of sets of DQ dimensions are found in the literature. The great differences across such DQ classifications (DQCs) imply a lack of clarity about what DQ is. For an improved foundation for future research, this paper aims to clarify the ways in which DQCs differ and provide guidelines for dealing with this variance. Design/methodology/approach: A literature review identifies DQCs in conference and journal articles, which are analyzed to reveal the types of differences across these. On this basis, guidelines for future research are developed. Findings: The literature review found 110 unique DQCs in journals and conference articles. The analysis of these articles identified seven distinct types of differences across DQCs. This gave rise to the development of seven guidelines for future DQ research. Research limitations/implications: By identifying differences across DQCs and providing a set of guidelines, this paper may promote that future research, to a greater extent, will converge around common understandings of DQ. Practical implications: Awareness of the identified types of differences across DQCs may support managers when planning and conducting DQ improvement projects. Originality/value: The literature review did not identify articles, which, based on systematic searches, identify and analyze existing DQCs. Thus, this paper provides new knowledge on the variance across DQCs, as well as guidelines for addressing this.
AB - Purpose: Numerous data quality (DQ) definitions in the form of sets of DQ dimensions are found in the literature. The great differences across such DQ classifications (DQCs) imply a lack of clarity about what DQ is. For an improved foundation for future research, this paper aims to clarify the ways in which DQCs differ and provide guidelines for dealing with this variance. Design/methodology/approach: A literature review identifies DQCs in conference and journal articles, which are analyzed to reveal the types of differences across these. On this basis, guidelines for future research are developed. Findings: The literature review found 110 unique DQCs in journals and conference articles. The analysis of these articles identified seven distinct types of differences across DQCs. This gave rise to the development of seven guidelines for future DQ research. Research limitations/implications: By identifying differences across DQCs and providing a set of guidelines, this paper may promote that future research, to a greater extent, will converge around common understandings of DQ. Practical implications: Awareness of the identified types of differences across DQCs may support managers when planning and conducting DQ improvement projects. Originality/value: The literature review did not identify articles, which, based on systematic searches, identify and analyze existing DQCs. Thus, this paper provides new knowledge on the variance across DQCs, as well as guidelines for addressing this.
KW - Data management
KW - Data quality
KW - Data quality dimensions
KW - Information management
KW - Information quality
KW - Information quality dimensions
U2 - 10.1108/IMDS-12-2020-0756
DO - 10.1108/IMDS-12-2020-0756
M3 - Journal article
AN - SCOPUS:85113720827
SN - 0263-5577
VL - 121
SP - 2651
EP - 2671
JO - Industrial Management & Data Systems
JF - Industrial Management & Data Systems
IS - 12
ER -