TY - JOUR
T1 - Evidence of large-scale conceptual disarray in multi-level taxonomies in Wikidata
AU - Dadalto, Atílio A.
AU - Almeida, João Paulo A.
AU - M. Fonseca, Claudenir
AU - Guizzardi, Giancarlo
PY - 2024/3/7
Y1 - 2024/3/7
N2 - The distinction between types and individuals is key to most conceptual modeling techniques and knowledge representation languages. Despite that, there are a number of situations in which modelers navigate this distinction inadequately, leading to problematic models. We show evidence of a large number of representation mistakes associated with the failure to employ this distinction in the Wikidata knowledge graph, which can be identified with the incorrect use of instantiation, which is a relation between an instance and a type, and specialization (or subtyping), which is a relation between two types. The prevalence of the problems in Wikidata’s taxonomies suggests that methodological and computational tools are required to mitigate the issues identified, which occur in many settings when individuals, types, and their metatypes are included in the domain of interest. We conduct a conceptual analysis of entities involved in recurrent erroneous cases identified in this empirical data, and present a tool that supports users in identifying some of these mistakes.
AB - The distinction between types and individuals is key to most conceptual modeling techniques and knowledge representation languages. Despite that, there are a number of situations in which modelers navigate this distinction inadequately, leading to problematic models. We show evidence of a large number of representation mistakes associated with the failure to employ this distinction in the Wikidata knowledge graph, which can be identified with the incorrect use of instantiation, which is a relation between an instance and a type, and specialization (or subtyping), which is a relation between two types. The prevalence of the problems in Wikidata’s taxonomies suggests that methodological and computational tools are required to mitigate the issues identified, which occur in many settings when individuals, types, and their metatypes are included in the domain of interest. We conduct a conceptual analysis of entities involved in recurrent erroneous cases identified in this empirical data, and present a tool that supports users in identifying some of these mistakes.
U2 - 10.3233/SW-243562
DO - 10.3233/SW-243562
M3 - Article
SN - 1570-0844
JO - Semantic web
JF - Semantic web
ER -