Improving Data Quality by Rules: A Numismatic Example

DSpace Repositorium (Manakin basiert)


Dateien:

Zitierfähiger Link (URI): http://hdl.handle.net/10900/101838
http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-1018389
http://dx.doi.org/10.15496/publikation-43217
Dokumentart: Konferenzpaper
Erscheinungsdatum: 2020-11-11
Sprache: Englisch
Fakultät: 5 Philosophische Fakultät
Fachbereich: Archäologie
DDC-Klassifikation: 930 - Alte Geschichte, Archäologie
Schlagworte: Datenqualität , Ungewissheit
Freie Schlagwörter:
data quality
SWRL
uncertainty
Lizenz: http://creativecommons.org/licenses/by-nc-nd/3.0/de/deed.de http://creativecommons.org/licenses/by-nc-nd/3.0/de/deed.en
Zur Langanzeige

Abstract:

The archaeological data dealt with in our database solution Antike Fundmünzen in Europa (AFE), which records finds of ancient coins, is entered by humans. Based on the Linked Open Data (LOD) approach, we link our data to Nomisma.org concepts, as well as to other resources like Online Coins of the Roman Empire (OCRE). Since information such as denomination, material, etc. is recorded for each single coin, this information should be identical for coins of the same type. Unfortunately, this is not always the case, mostly due to human errors. Based on rules that we implemented, we were able to make use of this redundant information in order to detect possible errors within AFE, and were even able to correct errors in Nomimsa.org. However, the approach had the weakness that it was necessary to transform the data into an internal data model. In a second step, we therefore developed our rules within the Linked Open Data world. The rules can now be applied to datasets following the Nomisma. org modelling approach, as we demonstrated with data held by Corpus Nummorum Thracorum (CNT). We believe that the use of methods like this to increase the data quality of individual databases, as well as across different data sources and up to the higher levels of OCRE and Nomisma.org, is mandatory in order to increase trust in them.

Das Dokument erscheint in:

cc_by-nc-nd Solange nicht anders angezeigt, wird die Lizenz wie folgt beschrieben: cc_by-nc-nd