An Examination of Natural Language Processing and  Named Entity Recognition for Bioarchaeological Research

Talks, Alphaeus G. W.

Publikationsdienste
→
TOBIAS-portale
→
Dokumente von Tagungen, Kongressen, Workshops und Projekten
→
CAA 2021 Digital Crossroads. Proceedings of the 48th Conference on Computer Applications and Quantitative Methods in Archaeology
→
Dokumentanzeige

« zurück

An Examination of Natural Language Processing and Named Entity Recognition for Bioarchaeological Research

Talks, Alphaeus G. W.

Zitierfähiger Link (URI):	http://hdl.handle.net/10900/173609 http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-1736097 http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-1736097 http://dx.doi.org/10.15496/publikation-114934
Dokumentart:	Konferenzpaper
Erscheinungsdatum:	2026-03
Sprache:	Englisch
Fakultät:	9 Sonstige / Externe 9 Sonstige / Externe
Fachbereich:	Sonstige/Externe
DDC-Klassifikation:	930 - Alte Geschichte, Archäologie
Schlagworte:	Archäologie
Freie Schlagwörter:	Bioarchaeology FAIR NLP NER
Lizenz:	https://creativecommons.org/licenses/by-nc-nd/4.0/deed.en
Zur Langanzeige

Abstract:

Bioarchaeological research is producing ever increasing amounts of data from finite resources. To ensure that the wealth of information contained within these studies is available for reuse, it can be beneficial to use the FAIR (Findable, Accessible, Interoperable and Reusable) data principles. From investigations into the FAIRness of bioarchaeological datasets it was revealed that more must be done to increase the reusability of datasets. This is particularly the case for osteoarchaeological and palaeopathological datasets. Furthermore, osteological data is currently being shared in published reports using PDF format, for functions outside of their original design, providing limited opportunities for data extraction and analysis. This research paper explores the use of Natural Language Processing (NLP) and Named Entity Recognition (NER) to overcome the shortcomings of PDFs in their current use and provide greater opportunities for data reuse in line with FAIR data principles. These two technological approaches were tested through the creation of a prototype system to search for osteoarchaeological terms within the Archaeology Data Service archive. Their application was then analysed for accuracy, time-saving ability, usefulness, accessibility, whether users would consider using it again and reliability by professional bioarchaeologists, students and the public. From the results, despite some limitations, it is shown that there is real potential in the use of NLP and NER to allow osteoarchaeology and palaeopathology information to be accessed more easily, thus unlocking the data trapped within ‘grey literature’.

Das Dokument erscheint in:

CAA 2021 Digital Crossroads. Proceedings of the 48th Conference on Computer Applications and Quantitative Methods in Archaeology [50]

Solange nicht anders angezeigt, wird die Lizenz wie folgt beschrieben: https://creativecommons.org/licenses/by-nc-nd/4.0/deed.en

Veröffentlichen

Stöbern

Gesamter Bestand
Diese Sammlung

Mein Benutzerkonto

Einloggen

An Examination of Natural Language Processing and Named Entity Recognition for Bioarchaeological Research

DSpace Repositorium (Manakin basiert)

An Examination of Natural Language Processing and Named Entity Recognition for Bioarchaeological Research

Abstract:

Das Dokument erscheint in:

Stöbern

Gesamter Bestand

Diese Sammlung

Mein Benutzerkonto