An Examination of Natural Language Processing and Named Entity Recognition for Bioarchaeological Research

DSpace Repositorium (Manakin basiert)

Zur Kurzanzeige

dc.contributor.author Talks, Alphaeus G. W.
dc.date.accessioned 2025-12-23T09:06:22Z
dc.date.available 2025-12-23T09:06:22Z
dc.date.issued 2026-03
dc.identifier.uri http://hdl.handle.net/10900/173609
dc.identifier.uri http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-1736097 de_DE
dc.identifier.uri http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-1736097 de_DE
dc.identifier.uri http://dx.doi.org/10.15496/publikation-114934
dc.description.abstract Bioarchaeological research is producing ever increasing amounts of data from finite resources. To ensure that the wealth of information contained within these studies is available for reuse, it can be beneficial to use the FAIR (Findable, Accessible, Interoperable and Reusable) data principles. From investigations into the FAIRness of bioarchaeological datasets it was revealed that more must be done to increase the reusability of datasets. This is particularly the case for osteoarchaeological and palaeopathological datasets. Furthermore, osteological data is currently being shared in published reports using PDF format, for functions outside of their original design, providing limited opportunities for data extraction and analysis. This research paper explores the use of Natural Language Processing (NLP) and Named Entity Recognition (NER) to overcome the shortcomings of PDFs in their current use and provide greater opportunities for data reuse in line with FAIR data principles. These two technological approaches were tested through the creation of a prototype system to search for osteoarchaeological terms within the Archaeology Data Service archive. Their application was then analysed for accuracy, time-saving ability, usefulness, accessibility, whether users would consider using it again and reliability by professional bioarchaeologists, students and the public. From the results, despite some limitations, it is shown that there is real potential in the use of NLP and NER to allow osteoarchaeology and palaeopathology information to be accessed more easily, thus unlocking the data trapped within ‘grey literature’. en
dc.language.iso en de_DE
dc.publisher Tübingen University Press de_DE
dc.rights.uri https://creativecommons.org/licenses/by-nc-nd/4.0/deed.en
dc.subject.classification Archäologie de_DE
dc.subject.ddc 930 de_DE
dc.subject.other Bioarchaeology en
dc.subject.other FAIR en
dc.subject.other NLP en
dc.subject.other NER en
dc.title An Examination of Natural Language Processing and Named Entity Recognition for Bioarchaeological Research en
dc.type ConferencePaper de_DE
utue.publikation.fachbereich Sonstige/Externe de_DE
utue.publikation.fakultaet 9 Sonstige / Externe de_DE
utue.publikation.fakultaet 9 Sonstige / Externe de_DE
utue.publikation.noppn yes de_DE
utue.publikation.noppn yes de_DE


Dateien zu dieser Ressource

Dateien Größe Format Anzeige

Zu diesem Dokument gibt es keine Dateien.

Das Dokument erscheint in:

Zur Kurzanzeige

https://creativecommons.org/licenses/by-nc-nd/4.0/deed.en Solange nicht anders angezeigt, wird die Lizenz wie folgt beschrieben: https://creativecommons.org/licenses/by-nc-nd/4.0/deed.en