A Realistic Threat Model for Large Language Model Jailbreaks

Author: Geiping, Jonas; Hein, Matthias; Voracek, Vaclav; Panfilov, Alexander; Boreiko, Valentyn
Tübingen author(s):
Hein, Matthias
Issue date: 2024-10-21
Publisher: arXiv
Language: English
Full text: https://doi.org/10.48550/arXiv.2410.16222
DDC classification: 004 - Data processing and computer science
Document type: Preprint