Abstract
Neural text detectors are models trained to detect whether a given text was generated by a language model or written by a human. In this paper, we investigate three simple and resource-efficient strategies (parameter tweaking, prompt engineering, and character-level mutations) to alter texts generated by GPT-3.5 that are unsuspicious or unnoticeable for humans but cause misclassification by neural text detectors. The results show that especially parameter tweaking and character-level mutations are effective strategies.
Original language | English |
---|---|
Title of host publication | Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP-2023) |
Editors | Mourad Abbas, Abed Alhakim Freihat |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 78-83 |
Number of pages | 6 |
ISBN (Electronic) | 979-8-89176-065-3 |
Publication status | Published - 1 Dec 2023 |
Keywords
- 2024 OA procedure