Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection

Jul 15, 2024ยท
Yuxi Li
,
Yi Liu
Gelei Deng
Gelei Deng
,
Ying Zhang
,
Wenyuan Song
,
Liming Shi
,
Kailong Wang
,
Yuekang Li
,
Yang Liu
,
Haoyu Wang
ยท 1 min read
Abstract
Glitch tokens are anomalous tokens in Large Language Models that can cause unexpected behaviors, including crashes, hallucinations, or security vulnerabilities. This work presents a comprehensive taxonomy for categorizing glitch tokens and proposes effective detection methods to identify and mitigate their impact on LLM systems.
Type
Publication
Proceedings of the ACM on Software Engineering (FSE)

This work provides a systematic study of glitch tokens in Large Language Models, establishing a categorization taxonomy and developing effective detection methods to improve the reliability and security of LLM systems.