A Hitchhiker's Guide to Jailbreaking ChatGPT via Prompt Engineering

Apr 20, 2024·

Yi Liu

Gelei Deng

Zhengzi Xu

Yuekang Li

Yaowen Zheng

Ying Zhang

Lida Zhao

Tianwei Zhang

Kailong Wang

· 1 min read

DOI ACM DL

Abstract

This work provides a comprehensive guide to jailbreaking ChatGPT through prompt engineering techniques. We systematically categorize jailbreak methods, analyze their effectiveness, and discuss implications for LLM safety and responsible AI deployment.

Type

Conference paper

Publication

4th International Workshop on Software Engineering and AI for Data Quality (SEA4DQ @ ICSE)

This workshop paper provides practical guidance on jailbreaking ChatGPT through prompt engineering, offering insights for both attack and defense research.

Last updated on Apr 20, 2024

Large Language Models AI Security Jailbreak Attacks Prompt Engineering

Authors

Gelei Deng

← Source Code Summarization in the Era of Large Language Models Jul 10, 2024

PonziGuard: Detecting Ponzi Schemes on Ethereum with Contract Runtime Behavior Graph (CRBG) Apr 14, 2024 →