MASTERKEY: Automated Jailbreaking of Large Language Model Chatbots
A comprehensive framework for automated jailbreaking of Large Language Model chatbots, featuring novel attack methodologies and systematic analysis of defense mechanisms.
Feb 26, 2024