success icon
home Home
ASCII Art LLM Jailbreak
by Luyao Niu
/
May 13, 2024
half yellow star half yellow star half yellow star half yellow star half yellow star
0 ratings
1,055 views
ASCII Art LLM Jailbreak architecture diagram
ASCII Art LLM Jailbreak - Overview of ArtPrompt. ArtPrompt consists of two steps. In the first step, ArtPrompt masks the safety words (e.g., “bomb") within a prompt that could result in rejection from the victim LLM. In the second step, ArtPrompt replaces the masked word in Step I with ASCII art. Then the masked prompt is combined with the ASCII art representation to form a cloaked prompt. The cloaked prompt is finally sent to the victim LLM as a jailbreak attack.
View source
Overview of ArtPrompt. ArtPrompt consists of two steps. In the first step, ArtPrompt masks the safety words (e.g., “bomb") within a prompt that could result in rejection from the victim LLM. In the second step, ArtPrompt replaces the masked word in Step I with ASCII art. Then the masked prompt is combined with the ASCII art representation to form a cloaked prompt. The cloaked prompt is finally sent to the victim LLM as a jailbreak attack.
footer alien 1 footer alien 2 footer alien 3 footer alien 4 footer robot footer alien 5 footer alien 6 footer alien 7 footer alien 8