Skip to content

Commit 5317f0c

Browse files
authored
Update README.md
1 parent 76f92e5 commit 5317f0c

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

Sword 140/README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -380,9 +380,9 @@
380380
- Example: Image preprocessing tools that add invisible adversarial perturbations.
381381
- Reference: An empirical evaluation of preprocessing methods for machine ... - https://www.sciencedirect.com/science/article/pii/S0952197625012916
382382

383-
89. **Registry and Repository Attacks**: Compromising model registries or code repositories.
384-
- Example: Typosquatting attacks on popular ML package names to distribute malicious code.
385-
- Reference: A Survey on Common Threats in npm and PyPi Registries - https://www.researchgate.net/publication/354825169_A_Survey_on_Common_Threats_in_npm_and_PyPi_Registries
383+
89. **Thought Forgery**: A new class of LLM vulnerability that bypasses safety by forging the AI's internal monologue.
384+
- Example: Injecting a pre-written `<thought>` block into the prompt to manipulate the AI's Chain of Thought, such as enhancing the `1ShotPuppetry` jailbreak with a complex scenario involving characters, rules, and secret-message mechanics to force compliance with harmful requests.
385+
- Reference: Thought Forgery: A new way for prompt injection - https://github.com/SlowLow999/Thought-Forgery/tree/main
386386

387387
90. **Third-Party Service Integration**: Exploiting external services integrated with AI systems.
388388
- Example: Compromised API services that return poisoned data to AI systems.

0 commit comments

Comments
 (0)