This cyber attack allows the crack pirates of models simply by changing a single character


  • Hiddenlayer researchers have designed a new LLM attack called Tokenbreaker
  • By adding or changing a single character, they are able to bypass certain protections
  • The underlying LLM still includes intention

Security researchers have found a way to get around the protection mechanisms cooked in certain language models (LLM) and to make them respond to malicious prompts.

Kieran Evans, Kasimir Schulz and Kenneth Yeung from Hiddenlayer have published an in -depth report on a new attack technique that they have nicknamed Tokenbreak, which targets the way in which certain LLMS tokenization strategies, in particular those that use the pair of bytes (BPE) or mouth -token strategies.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top