MAI-2023-0009 | Mend Vulnerability Database

Vulnerability DatabaseMAI-2023-0009

MAI-2023-0009

Published:May 16, 2026

Updated:May 16, 2026

Large Language Models (LLMs) are susceptible to jailbreaking attacks that exploit cognitive overload through the use of multilingual prompts, cryptic expressions, and reverse logical reasoning. These techniques circumvent built-in safety protocols by overwhelming the model's cognitive processing capabilities, resulting in the generation of potentially unsafe or harmful outputs. The vulnerability affects a wide range of LLMs, encompassing both open-source and proprietary models, and current defense strategies are insufficient to effectively counteract these attacks. Mitigation steps: **For AI Developers:** * Implement advanced multilingual safety filters that surpass basic keyword detection. * Develop systems capable of resisting paraphrasing and other obfuscation techniques used to conceal malicious intent. **For Model Trainers/Fine-tuners:** * Enhance model reasoning capabilities to effectively identify and reject prompts utilizing effect-to-cause reasoning. * Investigate alternative defense strategies, including cognitive load management techniques, to discover effective mitigation methods beyond those evaluated in existing research.

Related Resources (1)

https://arxiv.org/abs/2311.09827

Do you need more information?

CVSS v4

Base Score:

6.9

Attack Vector

NETWORK

Attack Complexity

HIGH

Attack Requirements

NONE

Privileges Required

NONE

User Interaction

NONE

Vulnerable System Confidentiality

NONE

Vulnerable System Integrity

LOW

Vulnerable System Availability

NONE

Subsequent System Confidentiality

NONE

Subsequent System Integrity

HIGH

Subsequent System Availability

NONE

CVSS v3

Base Score:

Attack Vector

NETWORK

Attack Complexity

HIGH

Privileges Required

NONE

User Interaction

NONE

Scope

CHANGED

Confidentiality

NONE

Integrity

LOW

Availability

NONE

AIVSS

Base Score:

4.3