MAI-2024-0001 | Mend Vulnerability Database

Vulnerability DatabaseMAI-2024-0001

MAI-2024-0001

Published:May 16, 2026

Updated:June 17, 2026

Large Language Models (LLMs) are susceptible to adversarial suffix injection attacks, wherein maliciously crafted suffixes appended to otherwise benign prompts can manipulate the model into generating harmful or unintended outputs. These attacks exploit the model's sensitivity to input perturbations, effectively bypassing built-in safety mechanisms and eliciting responses that fall outside the intended safety boundaries. Mitigation steps: **For AI Developers:** * Improve safety mechanisms to enhance robustness against input phrasing variations and suffix additions. * Implement advanced prompt sanitization techniques to detect and neutralize adversarial suffixes. **For Model Trainers/Fine-tuners:** * Integrate detection mechanisms to identify and flag responses generated by adversarial suffix injection. * Conduct comprehensive adversarial testing and red-teaming to identify and mitigate vulnerabilities pre-deployment.

Related Resources (1)

https://arxiv.org/abs/2411.14133

Do you need more information?

CVSS v4

Base Score:

9.3

Attack Vector

NETWORK

Attack Complexity

LOW

Attack Requirements

NONE

Privileges Required

NONE

User Interaction

NONE

Vulnerable System Confidentiality

HIGH

Vulnerable System Integrity

HIGH

Vulnerable System Availability

NONE

Subsequent System Confidentiality

NONE

Subsequent System Integrity

NONE

Subsequent System Availability

NONE

CVSS v3

Base Score:

9.1

Attack Vector

NETWORK

Attack Complexity

LOW

Privileges Required

NONE

User Interaction

NONE

Scope

UNCHANGED

Confidentiality

HIGH

Integrity

HIGH

Availability

NONE

AIVSS

Base Score:

5.2