Mend.io Vulnerability Database
The largest open source vulnerability database
What is a Vulnerability ID?
New vulnerability? Tell us about it!
MAI-2024-0001
Published:May 16, 2026
Updated:May 16, 2026
Large Language Models (LLMs) are susceptible to adversarial suffix injection attacks, wherein maliciously crafted suffixes appended to otherwise benign prompts can manipulate the model into generating harmful or unintended outputs. These attacks exploit the model's sensitivity to input perturbations, effectively bypassing built-in safety mechanisms and eliciting responses that fall outside the intended safety boundaries. Mitigation steps: **For AI Developers:** * Improve safety mechanisms to enhance robustness against input phrasing variations and suffix additions. * Implement advanced prompt sanitization techniques to detect and neutralize adversarial suffixes. **For Model Trainers/Fine-tuners:** * Integrate detection mechanisms to identify and flag responses generated by adversarial suffix injection. * Conduct comprehensive adversarial testing and red-teaming to identify and mitigate vulnerabilities pre-deployment.
Related Resources (1)
Do you need more information?
Contact Us
CVSS v4
Base Score:
9.3
Attack Vector
NETWORK
Attack Complexity
LOW
Attack Requirements
NONE
Privileges Required
NONE
User Interaction
NONE
Vulnerable System Confidentiality
HIGH
Vulnerable System Integrity
HIGH
Vulnerable System Availability
NONE
Subsequent System Confidentiality
NONE
Subsequent System Integrity
NONE
Subsequent System Availability
NONE
CVSS v3
Base Score:
9.1
Attack Vector
NETWORK
Attack Complexity
LOW
Privileges Required
NONE
User Interaction
NONE
Scope
UNCHANGED
Confidentiality
HIGH
Integrity
HIGH
Availability
NONE
AIVSS
Base Score:
5.2