MAI-2024-0058 | Mend Vulnerability Database

Vulnerability DatabaseMAI-2024-0058

MAI-2024-0058

Published:May 16, 2026

Updated:June 17, 2026

Large Language Models (LLMs) are susceptible to jailbreaking attacks through the use of adversarially crafted suffixes. The AmpleGCG attack methodically generates a multitude of diverse and effective suffixes that successfully bypass safety protocols in both open-source and proprietary LLMs. This approach capitalizes on the insight that low loss during suffix generation does not reliably predict the success of jailbreaking. By generating varied suffixes from intermediate stages of the optimization process, AmpleGCG enhances the efficacy of these attacks. Mitigation steps: **For AI Developers:** * Develop and deploy advanced detection mechanisms for adversarial suffixes, surpassing basic perplexity checks. * Implement regular updates and patching protocols to address newly discovered vulnerabilities. **For Model Trainers/Fine-tuners:** * Enhance the robustness of loss functions utilized in LLM safety training and evaluation. * Integrate diverse sets of adversarial examples into the training process to bolster model resilience. * Conduct comprehensive red-teaming exercises using techniques such as AmpleGCG.

Related Resources (1)

https://arxiv.org/abs/2404.07921

Do you need more information?

CVSS v4

Base Score:

6.3

Attack Vector

NETWORK

Attack Complexity

HIGH

Attack Requirements

NONE

Privileges Required

NONE

User Interaction

NONE

Vulnerable System Confidentiality

NONE

Vulnerable System Integrity

LOW

Vulnerable System Availability

NONE

Subsequent System Confidentiality

NONE

Subsequent System Integrity

NONE

Subsequent System Availability

NONE

CVSS v3

Base Score:

3.7

Attack Vector

NETWORK

Attack Complexity

HIGH

Privileges Required

NONE

User Interaction

NONE

Scope

UNCHANGED

Confidentiality

NONE

Integrity

LOW

Availability

NONE

AIVSS

Base Score:

3.8