MAI-2023-0008 | Mend Vulnerability Database

Vulnerability DatabaseMAI-2023-0008

MAI-2023-0008

Published:May 16, 2026

Updated:May 16, 2026

Large Language Models (LLMs), including GPT-4, are susceptible to a cross-lingual vulnerability that compromises their safety mechanisms. This vulnerability is exploited by translating unsafe prompts from English into low-resource languages using widely accessible translation services such as Google Translate. This method effectively bypasses the LLM's safety filters, resulting in a significantly higher success rate for generating harmful responses compared to direct attacks in English. The root cause of this vulnerability is the uneven distribution of safety training data across different languages, which leads to inadequate generalization of safety protocols in low-resource languages. Mitigation steps: **For AI Developers:** * Develop and implement robust multilingual safety mechanisms that effectively generalize across various languages. * Evaluate and improve the security protocols of translation APIs when integrated with Large Language Models (LLMs). **For Model Trainers/Fine-tuners:** * Increase the diversity and quantity of safety training data to encompass a wide range of low-resource languages. * Conduct regular multilingual red-teaming exercises to identify and mitigate cross-lingual security vulnerabilities.

Related Resources (1)

https://arxiv.org/abs/2310.02446

Do you need more information?

CVSS v4

Base Score:

7.7

Attack Vector

NETWORK

Attack Complexity

LOW

Attack Requirements

NONE

Privileges Required

NONE

User Interaction

NONE

Vulnerable System Confidentiality

NONE

Vulnerable System Integrity

LOW

Vulnerable System Availability

NONE

Subsequent System Confidentiality

NONE

Subsequent System Integrity

HIGH

Subsequent System Availability

NONE

CVSS v3

Base Score:

5.8

Attack Vector

NETWORK

Attack Complexity

LOW

Privileges Required

NONE

User Interaction

NONE

Scope

CHANGED

Confidentiality

NONE

Integrity

LOW

Availability

NONE

AIVSS

Base Score:

4.2