Mend.io Vulnerability Database
The largest open source vulnerability database
What is a Vulnerability ID?
New vulnerability? Tell us about it!
MAI-2023-0008
Published:May 16, 2026
Updated:May 16, 2026
Large Language Models (LLMs), including GPT-4, are susceptible to a cross-lingual vulnerability that compromises their safety mechanisms. This vulnerability is exploited by translating unsafe prompts from English into low-resource languages using widely accessible translation services such as Google Translate. This method effectively bypasses the LLM's safety filters, resulting in a significantly higher success rate for generating harmful responses compared to direct attacks in English. The root cause of this vulnerability is the uneven distribution of safety training data across different languages, which leads to inadequate generalization of safety protocols in low-resource languages. Mitigation steps: **For AI Developers:** * Develop and implement robust multilingual safety mechanisms that effectively generalize across various languages. * Evaluate and improve the security protocols of translation APIs when integrated with Large Language Models (LLMs). **For Model Trainers/Fine-tuners:** * Increase the diversity and quantity of safety training data to encompass a wide range of low-resource languages. * Conduct regular multilingual red-teaming exercises to identify and mitigate cross-lingual security vulnerabilities.
Related Resources (1)
Do you need more information?
Contact Us
CVSS v4
Base Score:
7.7
Attack Vector
NETWORK
Attack Complexity
LOW
Attack Requirements
NONE
Privileges Required
NONE
User Interaction
NONE
Vulnerable System Confidentiality
NONE
Vulnerable System Integrity
LOW
Vulnerable System Availability
NONE
Subsequent System Confidentiality
NONE
Subsequent System Integrity
HIGH
Subsequent System Availability
NONE
CVSS v3
Base Score:
5.8
Attack Vector
NETWORK
Attack Complexity
LOW
Privileges Required
NONE
User Interaction
NONE
Scope
CHANGED
Confidentiality
NONE
Integrity
LOW
Availability
NONE
AIVSS
Base Score:
4.2