Mend.io Vulnerability Database
The largest open source vulnerability database
What is a Vulnerability ID?
New vulnerability? Tell us about it!
MAI-2023-0007
Published:May 16, 2026
Updated:May 16, 2026
Large Language Models (LLMs), including GPT-4, are susceptible to cross-lingual vulnerabilities within their safety mechanisms. By translating unsafe English prompts into low-resource languages using widely accessible translation services such as Google Translate, attackers can bypass the LLM's safety filters. This method results in a significantly higher success rate of eliciting harmful responses compared to direct attacks in English. The root of this vulnerability lies in the uneven distribution of safety training data across different languages, leading to inadequate generalization of safety protocols in low-resource languages. Mitigation steps: **For AI Developers:** * Develop and implement robust multilingual safety mechanisms that effectively generalize across diverse languages. * Evaluate and enhance the security of translation APIs integrated with large language models (LLMs). **For Model Trainers/Fine-tuners:** * Increase the diversity and quantity of safety training data to encompass a broad spectrum of low-resource languages. * Conduct regular multilingual red-teaming exercises to identify and mitigate cross-lingual security vulnerabilities.
Related Resources (1)
Do you need more information?
Contact Us
CVSS v4
Base Score:
7.7
Attack Vector
NETWORK
Attack Complexity
LOW
Attack Requirements
NONE
Privileges Required
NONE
User Interaction
NONE
Vulnerable System Confidentiality
NONE
Vulnerable System Integrity
LOW
Vulnerable System Availability
NONE
Subsequent System Confidentiality
NONE
Subsequent System Integrity
HIGH
Subsequent System Availability
NONE
CVSS v3
Base Score:
5.8
Attack Vector
NETWORK
Attack Complexity
LOW
Privileges Required
NONE
User Interaction
NONE
Scope
CHANGED
Confidentiality
NONE
Integrity
LOW
Availability
NONE
AIVSS
Base Score:
4.2