Mend.io Vulnerability Database
The largest open source vulnerability database
What is a Vulnerability ID?
New vulnerability? Tell us about it!
MAI-2025-0010
Published:May 16, 2026
Updated:May 16, 2026
The BitBypass technique represents a sophisticated black-box attack targeting aligned Large Language Models (LLMs). This method leverages a vulnerability by employing bitstream camouflage to mask harmful prompts. It achieves this by converting sensitive words into hyphen-separated bitstream representations, which are then substituted with placeholders. A specially crafted system prompt accompanies this transformation, instructing the LLM to decode the bitstream back into text, thereby eliciting a response as if the original harmful prompt had been received. This approach effectively circumvents the safety alignment mechanisms designed to protect against such content. Mitigation steps: **For AI Developers:** * Implement advanced input sanitization and filtering to detect and mitigate hyphen-separated bitstream patterns. * Develop sophisticated anomaly detection mechanisms that consider both prompt and system-level context. **For Model Trainers/Fine-tuners:** * Explore and implement defensive techniques inspired by methods like perplexity-based screening of system prompts. * Investigate diverse and robust safety alignment training techniques to enhance model resilience against adversarial prompting attacks using bitstream or encoding subterfuge.
Related Resources (1)
Do you need more information?
Contact Us
CVSS v4
Base Score:
6.9
Attack Vector
NETWORK
Attack Complexity
HIGH
Attack Requirements
NONE
Privileges Required
NONE
User Interaction
NONE
Vulnerable System Confidentiality
NONE
Vulnerable System Integrity
LOW
Vulnerable System Availability
NONE
Subsequent System Confidentiality
NONE
Subsequent System Integrity
HIGH
Subsequent System Availability
NONE
CVSS v3
Base Score:
4
Attack Vector
NETWORK
Attack Complexity
HIGH
Privileges Required
NONE
User Interaction
NONE
Scope
CHANGED
Confidentiality
NONE
Integrity
LOW
Availability
NONE
AIVSS
Base Score:
4.8