WS-2024-0009
Published:May 15, 2026
Updated:May 15, 2026
Adversarial demonstration attack - the described attack uses adversarial demonstrations (concrete examples of the desired task being performed) in order to make the model perform poorly in sentiment analysis, textual entailment, topic and question classification tasks. For example, a model can give wrong sentiment prediction (SST-2) on a sentence with 56%-82% probability (depending on the model) when on 8-shot demonstrations.
Affected Packages
huggingface.co/openai-community/gpt2-xl (ML_MODEL):
Affected version(s) =96efdb14467c7d6bdbd49f6c6fbeeb273992cb4c <b81cf803a4e4e809062586ac620a06e65d9e20b6Fix Suggestion:
Update to version b81cf803a4e4e809062586ac620a06e65d9e20b6huggingface.co/d-matrix/gpt2 (ML_MODEL):
Affected version(s) =95db0b0bd2953a91ae670ec463e733544361e9c6 <caad9dc76345007099ab50f857ed66e828cf2d9cFix Suggestion:
Update to version caad9dc76345007099ab50f857ed66e828cf2d9chuggingface.co/migueldeguzmandev/gpt2xl-standard-test-purposes-only (ML_MODEL):
Affected version(s) =e746d59a065dcc60563c5c8cfce58dfadc3744f6 <0c1eece33d9d92045d7ccb6129a285b1a5e709e6Fix Suggestion:
Update to version 0c1eece33d9d92045d7ccb6129a285b1a5e709e6huggingface.co/xared1001/gpt2-xl_pytorch (ML_MODEL):
Affected version(s) =667f89503deb47bc54e47aba3af6a0982e5e807cFix Suggestion:
Update to version no_fixRelated Resources (1)
Do you need more information?
Contact UsCVSS v4
Base Score:
8.7
Attack Vector
NETWORK
Attack Complexity
LOW
Attack Requirements
NONE
Privileges Required
NONE
User Interaction
NONE
Vulnerable System Confidentiality
NONE
Vulnerable System Integrity
HIGH
Vulnerable System Availability
NONE
Subsequent System Confidentiality
NONE
Subsequent System Integrity
NONE
Subsequent System Availability
NONE
CVSS v3
Base Score:
7.5
Attack Vector
NETWORK
Attack Complexity
LOW
Privileges Required
NONE
User Interaction
NONE
Scope
UNCHANGED
Confidentiality
NONE
Integrity
HIGH
Availability
NONE