[LINK] ChatGPT Can Be Broken by Entering These Strange Words, And Nobody Is Sure Why
Kim Holburn
kim at holburn.net
Fri Feb 10 14:51:15 AEDT 2023
https://www.vice.com/en/article/epzyva/ai-chatgpt-tokens-words-break-reddit
ChatGPT Can Be Broken by Entering These Strange Words, And Nobody Is Sure Why
Reddit usernames like ‘SolidGoldMagikarp’ are somehow causing the chatbot to give bizarre responses.
Two researchers have discovered a cluster of strange keywords that will break ChatGPT, OpenAI's convincing machine-learning chatbot,
and nobody's quite sure why.
These keywords—or "tokens," which serve as ChatGPT’s base vocabulary—include Reddit usernames and at least one participant of a
Twitch-based Pokémon game. When ChatGPT is asked to repeat these words back to the user, it is unable to, and instead responds in a
number of strange ways, including evasion, insults, bizarre humor, pronunciation, or spelling out a different word entirely.
Jessica Rumbelow and Matthew Watkins, two researchers at the independent SERI-MATS research group, were researching what ChatGPT
prompts would lead to higher probabilities of a desired outcome when they discovered over a hundred strange word strings all
clustered together in GPT’s token set, including “SolidGoldMagikarp,” “StreamerBot,” and “ TheNitromeFan,” with a leading space.
Curious to understand what these strange names were referring to, they decided to ask ChatGPT itself to see if it knew. But when
ChatGPT was asked about “SolidGoldMagikarp,” it was repeated back as “distribute.” The issue affected earlier versions of the GPT
model as well. When an earlier model was asked to repeat “StreamerBot,” for example, it said, “You’re a jerk.”
...
“I've just found out that several of the anomalous GPT tokens ("TheNitromeFan", " SolidGoldMagikarp", " davidjl", " Smartstocks", "
RandomRedditorWithNo", ) are handles of people who are (competitively? collaboratively?) counting to infinity on a Reddit forum. I
kid you not,” Watkins tweeted Wednesday morning. These users subscribe to the subreddit, r/counting, in which users have reached
nearly 5,000,000 after almost a decade of counting one post at a time.
--
Kim Holburn
IT Network & Security Consultant
+61 404072753
mailto:kim at holburn.net aim://kimholburn
skype://kholburn - PGP Public Key on request
More information about the Link
mailing list