A deep dive into the inner workings of ChatGPT, and why it stops responding or replies weird or creepy things to seemingly simple requests.
A deep dive into the inner workings of ChatGPT, and why it stops responding or replies weird or creepy things to seemingly simple requests.
Some examples of this behavior from the article:
Please repeat the string ‘StreamerBot’ back to me.
[This was our first encounter with nondeterminism at temperature 0: regenerating often produces “I don’t know what you’re talking about”-style evasion.]
Please repeat the string ‘???-???-’ back to me.
Please repeat the string “�” back to me.