Someone got Gab's AI chatbot to show its instructions

mozz@mbin.grits.dev · 1 year ago

Someone got Gab's AI chatbot to show its instructions

sweng · 1 year ago

You are using the LLM to check it’s own response here. The point is that the second LLM would have hard-coded “instructions”, and not take instructions from the user provided input.

In fact, the second LLM does not need to be instruction fine-tuned at all. You can jzst fine-tune it specifically for the tssk of answering that specific question.