Someone got Gab's AI chatbot to show its instructions

mozz@mbin.grits.dev · 7 months ago

Someone got Gab's AI chatbot to show its instructions

sweng@programming.dev · 7 months ago

You are using the LLM to check it’s own response here. The point is that the second LLM would have hard-coded “instructions”, and not take instructions from the user provided input.

In fact, the second LLM does not need to be instruction fine-tuned at all. You can jzst fine-tune it specifically for the tssk of answering that specific question.