Researchers find hole in AI guardrails by using strings like =coffee
Large language models frequently ship with "guardrails" designed to catch malicious input and harmful output. ...