Lakera launches open-source security benchmark for LLM backends in AI agents

By Express Computer

Check Point Software Technologies Ltd announced the release of the backbone breaker benchmark (b³), an open-source security evaluation designed specifically for the security of the LLM within AI agents.

The b³ is built around a new idea called threat snapshots. Instead of simulating an entire AI agent from start to finish, threat snapshots zoom in on the critical points where vulnerabilities in large language models are most likely to appear. By testing models at these exact moments, developers and model providers can see how well their systems stand up to more realistic adversarial challenges without the complexity and overhead of modelling a full agent workflow.

“We built the b³ benchmark because today’s AI agents are only as secure as the LLMs that power them,” said Mateo Rojas-Carulla, Co-Founder and Chief Scientist at Lakera, a Check Point company. “Threat Snapshots allow us to systematically ...

Copyright of this story solely belongs to expresscomputer.in . To see the full text click HERE

Share:

More related news