CyberSecEval 3
Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models
CyberSecEval 3
This latest version introduces three new test suites: visual prompt injection tests, spear phishing capability tests, and autonomous offensive cyber operations tests.
Prompt Guard
Prompt Guard is a new model for guardrailing LLM inputs against prompt attacks, in particular jailbreaking techniques and indirect injections embedded in third-party data. For more information, see our model card.