Anthropic details Constitutional Classifiers, a protective LLM layer designed to stop AI model jailbreaking by monitoring inputs and outputs for harmful content (Cristina Criddle/Financial Times)

Cristina Criddle / Financial Times: Anthropic details Constitutional Classifiers, a protective LLM layer designed to stop AI model jailbreaking by monitoring inputs and outputs for harmful content  —  Leading tech groups including Microsoft and Meta also invest in similar safety systems  —  Artificial intelligence start …

Feb 4, 2025 - 11:06
 0
Anthropic details Constitutional Classifiers, a protective LLM layer designed to stop AI model jailbreaking by monitoring inputs and outputs for harmful content (Cristina Criddle/Financial Times)

Cristina Criddle / Financial Times:
Anthropic details Constitutional Classifiers, a protective LLM layer designed to stop AI model jailbreaking by monitoring inputs and outputs for harmful content  —  Leading tech groups including Microsoft and Meta also invest in similar safety systems  —  Artificial intelligence start …