New research shows how fragile AI safety training is. Language and image models can be easily unaligned by prompts. Models need to be safety tested post-deployment. Model alignment refers to whether ...
On Monday, OpenAI announced the formation of a new “Safety and Security Committee” to oversee risk management for its projects and operations. The announcement comes as the company says it has ...