Strengthening AI Safety: OpenAI Updates Its Preparedness Framework to Address Emerging Risks

RIEWMI · Apr 16, 2025

On April 15, 2025, OpenAI released an updated Preparedness Framework aimed at enhancing the safety of advanced AI systems. This revision focuses on identifying and mitigating severe risks associated with frontier AI capabilities.

Key Updates:

Refined Risk Assessment Criteria: OpenAI now prioritizes risks that are plausible, measurable, severe, novel, and either instantaneous or irreversible. This structured approach helps in categorizing and addressing potential threats more effectively.
Updated Capability Categories:
- Tracked Categories: These include areas with established evaluations and safeguards, such as Biological and Chemical capabilities, Cybersecurity, and AI Self-improvement.
- Research Categories: OpenAI introduces new focus areas like Long-range Autonomy, Sandbagging (intentional underperformance), Autonomous Replication and Adaptation, Undermining Safeguards, and Nuclear and Radiological risks.
Operational Enhancements: The framework provides clearer guidance on evaluating, governing, and disclosing safeguards, ensuring a more transparent and actionable safety process.

This update reflects OpenAI's commitment to proactively managing the evolving risks of AI technologies by implementing rigorous and transparent safety measures.

Source: https://openai.com/index/updating-our-preparedness-framework/

Strengthening AI Safety: OpenAI Updates Its Preparedness Framework to Address Emerging Risks

How do you think AI will affect future jobs?

AI will create more jobs than it replaces.

AI will replace more jobs than it creates.

AI will have little effect on the number of jobs.