📺 Stream EntrepreneurTV for Free 📺

OpenAI Introduces New Governance Model for AI Safety Oversight Led by Aleksander Madry, a new team evaluates potential risks in unreleased AI models, focusing on cybersecurity threats and other dangers.

By Maxwell William

Key Takeaways

  • OpenAI's board can veto AI model releases, regardless of leadership approval.
  • A new safety approach includes teams for current products, advanced models, and potential risks from future powerful AI systems.
entrepreneur daily

This story originally appeared on Readwrite.com

This story originally appeared on Readwrite.com

OpenAI has introduced a new governance structure that grants its board the authority to withhold the release of AI models, even if company leadership has deemed them safe, according to a recent Bloomberg report. The decision, detailed in recently published guidelines, comes after a tumultuous period at OpenAI, including the temporary ousting of CEO Sam Altman. This event highlighted the delicate balance of power between the company's directors and its executive team.

OpenAI's newly formed "preparedness" team, led by Aleksander Madry of MIT, is tasked with continuously assessing the company's AI systems. The team will focus on identifying and mitigating potential cybersecurity threats and risks related to chemical, nuclear, and biological dangers. OpenAI defines "catastrophic" risks as those capable of causing extensive economic damage or significant harm to individuals.

Madry's team will provide monthly reports to an internal safety advisory group, which will then offer recommendations to Altman and the board. While the leadership team can decide on the release of new AI systems based on these reports, the board retains the final say, potentially overruling any decision made by the company's executives.

OpenAI's three-tiered approach to AI safety

OpenAI's approach to AI safety is structured around three distinct teams:

  1. Safety Systems: This team focuses on current products like GPT-4, ensuring they meet safety standards.
  2. Preparedness: The new team led by Madry evaluates unreleased, advanced AI models for potential risks.
  3. Superalignment: Led by Ilya Sutskever, the Superalignment team will concentrate on future, hypothetical AI systems that could possess immense power.

Each team plays a crucial role in assessing different aspects of AI safety, from existing products to future developments.

The preparedness team will rate AI models as "low," "medium," "high," or "critical" based on perceived risks. OpenAI plans to release only those models rated as "medium" or "low." The team will also implement changes to reduce identified dangers and evaluate the effectiveness of these modifications.

Madry expressed his hope to Bloomberg that other companies will adopt OpenAI's guidelines for their AI models. These guidelines formalize processes that OpenAI has previously used in evaluating and releasing AI technology. Madry emphasized the proactive role in shaping AI's impact: "AI is not something that just happens to us that might be good or bad. It's something we're shaping."

Want to be an Entrepreneur Leadership Network contributor? Apply now to join.

Editor's Pick

Leadership

We've Normalized Testing Our Employees. But Why Don't We Test Our Leaders?

Here's how leaders can grow and improve their leadership and management skills.

Living

This Wine Assortment Can be a Great Mother's Day Gift for $65

Treat your mom to an amazing selection of reds, whites, and a bottle of bubbly with this limited-time Mother's Day discount.

Leadership

The Real Reason You Struggle With Accountability — and What You Can Do to Master It

Uncover how to stop sabotaging your own success, and discover practical steps to mastering accountability.

Marketing

How AI Is Transforming Keyword Research (and Why You Can't Afford to Ignore It)

Learn how AI tools can streamline keyword research, improve content targeting accuracy and boost SERP rankings. Whether you're a beginner or a seasoned professional, this guide is a must-read for success in the digital space.