AIWI / Case Studies / Right to Warn & Daniel Kokotajlo
Right to Warn & Daniel Kokotajlo
Following news about OpenAI’s restrictive non-disparagement agreements, an open letter, entitled A Right to Warn about Advanced Artificial Intelligence, was signed by 13 current and former employees at OpenAI and Google DeepMind. The group warned about serious AI risks and their companies’ cultures, as well as calling for stronger whistleblower protections. The coalition emerged after Daniel Kokotajlo—who shared his departure and growing concerns on LessWrong,where he revealed that he had declined to sign a non-disparagement agreement with OpenAI—a decision that risked the forfeiture of nearly $2 million in vested equity. His public statement sparked the conversations that brought the group together, calling on advanced AI companies to commit to principles ensuring employees’ right to warn the public about AI risks.
Read the Press Coverage:
Company
OpenAI
Google DeepMind
Jurisdiction
US
Year
2024
Issues
Suppression of Knowledge on AI Risks
Existential Risks and Stronger Whistleblower Protections
Channels
Public and Regulatory
Why This Case Matters
Daniel Kokotajlo is now the Executive Director of the AI Futures Project having left OpenAI due to concerns about unchecked AI development. Neel Nanda, another signatory of the Right to Warn letter, still works at Google DeepMind and runs the mechanistic interpretability team. Jacob Hilton, one of several former OpenAI signatories, is Executive Director at the Alignment Research Center. William Saunders and Daniel Ziegler, both former OpenAI employees and signatories of the letter, now work at Anthropic.
Following the public attention on restrictive non-disparagement agreements in May 2024, OpenAI removed these clauses from future employment contracts and released former employees from existing obligations (except where mutual). Anthropic also removed non-disparagement terms from its standard agreements and stated it would not enforce existing ones against employees raising safety concerns. Earlier in April 2024, OpenAI published its Raising Concerns Policy, outlining ways employees may raise issues and prohibits retaliation against those who do so. AIWI has reviewed this policy, details of which can be found here.
In July 2024, OpenAI whistleblowers filed a complaint with the US Securities and Exchange Commission (SEC) alleging the company’s NDAs violated federal whistleblower protections. While it hasn’t been publicly disclosed whether any direct action was taken in relation to this complaint, five US senators signed a letter to Sam Altman in light of the complaint and news about the company’s restrictive NDAs. The case has influenced several pieces of legislation: Senator Chuck Grassley introduced the bipartisan AI Whistleblower Protection Act; California’s SB-53 (building on earlier efforts including SB-1047, which already included whistleblower provisions) and Michigan’s AI Safety and Security Transparency Act now both include whistleblower protection provisions; and SB-53 (the Transparency in Frontier Artificial Intelligence Act ‘TFAIA’) was signed into law by Governor Newsom in September 2025.
Support Received
- Legal Support Lawrence Lessig (pro-bono) [1]
- Legal Support of the Anonymous OpenAI Whistleblowers Kohn, Kohn & Colapinto [2], (who separately represented anonymous OpenAI whistleblowers in the SEC complaint) [3]
You don’t need to be in a crisis to seek help. Pro bono legal counsel can help you navigate your situation and find the right support.
Explore Your Legal Options →
Timeline
Outcomes
For The (Publicly Known) Right to Warn Signatories
Daniel Kokotajlo (Former OpenAI Governance Researcher)
- Current role: Co-founded and leads as Executive Director of the AI Futures Project, a nonprofit which researches the future impact of artificial intelligence.
- Recognition: Named in TIME’s 100 Most Influential People in AI 2024.
William Saunders (Former OpenAI Research Engineer)
- Left OpenAI in February 2024 after three years on the Superalignment team. He became a public advocate for AI safety by speaking to major media outlets, giving interviews, supporting new regulations, and appearing on podcasts.
- Public advocacy: Testified before U.S. Senate in September 2024 that AI companies are racing ahead without knowing how to make advanced AI safe, and that stronger oversight and whistleblower protections are urgently needed [28].
Jacob Hilton (Former OpenAI Researcher)
- Current role: Executive director at the Alignment Research Center, which focuses on ensuring AI systems behave safely, even as they scale in capability.
- Public advocacy: Published an opinion piece in TIME in 2025 arguing OpenAI must uphold its charitable mission, and appeared as a panellist advocating for improved whistleblower protections for AI workers at a webinar hosted by Americans for Responsible Innovation (ARI).
Neel Nanda (Current Google DeepMind Research Engineer)
- Current role: Leads the mechanistic interpretability team at Google DeepMind, focused on reverse engineering neural networks to understand what AI models learn internally.
- Recognition: Named by MIT Technology Review in their Innovator Under 35 List, having advanced at age 26 to lead an AI safety team, publish dozens of influential papers, and mentor 50 junior researchers [29].
Daniel Ziegler
- Current role: Member of Technical Staff at Anthropic, in Alignment Stress-Testing.
- Career Development: Having left OpenAI in 2021, worked at prominent AI safety orgs: lead of the adversarial training team at Redwood Research, and a member of the Technical Staff at METR (formerly ARC Evals)
Carroll Wainwright (Former OpenAI Researcher)
- Background: Co-founder of Metaculus, one of the most prominent forecasting platforms used by researchers, policymakers, and forecasting communities worldwide. Served as CTO from 2014-2021, building a platform that has facilitated thousands of successful predictions on scientific, technological, and geopolitical questions.
- Former role: Member of Technical Staff at OpenAI (September 2021 – June 2024), working on the alignment team. Co-authored the influential InstructGPT paper on training language models with human feedback—a foundational technique that underpins ChatGPT.
- Education: PhD in Physics from UC Santa Cruz.
- Left OpenAI in June 2024, publicly stating his faith in the non-profit structure had “significantly waned” and expressing concerns about the board’s ability to effectively control the for-profit subsidiary and prioritize the mission over profits.
- Current role: Member of Technical Staff at an undisclosed organization (August 2024–present), according to his LinkedIn profile.
Ramana Kumar (Former Google DeepMind Research Engineer)
- Former role: Research engineer at Google DeepMind where he focused on AI safety, formal verification, and interactive theorem proving. Research contributions: Published influential work on AI safety frameworks, including studies on reward tampering, agent incentives, and corrigibility. Collaborated with prominent DeepMind safety researchers including Victoria Krakovna and Tom Everitt on foundational problems in AI alignment.
- Background: Received Future of Life Institute funding for research on applying formal verification to reflective reasoning in AI systems.
- Current status: Current position unknown; described as “formerly of Google DeepMind” as of mid-2024. However, the Verifereum GitHub repository shows he is still actively leading this open-source project, with 842 commits and recent updates. Verifereum focuses on formal verification of Ethereum smart contracts using the HOL4 theorem prover. The project has achieved notable progress, including hosting their first community event (Higher Order Log Cabin) in February 2025.
For the Case
The Right to Warn letter’s publication, combined with revelations about OpenAI’s restrictive non disparagement agreements, shifted public discourse around the company. Following these disclosures, (which occurred soon after the departures of key OpenAI safety leaders including Chief Scientist Ilya Sutskever and Superalignment co-lead Jan Leike) the company began to face serious questions about its true priorities and commitment to safety practices [30].
The threat of equity revocation particularly damaged the company’s reputation, reinforcing perceptions that commercial priorities were eclipsing safety commitments — a view widely discussed across AI safety communities on platforms like LessWrong [31].
Regulatory and Legislative Response
While it hasn’t been publicly disclosed whether any direct action was taken in relation to the SEC complaint, five US senators signed a letter (July 2024) to OpenAI’s Sam Altman in light of the complaint and news in the media about the company’s restrictive NDAs [32]. They requested confirmation that any provisions used to penalize employees who raise concerns be removed from the agreements. They also wanted to know what the company was doing to meet its safety commitments, and how progress on this was internally evaluated.
Industry-Wide Effects
Public discussion followed in July 2024 around the inclusion of similar non disparagement terms in Anthropic’s standard severance agreements [33]. This led to co-founder Sam McCandlish posting on LessWrong that since June 1st they have been removing such terms; anyone who had signed such an agreement were free to state this fact; and that they would not be enforcing the agreement.
The broader impact on industry norms has since been tracked by external assessors. The FLI AI Safety Index, for example, now regularly evaluates frontier AI companies on whistleblowing policy transparency and the use of non-disparagement agreements.
As of its most recent edition (January 2026), OpenAI remains the only major AI company to have published its full whistleblowing policy; Anthropic has shared details of its policy and committed to publishing it publicly (we shared our preliminary thoughts on this); However, the FLI AI Safety Index reveals that most major AI companies have not clearly addressed whether their non-disparagement agreements restrict safety-related whistleblowing. Only Anthropic (score: 10) received full marks for having an explicit statement that NDAs and non-disparagement agreements cannot prevent safety-related whistleblowing. OpenAI and Google each scored 7 out of 10 in this category, while Meta, xAI, DeepSeek, Z.ai, and Alibaba (all score: 0) have not publicly disclosed any such protections.
OpenAI Policy Changes
In October 2024, OpenAI published its Raising Concerns Policy on its official website – while noting it “has long had” the policy prior to its publishing [34]. Their policy, (which was updated and expanded upon in January 2026) outlines ways employees may raise issues and prohibits retaliation against those who do so. It also explicitly states that employees have the right to make reports to government agencies such as the SEC. Based on AIWI’s preliminary evaluation, significant issues with the existing policy include: anti-retaliation protections may be voluntary rather than legally binding; no assurance is given that attorney-client privilege won’t shield investigations from discovery; and there is an unclear structural separation from Legal/HR/executives.
Federal Legislative Impact
Chuck Grassley, then Ranking Member of the Senate Budget Committee, had also sent a letter (August 2024) to Altman requesting details of potential SEC investigations; any changes to NDAs; and information about the number of employee requests to disclose information to federal authorities.
In May 2025, in the position of Senate Judiciary Committee Chair, he introduced the bipartisan AI Whistleblower Protection Act [35].
The federal legislation provides a robust legal framework designed to offer an absolute ban on retaliation against employees who disclose AI risks:
- Guaranteed voice: Renders restrictive NDAs unenforceable and protects “good faith” reports of (among others) security vulnerabilities
- Procedural protection: Strips away forced arbitration (a common tool for keeping corporate misconduct private) and allows whistleblowers to take their claims to open federal court
- Economic restitution: Provides “make-whole” remedies like double back pay and legal fee coverage, helping to remove the financial burden on those considering speaking out
While the Bill is still moving through Congress, the Act has been co-sponsored by several other senators and endorsed by major organisations including the National Whistleblower Center, Government Accountability Project, Center for AI Policy, and Americans for Responsible Innovation [36]. Companion legislation was introduced in the House of Representatives.
State-Level Impact
The case elevated whistleblower protection and transparency as a core AI policy concern at the state level.
California’s SB-53 (the Transparency in Frontier Artificial Intelligence Act ‘TFAIA’) was signed into law by Governor Gavin Newsom in September 2025. SB53’s key strengths in terms of whistleblower protections include its reasonable-belief standard, shifted burden of proof, and access to injunctive relief, rebalancing power between employees and employers and deterring retaliation.
The new law builds on policy recommendations from a report written by a working group of leading academics and experts convened by Newsom, which makes reference to the calls to action of the Right to Warn group [37]. In public statements supporting the bill, its sponsor, Senator Scott Wiener, emphasized the need for “commonsense guardrails to understand and reduce [AI] risk”, while advocacy groups backing SB-53 highlighted the importance of transparency and the ability of workers to raise safety concerns without fear of retaliation [38].
While still under consideration, Michigan’s AI Safety and Security Transparency Act (introduced in June 2025) similarly would require major AI developers to implement and publish safety and security protocols, submit to independent audits, and includes anti-retaliation protections for employees involved in reporting risks.
Rob Eleveld, CEO of Transparency Coalition, a nonprofit focused on AI guardrails, testified in support of the bill. Referring to the importance of whistleblowers’ protections in understanding the latest developments in AI, he stated:
“Waiting for the federal government to do something is just not going to happen…if Michigan passed HB 4668, it puts more pressure, along with Colorado, along with New York, along with California, along with other state…for the federal government to act.” [39].
Your concerns matter and deserve expert answer
If you see something concerning, it’s already worth a confidential conversation. AIWI can help you find clarity with the support of our expert network. Confidential, anonymous and on your terms.


