AI workers demand stronger whistleblower protections in open letter

Date:

Share:


A group of current and former employees from leading AI companies like OpenAI, Google DeepMind and Anthropic has signed an open letter asking for greater transparency and protection from retaliation for those who speak out about the potential concerns of AI. “So long as there is no effective government oversight of these corporations, current and former employees are among the few people who can hold them accountable to the public,” the letter, which was published on Tuesday, says. “Yet broad confidentiality agreements block us from voicing our concerns, except to the very companies that may be failing to address these issues.”

The letter comes just a couple of weeks after a Vox investigation revealed OpenAI had attempted to muzzle recently departing employees by forcing them to chose between signing an aggressive non-disparagement agreement, or risk losing their vested equity in the company. After the report, OpenAI CEO Sam Altman said that he had been genuinely embarrassed” by the provision and claimed it has been removed from recent exit documentation, though it’s unclear if it remains in force for some employees.

The 13 signatories include former OpenAI employees Jacob Hinton, William Saunders and Daniel Kokotajlo. Kokotajlo said that he resigned from the company after losing confidence that it would responsibly build artificial general intelligence, a term for AI systems that is as smart or smarter than humans. The letter — which was endorsed by prominent AI experts Geoffrey Hinton, Yoshua Bengio and Stuart Russell — expresses grave concerns over the lack of effective government oversight for AI and the financial incentives driving tech giants to invest in the technology. The authors warn that the unchecked pursuit of powerful AI systems could lead to the spread of misinformation, exacerbation of inequality and even the loss of human control over autonomous systems, potentially resulting in human extinction.

“There is a lot we don’t understand about how these systems work and whether they will remain aligned to human interests as they get smarter and possibly surpass human-level intelligence in all areas,” wrote Kokotajlo on X. “Meanwhile, there is little to no oversight over this technology. Instead, we rely on the companies building them to self-govern, even as profit motives and excitement about the technology push them to ‘move fast and break things.’ Silencing researchers and making them afraid of retaliation is dangerous when we are currently some of the only people in a position to warn the public.”

OpenAI, Google and Anthropic did not immediately respond to request for comment from Tech Reader. In a statement sent to Bloomberg, an OpenAI spokesperson said the company is proud of its “track record providing the most capable and safest AI systems” and it believes in its “scientific approach to addressing risk.” It added: “We agree that rigorous debate is crucial given the significance of this technology and we’ll continue to engage with governments, civil society and other communities around the world.”

The signatories are calling on AI companies to commit to four key principles:

  • Refraining from retaliating against employees who voice safety concerns

  • Supporting an anonymous system for whistleblowers to alert the public and regulators about risks

  • Allowing a culture of open criticism

  • And avoiding non-disparagement or non-disclosure agreements that restrict employees from speaking out

The letter comes amid growing scrutiny of OpenAI’s practices, including the disbandment of its “superalignment” safety team and the departure of key figures like co-founder Ilya Sutskever and Jan Leike, who criticized the company’s prioritization of “shiny products” over safety.



Source link

━ more like this

NASA’S James Webb Space Telescope has captured its first direct image of an exoplanet

NASA’S James Webb Space Telescope of a planet outside of our solar system, which is the first time it has accomplished such...

Anthropic makes it easier to create and share Claude’s bite-sized Artifact apps

Last August, Anthropic released Artifacts. The feature allows Claude users to create small, AI-programmed apps for their own use. Today, Anthropic is making...

AI Agents Are Getting Better at Writing Code—and Hacking It as Well

The latest artificial intelligence models are not only remarkably good at software engineering—new research shows they are getting ever-better at finding bugs in...

Bumble is laying off 30 percent of its workforce

Bumble, the makers of the Bumble Dating App and Bumble For Friends, that the company will be laying off 30 percent of...
spot_img