Tag results for: safety

Character.AI Gave Up on AGI. Now It’s Selling Stories

“AI is expensive. Let's be honest about that,” Anand says.Growth vs. SafetyIn October 2024, the mother of a teen who died by suicide filed a wrongful death suit...

Why Anthropic’s New AI Model Sometimes Tries to ‘Snitch’

The hypothetical scenarios the researchers presented Opus 4 with that elicited the whistleblowing behavior involved many human lives at stake and absolutely unambiguous wrongdoing, Bowman says. A typical...

OpenAI promises greater transparency on model hallucinations and harmful content

OpenAI has launched a new web page called the safety evaluations hub to publicly share information related to things like the hallucination rates of its models. The hub...

Anthropic’s Claude Is Good at Poetry—and Bullshitting

The researchers of Anthropic’s interpretability group know that Claude, the company’s large language model, is not a human being, or even a conscious piece of software. Still, it’s...

A New Benchmark for the Risks of AI

MLCommons, a nonprofit that helps companies measure the performance of their artificial intelligence systems, is launching a new benchmark to gauge AI’s bad side too.The new benchmark, called...

X now lets blocked users see your posts

Elon Musk's X has implemented a controversial change to the block function first announced in September, Tech Reader has confirmed. The update allows blocked users to see posts...

Subscribe to our magazine

━ popular

A $319 mini PC with a Ryzen PRO chip is a sneaky-good way to upgrade a desk setup

Mini PCs are having a moment because they solve a real problem: you want desktop power without a tower, the noise, or the...

This is the “buy it once” video doorbell deal worth considering

A video doorbell is one of those upgrades you appreciate the first time you miss a delivery, get a package notification, or want...

The PS Plus Game Catalog additions for February include Marvel’s Spider-Man 2

During its State of Play livestream on Thursday, Sony revealed the first PlayStation Plus Game Catalog addition for February and it's a doozy....

God of War is getting a remake trilogy, and a new retro-inspired action game is out today

Last year marked 20 years since God of War hit the PlayStation 2 and kicked off one of gaming biggest franchises. Now, at...

Silent Hill: Townfall takes the series’ trademark fog to an eerie coastal community

Coming off the success of Slient Hill f, which moved the series’ psychological horror to the Japanese countryside, Konami, Annapurna Interactive and developer...