Anthropic aims to fix one of the biggest problems in AI | Tech Reader

Date:

Share:


the Anthropic logo
Anthropic

Hot on the heels of the announcement that its Claude 3.5 Sonnet large language model beat out other leading models, including GPT-4o and Llama-400B, AI startup Anthropic announced Monday that it plans to launch a new program to fund the development of independent, third-party benchmark tests against which to evaluate its upcoming models.

Per a blog post, the company is willing to pay third-party developers to create benchmarks that can “effectively measure advanced capabilities in AI models.”

“Our investment in these evaluations is intended to elevate the entire field of AI safety, providing valuable tools that benefit the whole ecosystem,” Anthropic wrote in a Monday blog post. “Developing high-quality, safety-relevant evaluations remains challenging, and the demand is outpacing the supply.”

The company wants submitted benchmarks to help measure the relative “safety level” of an AI based on a number of factors, including how well it resists attempts to coerce responses that might include cybersecurity; chemical, biological, radiological, and nuclear (CBRN); and misalignment, social manipulation, and other national security risks. Anthropic is also looking for benchmarks to help evaluate models’ advanced capabilities and is willing to fund the “development of tens of thousands of new evaluation questions and end-to-end tasks that would challenge even graduate students,” essentially testing a model’s ability to synthesize knowledge from a variety of sources, its ability to refuse cleverly worded malicious user requests, and its ability to respond in multiple languages.

Anthropic is looking for “sufficiently difficult,” high-volume tasks that can involve as many as “thousands” of testers across a diverse set of test formats that help the company inform its “realistic and safety-relevant” threat modeling efforts. Any interested developers are welcome to submit their proposals to the company, which plans to evaluate them on a rolling basis.








Source link

━ more like this

How to watch Ariane 6 rocket’s maiden launch on Tuesday | Tech Reader

Ariane 6 first flight (Official broadcast) The European Space Agency (ESA) is about to conduct the maiden launch of its new Ariane 6 rocket....

3 rom-coms on Amazon Prime Video you need to watch in July | Tech Reader

Rom-coms make you feel good any time of the year. But on a nice summer night, it’s the best way to relax in...

Meet David Ellison, Paramount’s Future Boss and Hollywood’s Newest Mogul

David Ellison’s Hollywood career has been defined by high-octane blockbusters filled with suspense, stunts and improbable plot twists.But on Sunday he landed his...

Paramount Takes Major Step to Finalizing Merger With Skydance

A special committee of Paramount’s board on Sunday signed off on a deal to merge with Skydance, according to two people familiar with...

Angel Pop brings cutesy chaos to Playdate in a highly addictive bullet hell

This is it, the first game to nearly make me throw my Playdate against a wall. Angel Pop, a bullet hell shoot ‘em...
spot_img