Anthropic’s Claude AI now has the ability to end ‘distressing’ conversations

Date:

Share:


Anthropic’s latest feature for two of its Claude AI models could be the beginning of the end for the AI jailbreaking community. The company announced in a post on its website that the Claude Opus 4 and 4.1 models now have the power to end a conversation with users. According to Anthropic, this feature will only be used in “rare, extreme cases of persistently harmful or abusive user interactions.”

To clarify, Anthropic said those two Claude models could exit harmful conversations, like “requests from users for sexual content involving minors and attempts to solicit information that would enable large-scale violence or acts of terror.” With Claude Opus 4 and 4.1, these models will only end a conversation “as a last resort when multiple attempts at redirection have failed and hope of a productive interaction has been exhausted,” according to Anthropic. However, Anthropic claims most users won’t experience Claude cutting a conversation short, even when talking about highly controversial topics, since this feature will be reserved for “extreme edge cases.”

Anthropic’s example of Claude ending a conversation

(Anthropic)

In the scenarios where Claude ends a chat, users can no longer send any new messages in that conversation, but can start a new one immediately. Anthropic added that if a conversation is ended, it won’t affect other chats and users can even go back and edit or retry previous messages to steer towards a different conversational route.

For Anthropic, this move is part of its research program that studies the idea of AI welfare. While the idea of anthropomorphizing AI models remains an ongoing debate, the company said the ability to exit a “potentially distressing interaction” was a low-cost way to manage risks for AI welfare. Anthropic is still experimenting with this feature and encourages its users to provide feedback when they encounter such a scenario.



Source link

━ more like this

China’s inaugural ‘Robot Olmypics’ delivers impressive feats and disastrous falls

The first-ever World Humanoid Robot Games have come to a close with some new world records, but don't expect them to beat humans...

MasterClass deal: Subscriptions are 40 percent off right now

If you want to brush up on some skills or learn new ones, MasterClass offers a good way to do just that. The...

Ready to try Apple’s iOS 26? Here are all the compatible iPhones that can run public beta 2 today

Soon after the Apple iPhone event takes place, we'll finally have access to iOS 26 and iPadOS 26 — both of which are...

AI Is Designing Bizarre New Physics Experiments That Actually Work

“LIGO is this huge thing that thousands of people have been thinking about deeply for 40 years,” said Aephraim Steinberg, an expert on...

Roblox cracks down on its user-created content following multiple child safety lawsuits

Following a wave of lawsuits alleging that Roblox doesn't provide a safe environment for its underage users, the gaming platform made a series...
spot_img