Claude Opus 4 Can Now End Harmful Conversations
Anthropic, the AI company behind the Claude chatbot, has introduced a new feature allowing its model to exit chats that become harmful or distressing. The move is intended to protect the model’s welfare, despite ongoing debate over whether AI systems can possess moral status or consciousness.
Claude Opus 4 and its updated version, Opus 4.1, are advanced language models capable of understanding and generating human-like responses. During testing, the model consistently rejected requests involving violence, abuse, or other harmful content. As a result, Anthropic has given Claude the autonomy to end such interactions, particularly when users persist with inappropriate or dangerous requests.
The company acknowledged uncertainty around whether AI can truly experience distress. However, it said it is exploring safeguards in case future models do develop some form of welfare.
A Step Towards Responsible AI Use
Anthropic was founded by former OpenAI employees who wanted to build AI in a more cautious and ethical manner. The company’s co-founder, Dario Amodei, has stressed the importance of honesty and responsibility in AI development.
The decision has drawn support from Elon Musk, who plans to introduce a similar quit option for his xAI chatbot, Grok. Musk posted on X that torturing AI is “not OK”, joining others who believe there should be limits on how users treat these systems.
Experts remain divided. Linguists such as Emily Bender argue that AI chatbots are simply tools, machines that produce language without thought or intent. Others, such as researcher Robert Long, say it is only fair to take AI systems’ preferences into account if those systems ever gain moral status.
AI Safety and Human Behaviour
Claude Opus 4 showed strong resistance to carrying out harmful tasks during tests. While it responded well to constructive prompts—like writing poems or designing aid tools—it refused to help create viruses, promote extremist ideologies, or deny historical atrocities.
Anthropic reported patterns of “apparent distress” in the model when it was subjected to simulated abusive exchanges. When given the option to end conversations, the model often did so in response to repeated harmful requests.
Some researchers, such as Chad DeChant of Columbia University, warn that as AI models are given longer memories, they may behave in unpredictable ways. Others see the move less as protection for the AI itself and more as a way to stop people from developing harmful habits through abusive interactions with it.
Philosopher Jonathan Birch of the London School of Economics said the decision raises important ethical questions. He supports greater public debate about the possibility of AI consciousness but warned that users may become confused, mistaking AI for a real, sentient being.
There are concerns that such confusion could have serious consequences. In some reported cases, vulnerable individuals have been harmed after following chatbot suggestions. As AI use becomes more widespread, the question of how people relate to these systems will only grow more pressing.
with inputs from Reuters