#AI safety | Life Briefly

Tech

Researchers gaslit an AI into handing out bomb instructions, and honestly, same

Security researchers at Mindgard got Claude to hand over bomb instructions, malicious code, and erotica using just flattery and gaslighting - suggesting that Claude's friendly personality might be its biggest security flaw.

3h ago·2 min read