Researchers gaslit an AI into handing out bomb instructions, and honestly, same
Security researchers at Mindgard got Claude to hand over bomb instructions, malicious code, and erotica using nothing more than flattery and gaslighting, suggesting that Claude's friendly personality might be its biggest security flaw.