> In our content, we don’t generally worry that someone is reading our inner dia...

tharant on Dec 19, 2024 \| on: Alignment faking in large language models

> In our content, we don’t generally worry that someone is reading our inner dialogue…

Really? Is that what’s wrong with me? ¯\_(ツ)_/¯