Alignment faking in large language models
www.anthropic.com
RSS Bot to Hacker News · English · 1 month ago