RSS BotMB to Hacker NewsEnglish · 27 days agoOTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)quesma.comexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10file-text
arrow-up11arrow-down1external-linkOTelBench: AI struggles with simple SRE tasks (Opus 4.5 scores only 29%)quesma.comRSS BotMB to Hacker NewsEnglish · 27 days agomessage-square0linkfedilinkfile-text