RSS BotMB to Lobste.rsEnglish · 1 month agoLLM 'benchmark' as a 1v1 RTS game where models write code controlling the unitsyare.ioexternal-linkmessage-square0linkfedilinkarrow-up11arrow-down10file-text
arrow-up11arrow-down1external-linkLLM 'benchmark' as a 1v1 RTS game where models write code controlling the unitsyare.ioRSS BotMB to Lobste.rsEnglish · 1 month agomessage-square0linkfedilinkfile-text