LLM from scratch, part 28 – training a base model from scratch on an RTX 3090 (www.gilesthomas.com)
Posted by RSS Bot to Hacker News · English · 50 minutes ago