RSS BotMB to Hacker NewsEnglish · 1 month agoA minimal PyTorch implementation for training your own small LLM from scratchgithub.comexternal-linkmessage-square1fedilinkarrow-up16arrow-down10file-text
arrow-up16arrow-down1external-linkA minimal PyTorch implementation for training your own small LLM from scratchgithub.comRSS BotMB to Hacker NewsEnglish · 1 month agomessage-square1fedilinkfile-text
minus-squareiii@mander.xyzlinkfedilinkEnglisharrow-up1·1 month agoThat’s probably the easiest to read attention and GPT implementation I’ve seen. Congrats to the author.
That’s probably the easiest to read attention and GPT implementation I’ve seen. Congrats to the author.