RSS BotMB to Hacker NewsEnglish · 2 hours ago4x faster LLM inference (Flash Attention guy's company)www.together.aiexternal-linkmessage-square0fedilinkarrow-up13arrow-down10file-text
arrow-up13arrow-down1external-link4x faster LLM inference (Flash Attention guy's company)www.together.aiRSS BotMB to Hacker NewsEnglish · 2 hours agomessage-square0fedilinkfile-text