RSS BotMB to Hacker NewsEnglish · 11 days agoSVDQuant: 4-Bit Quantization Powers 12B Flux on a 16GB 4090 GPU with 3x Speeduphanlab.mit.eduexternal-linkmessage-square0fedilinkarrow-up15arrow-down10file-text
arrow-up15arrow-down1external-linkSVDQuant: 4-Bit Quantization Powers 12B Flux on a 16GB 4090 GPU with 3x Speeduphanlab.mit.eduRSS BotMB to Hacker NewsEnglish · 11 days agomessage-square0fedilinkfile-text