pepperfree@sh.itjust.works to LocalLLaMA@sh.itjust.worksEnglish · 1 month agoWhen DeepSeek V4 and R2?sh.itjust.worksimagemessage-square26fedilinkarrow-up1192arrow-down16
arrow-up1186arrow-down1imageWhen DeepSeek V4 and R2?sh.itjust.workspepperfree@sh.itjust.works to LocalLLaMA@sh.itjust.worksEnglish · 1 month agomessage-square26fedilink
minus-squareveroxii@aussie.zonelinkfedilinkEnglisharrow-up5·1 month agoOr maybe Facebook data is even worse than Twitter?
minus-squarepepperfree@sh.itjust.worksOPlinkfedilinkEnglisharrow-up1arrow-down1·1 month agoLlama 3.3 was good, tho. For the multimodal, llama 4 also use llama3.2 approach where the image and text is made into single model instead using CLIP or siglip.
Or maybe Facebook data is even worse than Twitter?
Llama 3.3 was good, tho. For the multimodal, llama 4 also use llama3.2 approach where the image and text is made into single model instead using CLIP or siglip.