When DeepSeek V4 and R2? (image post)
pepperfree@sh.itjust.works to LocalLLaMA@sh.itjust.works · English · 2 months ago · 26 comments
veroxii@aussie.zone · English · 2 months ago
Or maybe Facebook data is even worse than Twitter?
pepperfree@sh.itjust.works (OP) · English · 2 months ago
Llama 3.3 was good, though. As for multimodal, Llama 4 also uses the Llama 3.2 approach, where image and text are built into a single model instead of using CLIP or SigLIP.