• Gsus4@mander.xyz
      link
      fedilink
      English
      arrow-up
      3
      ·
      18 hours ago

      yeah, what happened to deepseek, I havent seen it much in the news lately

      • nymnympseudonym@piefed.social
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        1 hour ago

        Thing is Deepseek didn’t have any new technology insights or “special sauce”

        They just took all the current best practices at the time (high quality machine curated data sets, MoE architecture, etc) and did them as fully and rigorously as possible

        It’s not like they invented chain of thought/Large Reasoning Model or state-space or anything new at all