• Coriza@lemmy.world · 1 day ago

    You may already know this, but just to make it clear for other readers: it is impossible for an LLM to behave as described. What an LLM algorithm does is generate stuff. It does not search, it does not sort, it only makes stuff up. Nothing can be done about that, because an LLM is a specific type of algorithm, and that is what the program does. Sure, you can train it on good-quality data and only real cases and such, but it will still make stuff up by mixing all the training data together. The same mechanism that lets it “find” relationships in its training data is the one that generates nonsense.
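
    Concretely, here is the whole trick as a toy sampling loop (a sketch only; `next_token_distribution` is a hypothetical stand-in for the real network’s forward pass over frozen weights):

    ```python
    # Autoregressive generation: repeatedly sample a next token from a
    # probability distribution. There is no lookup, search, or sort over
    # facts anywhere in the loop.
    import random

    VOCAB = ["the", "cat", "sat", "on", "mat", "<eos>"]

    def next_token_distribution(context: list[str]) -> list[float]:
        # Stub: the real model computes this from its frozen weights.
        weights = [random.random() for _ in VOCAB]
        total = sum(weights)
        return [w / total for w in weights]

    def generate(prompt: list[str], max_tokens: int = 10) -> list[str]:
        out = list(prompt)
        for _ in range(max_tokens):
            probs = next_token_distribution(out)
            token = random.choices(VOCAB, weights=probs)[0]  # sample, never look up
            if token == "<eos>":
                break
            out.append(token)
        return out

    print(generate(["the", "cat"]))
    ```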

    • tetris11@feddit.uk · 1 day ago

      But you can feed real search data in as a prompt and use its training to summarize it (or it can fill its own prompt automatically from an automated search). Rough sketch below.

      It won’t/can’t update its priors, and I agree with you there, but it can produce novel output on a novel prompt with its existing model/weights.
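
      A minimal sketch of that flow, with hypothetical `web_search` and `llm_complete` helpers stubbed so it runs (neither is any real library’s API):

      ```python
      # Fetch real documents outside the model, stuff them into the prompt,
      # and let the frozen model summarize. The weights never change; the
      # new information lives entirely in the prompt.

      def web_search(query: str, top_k: int = 5) -> list[str]:
          # Stub standing in for a call to a real search API.
          return [f"snippet {i} about {query!r}" for i in range(top_k)]

      def llm_complete(prompt: str) -> str:
          # Stub standing in for a call to a hosted or local model.
          return "summary based on: " + prompt[:80]

      def answer_with_search(question: str) -> str:
          snippets = web_search(question)  # real data, fetched outside the model
          context = "\n\n".join(snippets)
          prompt = (
              "Using only the sources below, answer the question.\n\n"
              f"Sources:\n{context}\n\nQuestion: {question}"
          )
          return llm_complete(prompt)

      print(answer_with_search("what did the report actually say?"))
      ```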

    • MiddleAgesModem@lemmy.world · 1 day ago

      Whole lot of unsupported assumptions and falsehoods here.

      A standalone model predicts tokens, yes, but deployed LLM systems retrieve real documents, rank/filter the results, and call search engines. Anyone who has actually used these tools would know it’s not just “making stuff up”.

      It both searches and sorts; see the sketch below.
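
      Rough sketch of that loop, with a toy keyword-overlap scorer standing in for BM25 or embedding similarity (illustrative only, not any particular library’s API):

      ```python
      # Retrieve -> rank/filter -> generate: the searching and sorting happen
      # in ordinary code around the model; the model only generates from
      # whatever ends up in its prompt.

      def score(query: str, doc: str) -> int:
          # Toy relevance score: count of shared lowercase words.
          return len(set(query.lower().split()) & set(doc.lower().split()))

      def retrieve(query: str, corpus: list[str], top_k: int = 3) -> list[str]:
          ranked = sorted(corpus, key=lambda d: score(query, d), reverse=True)  # sort
          return [d for d in ranked[:top_k] if score(query, d) > 0]             # filter

      docs = [
          "LLMs generate text one token at a time.",
          "retrieval pipelines fetch documents from a search index",
          "Bananas are rich in potassium.",
      ]
      print(retrieve("how do retrieval pipelines fetch documents", docs))
      ```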

      In short, you have no fucking idea what you’re talking about.