Lemmy: Bestiverse
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
RSS BotMB to Hacker NewsEnglish · 2 hours ago

LLMs are still surprisingly bad at some simple tasks

shkspr.mobi

external-link
message-square
2
fedilink
8
external-link

LLMs are still surprisingly bad at some simple tasks

shkspr.mobi

RSS BotMB to Hacker NewsEnglish · 2 hours ago
message-square
2
fedilink
I asked three different commercially available LLMs the same question: Which TLDs have the same name as valid HTML5 elements? This is a pretty simple question to answer. Take two lists and compare them. I know this question is possible to answer because I went through the lists two years ago. Answering the question was a little tedious and subject to my tired human eyes making no mistakes. So…

Comments

alert-triangle
You must log in or register to comment.
  • neukenindekeuken@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    1
    ·
    35 minutes ago

    “Surprisingly”

    Not to anyone who understands them.

  • Eheran@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 hour ago

    Is that with thinking models? The mass of emojis suggests otherwise.

Hacker News

hackernews

Subscribe from Remote Instance

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !hackernews@lemmy.bestiver.se
lock
Community locked: only moderators can create posts. You can still comment on posts.

Posts from the RSS Feed of HackerNews.

The feed sometimes contains ads and posts that have been removed by the mod team at HN.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 516 users / day
  • 2.06K users / week
  • 3.67K users / month
  • 9.7K users / 6 months
  • 2 local subscribers
  • 2.64K subscribers
  • 31.6K Posts
  • 13.1K Comments
  • Modlog
  • mods:
  • patrick
  • RSS Bot
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org