Pro@programming.dev to Technology@lemmy.world · English · 1 day ago
Anthropic tested Claude's (LLM, AI chatbot) ability to manage a physical “storefront” to mixed results, as the AI struggled with pricing strategy and inventory management
www.anthropic.com · cross-posted to: hackernews
Womble@lemmy.world · English · 1 day ago
I doubt anyone expected it to work completely, but it is interesting to see to what extent it worked and how it failed (hallucinations and sycophancy).
A_norny_mousse@feddit.org · English · 1 day ago
True; I just hate headlines that ask stupid questions. But then again, such attempts always rest on the premise that it could work, which annoys me no less.