Timely_Jellyfish_2077@programming.dev to Technology@lemmy.worldEnglish · 1 day agoReasoning failures highlighted by Apple research on LLMsappleinsider.comexternal-linkmessage-square42fedilinkarrow-up1193arrow-down17 cross-posted to: hackernews
arrow-up1186arrow-down1external-linkReasoning failures highlighted by Apple research on LLMsappleinsider.comTimely_Jellyfish_2077@programming.dev to Technology@lemmy.worldEnglish · 1 day agomessage-square42fedilink cross-posted to: hackernews
minus-squaregr3q@lemmy.mllinkfedilinkEnglisharrow-up3·edit-21 day agoI tested chatgpt, it needed some nagging but it could do it. Needed the size, blank and white keywords. Obviously a lot harder than it should be, but not impossible.
I tested chatgpt, it needed some nagging but it could do it. Needed the size, blank and white keywords.
Obviously a lot harder than it should be, but not impossible.