The Picard Maneuver@lemmy.world to People Twitter@sh.itjust.works · 1 month agoIt's best not to dwell on itlemmy.worldimagemessage-square147linkfedilinkarrow-up11.88Karrow-down16
arrow-up11.87Karrow-down1imageIt's best not to dwell on itlemmy.worldThe Picard Maneuver@lemmy.world to People Twitter@sh.itjust.works · 1 month agomessage-square147linkfedilink
minus-squaremelpomenesclevage@lemmy.dbzer0.comBannedlinkfedilinkEnglisharrow-up15·1 month agothat depends on what topic you know and how well you know it.
minus-squaretaladar@sh.itjust.workslinkfedilinkarrow-up10·edit-21 month agoLLMs are actually pretty good for looking up words by their definition. But that is just about the only topic I can think of where they are correct even close to 80% of the time.
minus-squaremelpomenesclevage@lemmy.dbzer0.comBannedlinkfedilinkEnglisharrow-up0·1 month agoyeah. some things I’d be shocked if they were correct 1% of the time. some things, like that, I might expect them to be correct about 80%, yeah.
40% seems low
that depends on what topic you know and how well you know it.
LLMs are actually pretty good for looking up words by their definition. But that is just about the only topic I can think of where they are correct even close to 80% of the time.
yeah. some things I’d be shocked if they were correct 1% of the time. some things, like that, I might expect them to be correct about 80%, yeah.