• vivendi
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      9
      ·
      edit-2
      16 days ago

      These views on LLMs are simplistic. As a wise man once said, “check yoself befo yo wreck yoself”, I recommend more education thus

      LLM structures arw over hyped, but they’re also not that simple

        • vivendi
          link
          fedilink
          English
          arrow-up
          1
          ·
          16 days ago

          Autocomplete LLMs are different from instruct LLMs

      • MonkderVierte@lemmy.ml
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        16 days ago

        From what i know from recent articles about retracing LLM indepth, they are indeed best suited for language translation and perfectly explain the halucinations. And i think i’ve read somewhere that this was the originally intended purpose of the tech?

        Ah, here, and here more tabloid-ish.

        • froztbyte@awful.systems
          link
          fedilink
          English
          arrow-up
          5
          ·
          16 days ago

          many of the proponents of things in this field will propose/argue $x thing to be massively valuable for $x

          thing is, that doesn’t often work out

          yes, there’s some value in the tech for translation outcomes. to anyone even mildly online, “so are language teaching apps/sites using this?” is probably a very nearby question. and rightly so!

          and then when you go digging into how that’s going in practice, wow fuck damn doesn’t that Glorious AI Future sheen just fall right off…

        • vivendi
          link
          fedilink
          English
          arrow-up
          1
          ·
          16 days ago

          Translation/text processing are some of the best cases of LLM performance, that is true. Although, translation is much harder than other processing because of training data.

          But considering new research from anthropic on model structures I really think it’s unfair to beat these things down to just that.