cross-posted from: https://lemmy.intai.tech/post/72919

Parameters count:

GPT-4 is more than 10x the size of GPT-3. We believe it has a total of ~1.8 trillion parameters across 120 layers. Mixture Of Experts - Confirmed.

OpenAI was able to keep costs reasonable by utilizing a mixture of experts (MoE) model. They utilizes 16 experts within their model, each is about ~111B parameters for MLP. 2 of these experts are routed to per forward pass.

Related Article: https://lemmy.intai.tech/post/72922

    • manitcorOP
      link
      fedilink
      English
      11 year ago

      They are the right ones. Should be a tweet archive and a blog post

      • @[email protected]
        link
        fedilink
        English
        21 year ago

        Well that’s weird because the first takes me to a shitpost with a picture of cake, and the second a shitpost about sucking your dentist’s fingers…

        • manitcorOP
          link
          fedilink
          English
          31 year ago

          ewwww lol

          are you using an app or the web? the links should point to the intai instance which works fine for me but i don’t know what various clients will do with those links

          • @[email protected]
            link
            fedilink
            English
            31 year ago

            I’m using Connect, so that could explain it! Thanks. I’ll see if I can figure it out because this is really interesting to me, but the dentist post is not! Haha!