• impure9435
    link
    fedilink
    23218 days ago

    The thing that I find the most funny about this post, is the fact that you call this Italian

    • @[email protected]
      link
      fedilink
      52
      edit-2
      18 days ago

      Typical AI behavior

      Edit: and then it will gaslight you if you say the answer is the same.

      • @[email protected]
        link
        fedilink
        1518 days ago

        Fucking hate when do that.

        You are repeating the same mistake.

        I’m sorry for repeating the same mistake, here’s a new solution with corrections *proceed to write the exactly thing already told it was wrong*

    • @[email protected]
      link
      fedilink
      417 days ago

      Gotta remember they were trained off of the internet. Which is to say the largest body of people loadly professing the opinions are fact and refusing to say otherwise.

  • @abrahambelch
    link
    7818 days ago

    Which language uses these signs? It truly looks like some kind of alien language

    • chapapa
      link
      fedilink
      129
      edit-2
      18 days ago

      Glagolitic script. Oldest known Slavic alphabet according to Wikipedia.

      • velox_vulnus
        link
        fedilink
        English
        2818 days ago

        They should revive this script. I like it more than Cyrillic.

    • @[email protected]
      link
      fedilink
      3418 days ago

      I found it! its the Glagolitic script used in the 9th century before Cyrillic took over:

      ⰀⰁⰂⰃⰄⰅⰆⰇⰈⰉⰊⰋⰌⰍⰎⰏⰐⰑⰒⰓⰔⰕⰖⰗⰘⰙⰚⰛⰜⰝⰞⰟⰠⰡⰢⰣⰤⰥⰦⰧⰨⰩⰪⰫⰬⰭⰮⰰⰱⰲⰳⰴⰵⰶⰷⰸⰹⰺⰻⰼⰽⰾⰿⱀⱁⱂⱃⱄⱅⱆⱇⱈⱉⱊⱋⱌⱍⱎⱏⱐⱑⱒⱓⱔⱕⱖⱗⱘⱙⱚⱛⱜⱝⱞ
      
    • Sunoc
      link
      fedilink
      918 days ago

      I would like to know too! Never saw that writing system before.

      • @[email protected]
        link
        fedilink
        418 days ago

        No that looks like

        ⌶⌷⌸⌹⌺⌻⌼⌽⌾⌿⍀⍁⍂⍃⍄⍅⍆⍇⍈⍉⍊⍋⍌⍍⍎⍏⍐⍑⍒⍓⍔⍕⍖⍗⍘⍙⍚⍛⍜⍝⍞⍟⍠⍡⍢⍣⍤⍥⍦⍧⍨⍩⍪⍫⍬⍭⍮⍯⍰⍱⍲⍳⍴⍵⍶⍷⍸⍹⍺
        
  • @[email protected]
    link
    fedilink
    6418 days ago

    This might be happening because of the ‘elegant’ (incredibly hacky) way openai encodes multiple languages into their models. Instead of using all character sets, they use a modulo operator on each character, to make all Unicode characters represented by a small range of values. On the back end, it somehow detects which language is being spoken, and uses that character set for the response. Seeing as the last line seems to be the same mathematical expression as what you asked, my guess is that your equation just happened to perfectly match some sentence that would make sense in the weird language.

        • @[email protected]
          link
          fedilink
          116 days ago

          Seriously? Python for massive amounts of data? It’s a nice scripting language, but it’s excruciatingly slow

          • @[email protected]
            link
            fedilink
            416 days ago

            There are bindings in java and c++, but python is the industry standard for AI. The libraries for machine learning are actually written in c++, but use python language bindings. Python doesn’t tend to slow things down since machine learning is gpu-bound anyway. There are also library specific programming languages which urges the user to make pythonic code that can be compiled into c++.

    • @[email protected]
      link
      fedilink
      1617 days ago

      I suppose it’s conceivable that there’s a bug in converting between different representations of Unicode, but I’m not buying and of this “detected which language is being spoken” nonsense or the use of character sets. It would just use Unicode.

      The modulo idea makes absolutely no sense, as LLMs use tokens, not characters, and there’s soooooo many tokens. It would make no sense to make those tokens ambiguous.

      • @[email protected]
        link
        fedilink
        717 days ago

        I completely agree that it’s a stupid way of doing things, but it is how openai reduced the vocab size of gpt-2 & gpt-3. As far as I know–I have only read the comments in the source code– the conversion is done as a preprocessing step. Here’s the code to gpt-2: https://github.com/openai/gpt-2/blob/master/src/encoder.py I did apparently make a mistake, as the vocab reduction is done through a lut instead of a simple mod.

  • Redex
    link
    fedilink
    6218 days ago

    Damn, wild Glagolitic script found. I didn’t even realise it was in the Unicode standard.

  • Vitaly
    link
    fedilink
    3418 days ago

    It looks so badass, I could have used that script now because im Ukrainian but instead I have cyrillic script which is so boring

    • Match!!
      link
      fedilink
      English
      518 days ago

      rebel against Russian imperialism, return to glagolitic

      • Vitaly
        link
        fedilink
        4
        edit-2
        18 days ago

        It’s not russian, If my bulgarian friend is right then it was created by a bulgarian guy

        • @TwilightKiddy
          link
          418 days ago

          There is no single person responsible for Cyrillic script. It is mostly believed to be created by mixing and changing Greek and Glagolic scripts by the scholars of Preslav Literary School, which was indeed in Bulgaria. After a while, Peter the Great changed it a lot. And then Stalin stomped out almost all the deviations in the usage of the script.

          The last part is mostly why it is considered Russian. A lot of languages suffered because of Moscow just forcing them to use the version of Cyrillic that Russians were using.

      • @[email protected]
        link
        fedilink
        2
        edit-2
        17 days ago

        Cyrillic is literally greek+glagolitic and it was partly a diplomatic creation of the Eastern Roman Empire(aka Byzantine Empire), in order to bring the slavs culturally closer to them.

        Russians have nothing to do with it, other than them claiming they are the continuation of Eastern Roman Empire, something which is kinda laughable but whatever dont let your dreams be dreams.

  • I Cast Fist
    link
    1617 days ago

    Title mentions speaking italian

    Not a single hand gesture anywhere

    I’ve been duped

  • @[email protected]
    link
    fedilink
    English
    1317 days ago

    You may not understand, but we do.
    Questo segreto rimarrà custodito gelosamente dalla stirpe italica. ◉‿◉