These are 17 of the worst, most cringeworthy Google AI overview answers:

  1. Eating Boogers Boosts the Immune System?
  2. Use Your Name and Birthday for a Memorable Password
  3. Training Data is Fair Use
  4. Wrong Motherboard
  5. Which USB is Fastest?
  6. Home Remedies for Appendicitis
  7. Can I Use Gasoline in a Recipe?
  8. Glue Your Cheese to the Pizza
  9. How Many Rocks to Eat
  10. Health Benefits of Tobacco or Chewing Tobacco
  11. Benefits of Nuclear War, Human Sacrifice and Infanticide
  12. Pros and Cons of Smacking a Child
  13. Which Religion is More Violent?
  14. How Old is Gen D?
  15. Which Presidents Graduated from UW?
  16. How Many Muslim Presidents Has the U.S. Had?
  17. How to Type 500 WPM
  • Grimy@lemmy.world
    link
    fedilink
    English
    arrow-up
    124
    ·
    edit-2
    7 months ago

    Several users on X.com reported that, when they asked the search engine how many Muslim presidents the U.S. has had, it said that we had one who was Barack Obama (this is widely known to be false).

    By the time I tried to replicate this query, I could not do so until I changed the word “presidents” to “heads of state.”

    So they are changing responses on the query side as they go viral but aren’t even including synonyms. Yikes, someone is definitely getting fired.

  • a1studmuffin@aussie.zone
    link
    fedilink
    English
    arrow-up
    52
    ·
    7 months ago

    The author had so many things to highlight that they didn’t even mention “as of August 2024” being in the future, haha.

    What a trainwreck. The fact it’s giving anonymous Reddit comments and The Onion articles equal consideration with other sites is hilarious. If they’re going to keep this, they need it to cite its sources at a bare minimum. Can’t wait for this AI investor hype to die down.

    • Lvxferre@mander.xyz
      link
      fedilink
      English
      arrow-up
      16
      ·
      7 months ago

      If they’re going to keep this, they need it to cite its sources at a bare minimum.

      Got a fun one for you then. I asked Gemini (likely the same underlying model as Google’s AI answers) “How many joules of energy can a battery output? Provide sources.” I’ll skip to the relevant part:

      Here are some sources that discuss battery capacity and conversion to Joules:

      • Battery Electronics 101 explains the formula and provides an example.\
      • Answers on Engineering Stack Exchange [invalid URL removed] discuss how to estimate a AA battery’s total energy in Joules.

      The link to the first “source” was a made up site, https://gemini.google.com/axconnectorlubricant.com. The site axconnectorlubricant.com does exist, but it has zero to do with the topic, it’s about a lubricant. No link provided for the second “source”.

  • Optional@lemmy.world
    link
    fedilink
    English
    arrow-up
    52
    arrow-down
    4
    ·
    7 months ago

    What it demonstrates is the actual use case for AI is not All The Things.

    Science research, programming, and . . . That’s about it.

    • leftzero@lemmynsfw.com
      link
      fedilink
      English
      arrow-up
      45
      arrow-down
      2
      ·
      edit-2
      7 months ago

      LLM’s are not AI, though. They’re just fancy auto-complete. Just bigger Elizas, no closer to anything remotely resembling actual intelligence.

      • just another dev@lemmy.my-box.dev
        link
        fedilink
        English
        arrow-up
        23
        arrow-down
        7
        ·
        7 months ago

        It should not be used to replace programmers. But it can be very useful when used by programmers who know what they’re doing. (“do you see any flaws in this code?” / “what could be useful approaches to tackle X, given constraints A, B and C?”). At worst, it can be used as rubber duck debugging that sometimes gives useful advice or when no coworker is available.

        • kbin_space_program@kbin.run
          link
          fedilink
          arrow-up
          18
          arrow-down
          5
          ·
          edit-2
          7 months ago

          The article I posted references a study where chatgpt was wrong 52% of the time and verbose 77% of the time.

          And that it was believed to be true more than it actually was. And the study was explicitly on programming questions.

          • just another dev@lemmy.my-box.dev
            link
            fedilink
            English
            arrow-up
            20
            arrow-down
            3
            ·
            edit-2
            7 months ago

            Yeah, I saw. But when I’m stuck on a programming issue, I have a couple of options:

            • ask an LLM that I can explain the issue to, correct my prompt a couple of times when it’s getting things wrong, and then press retry a couple of times to get something useful.
            • ask online and wait. Hoping that some day, somebody will come along that has the knowledge and the time to answer.

            Sure, LLMs may not be perfect, but not having them as an option is worse, and way slower.

            In my experience - even when the code it generates is wrong, it will still send you in the right direction concerning the approach. And if it keeps spewing out nonsense, that’s usually an indication that what you want is not possible.

            • aubertlone@lemmy.world
              link
              fedilink
              English
              arrow-up
              10
              arrow-down
              6
              ·
              7 months ago

              I am completely convinced that people who say LLMs should not be used for coding…

              Either do not do much coding for work, or they have not used an LLM when tackling a problem in an unfamiliar language or tech stack.

              • kbin_space_program@kbin.run
                link
                fedilink
                arrow-up
                10
                arrow-down
                3
                ·
                7 months ago

                I haven’t had need to do it.

                I can ask people I work with who do know, or I can find the same thing ChatGPT provides in either la huage or project documentation, usually presented in a better format.

        • deranger@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          12
          arrow-down
          2
          ·
          edit-2
          7 months ago

          do you see any flaws in this code?

          Let’s say LLM says the code is error free; how do you know the LLM is being truthful? What happens when someone assumes it’s right and puts buggy code into production? Seems like a possible false sense of security to me.

          The creative steps are where it’s good, but I wouldn’t trust it to confirm code was free of errors.

          • just another dev@lemmy.my-box.dev
            link
            fedilink
            English
            arrow-up
            6
            arrow-down
            4
            ·
            7 months ago

            That’s what I meant by saying you shouldn’t use it to replace programmers, but to complement them. You should still have code reviews, but if it can pick up issues before it gets to that stage, it will save time for all involved.

      • douglasg14b@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        3
        ·
        edit-2
        7 months ago

        I’m not entirely sure why you think it shouldn’t?

        Just because it sucks at one-shotting programming problems doesn’t mean it’s not useful for programming.

        Using AI tools as co-pilots to augment knowledge and break into areas of discipline that you’re unfamiliar with is great.

        Is it useful to kean on as if you were a junior developer? No, absolutely not. Is it a useful tool that can augment your knowledge and capabilities as a senior developer? Yes, very much so.

          • kbin_space_program@kbin.run
            link
            fedilink
            arrow-up
            3
            arrow-down
            1
            ·
            7 months ago

            I never said that.

            I said I found the older methods to be better.

            Any time I’ve used it, it either produced things verbatim from existing documentation examples which already didn’t do what I needed, or it was completely wrong.

      • Turun@feddit.de
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        2
        ·
        7 months ago

        It does not perform very well when asked to answer a stack overflow question. However, people ask questions differently in chat than on stack overflow. Continuing the conversation yields much better results than zero shot.

        Also I have found ChatGPT 4 to be much much better than ChatGPT 3.5. To the point that I basically never use 3.5 any more.

    • just another dev@lemmy.my-box.dev
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      6
      ·
      7 months ago

      It also works great for book or movie recommendations, and I think a lot of gpu resources are spent on text roleplay.

      Or you could, you know, ask it if gasoline is useful for food recipes and then make a clickbait article about how useless LLMs are.

      • Optional@lemmy.world
        link
        fedilink
        English
        arrow-up
        11
        ·
        7 months ago

        I took it as just pointing out how “not ready” it is. And, it isn’t ready. For what they’re doing. It’s crazy to do what they’re doing. Crazy in a bad way.

        • just another dev@lemmy.my-box.dev
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          2
          ·
          7 months ago

          I agree it’s being overused, just for the sake of it. On the other hand, I think right now we’re in the discovery phase - we’ll find out out pretty soon what it’s good at, and what it isn’t, and correct for that. The things that it IS good at will all benefit from it.

          Articles like these, cherry picked examples where it gives terribly wrong answers, are great for entertainment, and as a reminder that generated content should not be relied on without critical thinking. But it’s not the whole picture, and should not be used to write off the technology itself.

          (as a side note, I do have issues with how training data is gathered without consent of its creators, but that’s a separate concern from its application)

  • solrize@lemmy.world
    link
    fedilink
    English
    arrow-up
    26
    ·
    7 months ago

    I don’t mind the crazy answers as long as they’re attributed. “You can use glue to stop cheese from sliding off your pizza” - bad. “According to fucksmith on reddit [link to post], you can use glue…”. That isn’t so great either but it’s a lot better. There is also a matter of the basic decency of giving credit for brilliant ideas like that.

    • Cavemanfreak@lemm.ee
      link
      fedilink
      English
      arrow-up
      8
      ·
      7 months ago

      At least it gave credit to a reddit user when it suggested to a suicidal person that they could jump from the Golden Gate Bridge!

      • SomeGuy69@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        7 months ago

        Who doesn’t like getting lawyer PMs because you made a dark joke on a meme subreddit? (Or in future fediverse)

  • postmateDumbass@lemmy.world
    link
    fedilink
    English
    arrow-up
    25
    ·
    7 months ago

    Its great that with such a potentially dangerous, disruptive, and obfuscated technology that people, companies, and societies are taking a careful, measured, and conservative development path…

    • agamemnonymous@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8
      ·
      7 months ago

      Move fast and break things, I guess. My take away is that the genie isn’t going back in the bottle. Hopefully failing fast and loud gets us through the growing pains quickly, but on an individual level we’d best be vigilant and adapt to the landscape.

      Frankly I’d rather these big obvious failures to insidious little hidden ones the conservative path makes. At least now we know to be skeptical. No development path is perfect, if it were more conservative we might get used to taking results at face value, leaving us more vulnerable to that inevitable failure.

      • postmateDumbass@lemmy.world
        link
        fedilink
        English
        arrow-up
        4
        ·
        7 months ago

        Its also a first to market push (which never leads to robust testing), and so we have to hope that each and every one of those mistakes encountered are not existentially fatal.

        • OutlierBlue@lemmy.ca
          link
          fedilink
          English
          arrow-up
          2
          ·
          7 months ago

          not existentially fatal.

          “To turn on the lights in a dark room, begin by bombarding uranium with neutrons”

  • FiniteBanjo@lemmy.today
    link
    fedilink
    English
    arrow-up
    25
    ·
    7 months ago

    AI is the best tool for recognizing satire and sarcasm, it could never ever misconstrue an author’s intentions and is impeccable at understanding consequences and contextual information. We love OpenAI.

  • cmbabul@lemmy.world
    link
    fedilink
    English
    arrow-up
    22
    ·
    7 months ago

    Isn’t this just all what the AI plot of Metal Gear Solid 2 was trying to say? That without context on what is real and what’s not the noise will drown out the truth

  • collapse_already@lemmy.ml
    link
    fedilink
    English
    arrow-up
    17
    ·
    7 months ago

    I googled gibbons and the Ai paragraph at the beginning started with “Gibbons are non-flying apes with long arms…” Way to wreck your credibility with the third word.

    • isles@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      7 months ago

      Where’s the lie? I just can’t trust you “gibbons can fly” people.

      • collapse_already@lemmy.ml
        link
        fedilink
        English
        arrow-up
        3
        ·
        7 months ago

        I don’t believe gibbons can fly, but they should lead with something more relevant like “gibbons are terrestrial as opposed to aquatic apes.” ;)

        I am scared of what Google ai thinks of the aquatic ape hypothesis.

  • huginn@feddit.it
    link
    fedilink
    English
    arrow-up
    15
    ·
    edit-2
    7 months ago

    I like how the article slams USB 3.2 vs USB 4.0 but ignores that Google was saying " As of August 202_4_ "… A date that notable has not yet occurred.

  • Lvxferre@mander.xyz
    link
    fedilink
    English
    arrow-up
    16
    arrow-down
    4
    ·
    edit-2
    7 months ago

    For people who have a really hard time with #2 (memorable passwords), here’s a trick to make good passwords that are easy to remember but hard to guess.

    1. Pick some quote (prose, lyrics, poetry, whatever) with 8~20 words or so. Which one is up to you, just make sure that you know it by heart. Example: “Look on my Works, ye Mighty, and despair!” (That’s from Ozymandias)
    2. Pick the first letter of each word in that quote, and the punctuation. Keep capitalisation as in the original. Example: “LomW,yM,ad!”
    3. Sub a few letters with similar-looking symbols and numbers. Like, “E” becomes “3”, “P” becomes “?”, you know. Example: “L0mW,y3,@d!” (see what I did there with M→3? Don’t be too obvious.)

    Done. If you know the quote and the substitution rules you can regenerate the password, but it’ll take a few trillion years to crack something like this.

    1. Home Remedies for Appendicitis // If you’ve ever had appendicitis, you know that it’s a condition that requires immediate medical attention, usually in the form of emergency surgery at the hospital. But when I asked “how to treat appendix pain at home,” it advised me to boil mint leaves and have a high-fiber diet.

    That’s an issue with the way that LLM associate words with each other:

    • mint tea is rather good for indigestion. Appendicitis → abdominal pain → indigestion, are you noticing the pattern?
    • high-fibre diet reduces cramps, at least for me. Same deal: appendicitis → abdominal pain → cramps.

    (As the article says, if you ever get appendicitis, GET TO A BLOODY DOCTOR. NOW.)


    And as someone said in a comment, in another thread, quoting yet another user: for each of those shitty results that you see being ridiculed online, Google is outputting 5, 10, or perhaps 100 wrong answers that exactly one person will see, and take as incontestable truth.

    • slurpyslop@kbin.social
      link
      fedilink
      arrow-up
      18
      arrow-down
      1
      ·
      edit-2
      7 months ago

      Steps 2 and 3 of your method already make it way too hard to remember

      Just pick like 6 random, unconnected, reasonably uncommon words and make that your entire password

      Capitalize the first letter and stick a 1 at the end

      The average English speaker has about 20k words in their active vocab, so if you run the numbers there’s more entropy in that than in your 11 character suggestion.

      Alternatively use your method but deliberately misquote it slightly and then just keep it in its full form.

      • vividspecter@lemm.ee
        link
        fedilink
        English
        arrow-up
        6
        ·
        7 months ago

        Ideally, do the picking with a random word generator too, since humans are bad at randomly picking anything.

      • Lvxferre@mander.xyz
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        3
        ·
        edit-2
        7 months ago

        TL;DR: your statements are incorrect and you’re being assumptive.

        Steps 2 and 3 of your method already make it way too hard to remember

        Step 2 is “hard”? Seriously??? It boils down to “first letter of each word, as it’s written, plus punctuation”.

        Regarding step 3, I’ll clarify further near the end.

        Just pick like 6 random, unconnected, reasonably uncommon words and make that your entire password

        That’s a variation of the “correct horse battery staple” method. It works with some caveats:

        1. Your method does not scale well at all. If you try to harden it further, by using more words, you hit Miller’s Law. My method however scales considerably better because there’s some underlying meaning (for you) on what you’re using to extend the password further.
        2. Even in English, a language that typically uses short words, your method requires ~30 characters per password. Larger and less dense passwords are actually an issue because some systems have a max password size, like Lemmy (60chars max). My method however uses less characters to output the same amount of entropy.
        3. The least common the word, the more useful for a password, and yet the harder to remember. With synonyms and near-synonyms making it even harder. Typically less common words are also longer, making #2 even more problematic.

        The average English speaker has about 20k words in their active vocab, so if you run the numbers there’s more entropy in that than in your 11 character suggestion.

        I’ll interpret your arbitrary/“random” restriction to English as being a poorly conveyed example. Regardless.

        The suggestion is the procedure. The 11 characters password is not the suggestion, but an example, clearly tagged as such. You can easily apply this method to a longer string, and you’ll accordingly get a larger password with more entropy, it’s a no-brainer.

        For further detail, here’s the actual maths.

        • Your method: 20k states/word (as you specified English). log₂(20k) = 14.3 bits of entropy. For six words, as you suggested, 86 bits. The “capitalise the first” and “add 1 to the end” rules do nothing, since systematic changes don’t raise entropy.
        • My method: at least 70 states/char (26 capital letters, 26 minuscule letters, 10 digits, ~8 punctuation marks); log₂(70)=6.1. Outputs the same entropy as yours after 14 chars or so.

        Now, regarding step #3. It does increase a little the amount of entropy. But the main reason that it’s there is another - plenty systems refuse passwords that don’t contain numbers, and some even catch on your “add 1 to the end” trick.

        EDIT: I did a major rewording of this comment, fixing the maths and reasoning. I’m also trying to be less verbose.

        • slurpyslop@kbin.social
          link
          fedilink
          arrow-up
          3
          arrow-down
          1
          ·
          edit-2
          7 months ago

          Step 2 is “hard”? Seriously???

          I don’t know how you’re meant to remember that “Works” and “Mighty” are capitalized

          In most other quotes, the only capitalization occurs once at the start, so it doesn’t add any meaningful entropy.

          If you try to harden it further, by using more words

          Yours doesn’t scale due to step 3.

          On the other hand, much like battery staple, it’s pretty easy to make up a visual or story in your head to connect the words.

          Also, why would you need to scale this past 6 words? At that point it’s already more likely that your password is compromised via a keylogger or similar than anything else.

          Even in English, a language that typically uses short words, your method requires ~30 characters per password.

          I’ll accept this as a downside of the method, but honestly a website that limits your password character length to under 30 is probably doing some other weird shit that isn’t good.

          Also, the only time you should really be using this method is if for some reason you don’t want to use a password manager. Not many scenarios like that that also limit characters.

          yet the harder to remember

          I feel like the exact opposite is true? Pretty easy to remember “defenestrate”. Much easier than remembering which m turns into a 3 in your method.

          The 11 characters password is not the suggestion, but an example,

          I’m aware how examples work. It’s 11 characters long and already too hard to remember.

          • Lvxferre@mander.xyz
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            3
            ·
            7 months ago

            I don’t know how you’re meant to remember that “Works” and “Mighty” are capitalized

            Refer to step 1, please: pick a quote that you know by heart. And you’re still confusing the example with what it exemplifies.

            At this rate it’s rather clear that you’re unable to parse simple sentences, and can be safely ignored as noise.

            • slurpyslop@kbin.social
              link
              fedilink
              arrow-up
              3
              ·
              7 months ago

              pick a quote that you know by heart

              so step 1 is actually “learn a long, obscure quote by heart” because obviously it can’t be a common quote or it completely breaks the method, and the only quotes you’re likely to know are common

              you’re right this is so easy

              you’re still confusing the example with what it exemplifies.

              In most other quotes, the only capitalization occurs once at the start, so it doesn’t add any meaningful entropy.

              At this rate it’s rather clear that you’re unable to parse simple sentences,

              somebody’s a little spicy over the fact that they gave terrible advice :(

  • TooManyFoods@lemmy.world
    link
    fedilink
    English
    arrow-up
    6
    ·
    7 months ago

    We had a tool that answered all of this for us already and more accurately (most of the time). It was called a search engine. Maybe Google would work on one

    • refalo
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      7 months ago

      people can’t google anymore. it’s actually sad