• @[email protected]
      link
      fedilink
      1173 months ago

      I had to prepare a high level report to a senior manager last week regarding a project my team was working on.

      We had to make 5 professional recommendations off of data we reported.

      We gave the 5 recommendations with lots of evidence and references to why we came to that decision.

      The top question we got was: “What are ChatGPT’s recommendations?”

      Back to the drawing board this week because LLMs are more credible than teams of professionals with years of experience and bachelor-masters level education on the subject matter.

      • @[email protected]
        link
        fedilink
        883 months ago

        It is quite terrifying that people think these unoriginal and inaccurate regurgitators of internet knowledge, with no concept of or heuristic for correctness… are somehow an authority on anything.

        • @[email protected]
          link
          fedilink
          633 months ago

          All you need to succeed on this planet is the self confidence to say things. It literally does not matter the accuracy. It’s how you express it. I wish I knew this when I was younger. I’d cut out all the imposter syndrome that held me back.

          • @[email protected]
            link
            fedilink
            173 months ago

            I wish it was that easy. If you go too long it’s boring, and if you’re too confident you sound arrogant. At this point I’ve kind of just accepted there are people who can sell, and that I’m not one of those people.

          • @[email protected]
            link
            fedilink
            63 months ago

            I think this depends on the crowd. Unfortunately, the intelligent crowd and the crowd with money and power is not exactly the same. Though hopefully there is overlap.

          • @[email protected]
            link
            fedilink
            English
            33 months ago

            You, and we, are better off for it.

            The issue is that it’s been forgot (Remember the 5th of November)

        • Flax
          link
          fedilink
          English
          63 months ago

          Only thing you need to do to realise how bad they are is to play Chess against it. Vs using a chessbot from 30 years ago, it really shows.

      • rutellthesinful
        link
        fedilink
        353 months ago

        you fool

        “these are chatgpt’s recommendations we just provided research to back them up and verify the ai’s work”

        • @[email protected]
          link
          fedilink
          English
          263 months ago

          “What do we pay you guys for then? You are all fired and Tummy the intern will do everything with ChatGPT from here on out!”

          • @[email protected]
            link
            fedilink
            223 months ago

            You joke but several sections of our HR department got cut and replaced with Enterprise GPT-4. We talk to an internal chatbot now about HR questions and some forms.

            • @MagicShel
              link
              203 months ago

              You should see if you can get it to hallucinate a pay raise or 3 months vacation.

              • @[email protected]
                link
                fedilink
                183 months ago

                It did the opposite lmao. I asked it what my vacation leave was because you need to verify leave amounts before you’re allowed to request any additional leave. It said I had 0 in my balance and I know for a fact I have at least a week left 🤪 took almost a month to sort it out. Had to provide balance screenshots and everything. I’d be probably fucked if I hadn’t manually screenshot my leave amounts beforehand.

                • @MagicShel
                  link
                  153 months ago

                  You work for a crazy company, my friend.

                • Flax
                  link
                  fedilink
                  English
                  123 months ago

                  Why can’t they just use a simple calendar app system where you book it off??? Who would use a large language model for that rubbish?

            • @[email protected]
              link
              fedilink
              English
              183 months ago

              That is the least worst implementation!

              I knew one HR person who cared about employees and did her best to help out. She only lasted 6 months.

          • @MagicShel
            link
            93 months ago

            That’s when you drop trou, bend over, spread the cheeks, and ask them to let you know when they’re done reviewing ChatGPT’s “research”.

      • @[email protected]
        link
        fedilink
        English
        193 months ago

        “It came up with more or less the same recommendations. Though it didn’t fully understand the specific target goals of your project, so our recommendations are more complete and actionable ready.”

      • @[email protected]
        link
        fedilink
        93 months ago

        I think this points to a large problem in our society is how we train and pick our managers. Oh wait we don’t. They pick us.

      • @[email protected]
        link
        fedilink
        83 months ago

        I mean, as long as you are the one prompting ChatGPT, you can probably get it to spit out the right recommendations. Works until they fire you because they are convinced AI made you obsolete.

    • @[email protected]
      link
      fedilink
      153 months ago

      AI cars are still running over pedestrians and people think computers are to the point of medical diagnosis?

      • @[email protected]
        link
        fedilink
        273 months ago

        There are some very impressive AI/ML technologies that are already in use as part of existing medical software systems (think: a model that highlights suspicious areas on an MRI, or even suggests differential diagnoses). Further, other models have been built and demonstrated to perform extremely well on sample datasets.

        Funnily enough, those systems aren’t using language models 🙄

        (There is Google’s Med-PaLM, but I suspect it wasn’t very useful in practice, which is why we haven’t heard anything since the original announcement.)

        • @[email protected]
          link
          fedilink
          3
          edit-2
          3 months ago

          I have read some headline that said that some of these models just measure age of a patient and a quality of the machine making photos.

              • @[email protected]
                link
                fedilink
                63 months ago

                Still AI misalignment is a real issue. I just don’t remember which model was studied and had been found out that it was missaligned.

                • @[email protected]
                  link
                  fedilink
                  43 months ago

                  That and bias, absolutely need improvements. That doesn’t mean LLMs can’t be extremely effective if given appropriate tasks. The problem is that the people who make decisions about where they’re used aren’t technical enough to understand their strengths and limitations

                  • @[email protected]
                    link
                    fedilink
                    13 months ago

                    I don’t think technical knowledge gives as good a sense as a lot of experience working with one.

                    Like saying the guys who designed a particular car would know best how it’ll perform on various racetracks. My sense is a driver would have a better sense.

          • @[email protected]
            link
            fedilink
            English
            93 months ago

            Eh. Depends on which tech is being used and how. For a lot of things, relatively basic ML models purposefully trained do a pretty good job, and are, in fact, limited by the diagnoses in the training data. But more generalized “AI” tools seem rather… questionable.

            Like, you can train a SVM on fMRIs to compare structures in the brain between patients diagnosed with bipolar disorder and those that are not diagnosed with it, and it will have an accuracy rate on new patients basically equal to the accuracy rate of the doctors who did the diagnosing in the training set. But you’ll have a much harder time creating a model that takes in fMRIs and reports back answers to the question of “which brain disease or abnormality do I have?”

            This stuff works much closer to advertised when it’s narrowly defined and purpose built, but the people making and funding this work want catch-all doctor replacements, because of course they do, because there’s way more money in charging hospitals and patience 10% less than a doctor’s salary than there is in providing tools that make doctors’ efforts in diagnosing specific illnesses easier.

            Or, at least there is if you can pull it off.

            • @[email protected]
              link
              fedilink
              13 months ago

              Precisely. Many of the narrowly scoped solutions work really well, too (for what they’re advertised for).

              As of today though, they’re nowhere near reliable enough to replace doctors, and any breakthrough on that front is very unlikely to be a language model IMO.

              • @[email protected]
                link
                fedilink
                English
                23 months ago

                And they should no more replace doctors in the future than x-ray machines did in the past. We should never want them to.

      • @[email protected]
        link
        fedilink
        23 months ago

        They are already used in medicine reliably. Often. Welcome to the future. Computers are pretty good tools for many things actually.

    • @[email protected]
      link
      fedilink
      5
      edit-2
      3 months ago

      Peak intelligence, is realizing an LLM doesn’t care whether its tokens represent chunks of text, sound, images, videos, 3D models, paths, hand movements, floor planning, emojis, etc.

      The keyword is: “multimodal”.

      As for being able to correctly correlate some “chunks of MRI scan” with the word “tumor”… that’s all about the training (which I’d bet Claude is missing… did I hear “investment opportunity”? Guy isn’t wrong).

    • lad
      cake
      link
      33 months ago

      Well, image models are getting better at producing text, just sayin’

      • @MagicShel
        link
        83 months ago

        I read the same thing in Nevvsweeek.