• ebu@awful.systems
    link
    fedilink
    English
    arrow-up
    36
    ·
    7 months ago

    […W]hen examining only those who passed the exam (i.e. licensed or license-pending attorneys), GPT-4’s performance is estimated to drop to 48th percentile overall, and 15th percentile on essays.

    officially Not The Worst™, so clearly AI is going to take over law and governments any day now

    also. what the hell is going on in that other reply thread. just a parade of people incorrecting each other going “LLM’s don’t work like [bad analogy], they work like [even worse analogy]”. did we hit too many buzzwords?

      • froztbyte@awful.systems
        link
        fedilink
        English
        arrow-up
        13
        ·
        edit-2
        7 months ago

        oh is that how come I get so much popcorn around these discussions? 🤔 makes sense when you think about it!

    • froztbyte@awful.systems
      link
      fedilink
      English
      arrow-up
      13
      ·
      7 months ago

      I was considering interjecting in there but I don’t want to get it on my clothes, so I’m content just watching from the outside.

      Not great, but I’m also not obligated to teach anyone anything, soooooo

    • Tja
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      14
      ·
      7 months ago

      Not the worst? 48th percentile is basically “average lawyer”. I don’t need a Supreme Court lawyer to argue my parking ticket. And if you train the LLM with specific case law and use RAG can get much better.

      In a worst case scenario if my local lawyer can use AI to generate a letter and just quickly go through it to make sure it didn’t hallucinate, they can process more clients, offer faster service and cheaper prices. Maybe not a revolution but still a win.

      • ebu@awful.systems
        link
        fedilink
        English
        arrow-up
        27
        arrow-down
        1
        ·
        7 months ago

        48th percentile is basically “average lawyer”.

        good thing all of law is just answering multiple-choice tests

        I don’t need a Supreme Court lawyer to argue my parking ticket.

        because judges looooove reading AI garbage and will definitely be willing to work with someone who is just repeatedly stuffing legal-sounding keywords into google docs and mashing “generate”

        And if you train the LLM with specific case law and use RAG can get much better.

        “guys our keyword-stuffing techniques aren’t working, we need a system to stuff EVEN MORE KEYWORDS into the keyword reassembler”

        In a worst case scenario if my local lawyer can use AI to generate a letter

        oh i would love to read those court documents

        and just quickly go through it to make sure it didn’t hallucinate

        wow, negative time saved! okay so your lawyer has to read and parse several paragraphs of statistical word salad, scrap 80+% of it because it’s legalese-flavored gobbledygook, and then try to write around and reformat the remaining 20% into something that’s syntactically and legally coherent – you know, the thing their profession is literally on the line for. good idea

        what promptfondlers continuously seem to fail to understand is that verification is the hard step. literally anyone on the planet can write a legal letter if they don’t care about its quality or the ramifications of sending it to a judge in their criminal defense trial. part of being a lawyer is being able to tell actual legal arguments from bullshit, and when you hire an attorney, that is the skill you are paying for. not how many paragraphs of bullshit they can spit out per minute

        they can process more clients, offer faster service and cheaper prices. Maybe not a revolution but still a win.

        “but the line is going up!! see?! sure we’re constantly losing cases and/or getting them thrown out because we’re spamming documents full of nonsense at the court clerk, but we’re doing it so quickly!!”

        • Tja
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          15
          ·
          7 months ago

          Spoken like someone who hasn’t gotten beyond ChatGPT on default settings.

            • Tja
              link
              fedilink
              English
              arrow-up
              1
              ·
              7 months ago

              I kinda agree. With the difference that NFTs used to cost millions of dollars, but using ChatGPT is free, llama is free and github copilot is a few bucks a month.

          • ebu@awful.systems
            link
            fedilink
            English
            arrow-up
            18
            ·
            7 months ago

            it’s funny how your first choice of insult is accusing me of not being deep enough into llm garbage. like, uh, yeah, why would i be

            but also how dare you – i’ll have you know i only choose the most finely-tuned, artisinally-crafted models for my lawyering and/or furry erotic roleplaying needs

            • Tja
              link
              fedilink
              English
              arrow-up
              1
              ·
              7 months ago

              Funny how you consider that an insult…

            • Tja
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              9
              ·
              7 months ago

              A reply to a rant based on false premises.

        • Tja
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          12
          ·
          7 months ago

          You understand that getting a list of sources and checking them is easier than finding them on your own, right?

          Of course it’s even easier not checking them at all and submitting garbage, but one should have learned in 3rd grade not to submit copy-pastes from Wikipedia or any website.

          This one is on human stupidity, not artifical intelligence.

          • skillissuer@discuss.tchncs.de
            link
            fedilink
            English
            arrow-up
            19
            ·
            7 months ago

            so your process of getting legal advice is:

            1. ask chatgpt, which will output convincing blob of text, with references and sources that might or might be not real, relevant, or make sense, some of which you won’t be able to judge
            2. then, ask a real lawyer about this, which means that they have to make sense of the situation on their own but also dig through machine generated drivel, which means that they need more time for that, and this means extra cost/wasted effort

            how does that simplify anything

            • froztbyte@awful.systems
              link
              fedilink
              English
              arrow-up
              18
              ·
              7 months ago

              Look it’s a really cheap and fast way of going from potential lawsuit to actual damages! That’s progress, that is!

              [ed note: since I can’t markup-joke it in a way that survives lemmy: to be read in pratchett voice)

            • Tja
              link
              fedilink
              English
              arrow-up
              1
              ·
              7 months ago

              Do you enjoy arguing against arguments that nobody made?

              • skillissuer@discuss.tchncs.de
                link
                fedilink
                English
                arrow-up
                1
                ·
                7 months ago

                Not the worst? 48th percentile is basically “average lawyer”. I don’t need a Supreme Court lawyer to argue my parking ticket.

                In a worst case scenario if my local lawyer can use AI to generate a letter and just quickly go through it to make sure it didn’t hallucinate

                • Tja
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  7 months ago

                  Exactly. Now compare it to what you wrote.

          • skillissuer@discuss.tchncs.de
            link
            fedilink
            English
            arrow-up
            18
            ·
            7 months ago

            You understand that getting a list of sources and checking them is easier than finding them on your own, right?

            that’s one weirdass assumption. when you know what are you looking for, the opposite is true. few months back i’ve authored a review chapter in my (very narrow) field, and while “getting a list of sources” part took maybe a day or two with a few scopus searches, combing through them, finding out what’s relevant and making a coherent story out of all of this was harder and took more time. if you don’t know where even to start, maybe you should ask a professional? especially when alternative is just going in raw into the court of law, defending whatever is at stake with a few paragraphs of possibly nonsensical spicy autocomplete output

            • Tja
              link
              fedilink
              English
              arrow-up
              1
              ·
              7 months ago

              So you are saying that you spent one day or two? And AI could have done it in 3 seconds? So using AI you could have saved one day or two and… That is bad? Somehow? Because it didn’t do all the work for you or…?

              • skillissuer@discuss.tchncs.de
                link
                fedilink
                English
                arrow-up
                1
                ·
                7 months ago

                i’m not convinced that spicy autocomplete would generate passable crude list of articles to consider in a way that would save me any time. already when going through scopus output i’ve made a fair selection (rejecting maybe 80%) of articles, then i had at least to skim them to tell whether it’s what i’m looking for, then i’ve narrowed them all down from few hundred to 80ish. then in some cases i went through literature in these articles to check if i wasn’t missing something. then i got to arrange them in some order and make a story out of all of that. all in all, these next steps took few weeks, so one day is positively swift. spicy autocomplete is utterly useless if it outputs few k of references, or throws out actually important work, or makes shit up and spits out nonexistent DOIs, because all of that makes next steps longer, or threatens to make the passable crude references list incomplete in important ways. i’ll pass

                • Tja
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  7 months ago

                  You’re not convinced? I could whip up a script to do it in half a day, and I’m really not that good of a dev anymore.

                  Skimming to see if that’s what you’re looking for is another task where Ai is great at.

                  • skillissuer@discuss.tchncs.de
                    link
                    fedilink
                    English
                    arrow-up
                    1
                    ·
                    7 months ago

                    we disagree on what my job as an author is

                    anyway, if you haven’t noticed, you’re already banned on awful.systems and i’m also done with you

      • V0ldek@awful.systems
        link
        fedilink
        English
        arrow-up
        12
        ·
        7 months ago

        In a worst case scenario if my local lawyer can use AI to generate a letter and just quickly go through it to make sure it didn’t hallucinate, they can process more clients, offer faster service and cheaper prices.

        It’s a good thing people are so good at vigilance tasks and don’t tend to fall onto just relying on the automation.

        • Tja
          link
          fedilink
          English
          arrow-up
          1
          ·
          7 months ago

          Unfortunately that’s human nature, you have to remain professional. There have been cases of lawyers that submitted unverified output and got in trouble for that. And vendors submitting product descriptions with “sorry, as a large language model, I cannot…”.

          But that’s human lazyness, not the fault of the tool.