Remember how ChatGPT totally aced the bar exam? Wow! yeah, turns out that was just a lie

David Gerard@awful.systems · 10 months ago

Remember how ChatGPT totally aced the bar exam? Wow! yeah, turns out that was just a lie

ebu@awful.systems · 10 months ago

[…W]hen examining only those who passed the exam (i.e. licensed or license-pending attorneys), GPT-4’s performance is estimated to drop to 48th percentile overall, and 15th percentile on essays.

officially Not The Worst™, so clearly AI is going to take over law and governments any day now

also. what the hell is going on in that other reply thread. just a parade of people incorrecting each other going “LLM’s don’t work like [bad analogy], they work like [even worse analogy]”. did we hit too many buzzwords?

genuineparts@infosec.pub · 10 months ago

But LLM’s don’t work like Typewriters, they work like Microwaves!

froztbyte@awful.systems · edit-2 10 months ago

oh is that how come I get so much popcorn around these discussions? 🤔 makes sense when you think about it!

Amoeba_Girl@awful.systems · edit-2 10 months ago

“Nooo you don’t get it, LLMs are supposed to be shit”

froztbyte@awful.systems · 10 months ago

I was considering interjecting in there but I don’t want to get it on my clothes, so I’m content just watching from the outside.

Not great, but I’m also not obligated to teach anyone anything, soooooo

Tja · 10 months ago

Not the worst? 48th percentile is basically “average lawyer”. I don’t need a Supreme Court lawyer to argue my parking ticket. And if you train the LLM with specific case law and use RAG can get much better.

In a worst case scenario if my local lawyer can use AI to generate a letter and just quickly go through it to make sure it didn’t hallucinate, they can process more clients, offer faster service and cheaper prices. Maybe not a revolution but still a win.

ebu@awful.systems · 10 months ago

48th percentile is basically “average lawyer”.

good thing all of law is just answering multiple-choice tests

I don’t need a Supreme Court lawyer to argue my parking ticket.

because judges looooove reading AI garbage and will definitely be willing to work with someone who is just repeatedly stuffing legal-sounding keywords into google docs and mashing “generate”

And if you train the LLM with specific case law and use RAG can get much better.

“guys our keyword-stuffing techniques aren’t working, we need a system to stuff EVEN MORE KEYWORDS into the keyword reassembler”

In a worst case scenario if my local lawyer can use AI to generate a letter

oh i would love to read those court documents

and just quickly go through it to make sure it didn’t hallucinate

wow, negative time saved! okay so your lawyer has to read and parse several paragraphs of statistical word salad, scrap 80+% of it because it’s legalese-flavored gobbledygook, and then try to write around and reformat the remaining 20% into something that’s syntactically and legally coherent – you know, the thing their profession is literally on the line for. good idea

what promptfondlers continuously seem to fail to understand is that verification is the hard step. literally anyone on the planet can write a legal letter if they don’t care about its quality or the ramifications of sending it to a judge in their criminal defense trial. part of being a lawyer is being able to tell actual legal arguments from bullshit, and when you hire an attorney, that is the skill you are paying for. not how many paragraphs of bullshit they can spit out per minute

they can process more clients, offer faster service and cheaper prices. Maybe not a revolution but still a win.

“but the line is going up!! see?! sure we’re constantly losing cases and/or getting them thrown out because we’re spamming documents full of nonsense at the court clerk, but we’re doing it so quickly!!”

Tja · 10 months ago

Spoken like someone who hasn’t gotten beyond ChatGPT on default settings.

V0ldek@awful.systems · edit-2 10 months ago

Tja · 10 months ago

I kinda agree. With the difference that NFTs used to cost millions of dollars, but using ChatGPT is free, llama is free and github copilot is a few bucks a month.

ebu@awful.systems · 10 months ago

it’s funny how your first choice of insult is accusing me of not being deep enough into llm garbage. like, uh, yeah, why would i be

but also how dare you – i’ll have you know i only choose the most finely-tuned, artisinally-crafted models for my lawyering and/or furry erotic roleplaying needs

Tja · 10 months ago

Funny how you consider that an insult…

froztbyte@awful.systems · 10 months ago

what the fuck kind of reply is this

Tja · 10 months ago

A reply to a rant based on false premises.

froztbyte@awful.systems · 10 months ago

congratulations on your impending graduation (to expat poster)

Tja · 10 months ago

I love the anger, the name calling and the downvotes. Keep em coming.

self@awful.systems · 10 months ago

you’ll love what happens next

froztbyte@awful.systems · 10 months ago

this is not the place to satisfy your desire for degradation

skillissuer@discuss.tchncs.de · 10 months ago

In a worst case scenario if my local lawyer can use AI to generate a letter and just quickly go through it to make sure it didn’t hallucinate

at which point it’s just easier to do the right thing straight away, that is pay a lawyer to do their job https://www.bbc.com/news/world-us-canada-65735769

Tja · 10 months ago

You understand that getting a list of sources and checking them is easier than finding them on your own, right?

Of course it’s even easier not checking them at all and submitting garbage, but one should have learned in 3rd grade not to submit copy-pastes from Wikipedia or any website.

This one is on human stupidity, not artifical intelligence.

Deborah@hachyderm.io · 10 months ago

deleted by creator

Tja · 10 months ago

Most people, exactly. Not for someone who already knows how to do it, like a professional lawyer. Or their assistant.

Deborah@hachyderm.io · edit-2 10 months ago

deleted by creator

Tja · 10 months ago

Hence “or their assistant”.

Deborah@hachyderm.io · 10 months ago

deleted by creator

skillissuer@discuss.tchncs.de · 10 months ago

so your process of getting legal advice is:

ask chatgpt, which will output convincing blob of text, with references and sources that might or might be not real, relevant, or make sense, some of which you won’t be able to judge
then, ask a real lawyer about this, which means that they have to make sense of the situation on their own but also dig through machine generated drivel, which means that they need more time for that, and this means extra cost/wasted effort

how does that simplify anything

froztbyte@awful.systems · 10 months ago

Look it’s a really cheap and fast way of going from potential lawsuit to actual damages! That’s progress, that is!

[ed note: since I can’t markup-joke it in a way that survives lemmy: to be read in pratchett voice)

Tja · 10 months ago

Do you enjoy arguing against arguments that nobody made?

skillissuer@discuss.tchncs.de · 10 months ago

Not the worst? 48th percentile is basically “average lawyer”. I don’t need a Supreme Court lawyer to argue my parking ticket.

In a worst case scenario if my local lawyer can use AI to generate a letter and just quickly go through it to make sure it didn’t hallucinate

Tja · 10 months ago

Exactly. Now compare it to what you wrote.

skillissuer@discuss.tchncs.de · 10 months ago

You understand that getting a list of sources and checking them is easier than finding them on your own, right?

that’s one weirdass assumption. when you know what are you looking for, the opposite is true. few months back i’ve authored a review chapter in my (very narrow) field, and while “getting a list of sources” part took maybe a day or two with a few scopus searches, combing through them, finding out what’s relevant and making a coherent story out of all of this was harder and took more time. if you don’t know where even to start, maybe you should ask a professional? especially when alternative is just going in raw into the court of law, defending whatever is at stake with a few paragraphs of possibly nonsensical spicy autocomplete output

Tja · 10 months ago

So you are saying that you spent one day or two? And AI could have done it in 3 seconds? So using AI you could have saved one day or two and… That is bad? Somehow? Because it didn’t do all the work for you or…?

skillissuer@discuss.tchncs.de · 10 months ago

i’m not convinced that spicy autocomplete would generate passable crude list of articles to consider in a way that would save me any time. already when going through scopus output i’ve made a fair selection (rejecting maybe 80%) of articles, then i had at least to skim them to tell whether it’s what i’m looking for, then i’ve narrowed them all down from few hundred to 80ish. then in some cases i went through literature in these articles to check if i wasn’t missing something. then i got to arrange them in some order and make a story out of all of that. all in all, these next steps took few weeks, so one day is positively swift. spicy autocomplete is utterly useless if it outputs few k of references, or throws out actually important work, or makes shit up and spits out nonexistent DOIs, because all of that makes next steps longer, or threatens to make the passable crude references list incomplete in important ways. i’ll pass

Tja · 10 months ago

You’re not convinced? I could whip up a script to do it in half a day, and I’m really not that good of a dev anymore.

Skimming to see if that’s what you’re looking for is another task where Ai is great at.

skillissuer@discuss.tchncs.de · 10 months ago

we disagree on what my job as an author is

anyway, if you haven’t noticed, you’re already banned on awful.systems and i’m also done with you

V0ldek@awful.systems · 10 months ago

In a worst case scenario if my local lawyer can use AI to generate a letter and just quickly go through it to make sure it didn’t hallucinate, they can process more clients, offer faster service and cheaper prices.

It’s a good thing people are so good at vigilance tasks and don’t tend to fall onto just relying on the automation.

Tja · 10 months ago

Unfortunately that’s human nature, you have to remain professional. There have been cases of lawyers that submitted unverified output and got in trouble for that. And vendors submitting product descriptions with “sorry, as a large language model, I cannot…”.

But that’s human lazyness, not the fault of the tool.

Remember how ChatGPT totally aced the bar exam? Wow! yeah, turns out that was just a lie

Remember how ChatGPT totally aced the bar exam? Wow! yeah, turns out that was just a lie

Opinion | Press Pause on the Silicon Valley Hype Machine