It's best not to dwell on it

The Picard Maneuver@lemmy.world · 2 months ago

It's best not to dwell on it

snooggums@lemmy.world · 2 months ago

I love that this mirrors the experience of experts on social media like reddit, which was used for training chatgpt…

skillissuer@discuss.tchncs.de · 2 months ago

it’s much older than reddit https://en.wikipedia.org/wiki/Gell-Mann_amnesia_effect

jjjalljs@ttrpg.network · 2 months ago

i was going to post this, too.

The Gell-Mann amnesia effect is a cognitive bias describing the tendency of individuals to critically assess media reports in a domain they are knowledgeable about, yet continue to trust reporting in other areas despite recognizing similar potential inaccuracies.

foofiepie@lemmy.world · 2 months ago

SmokeyDope’s Law

PM_Your_Nudes_Please@lemmy.world · edit-2 2 months ago

Also common in news. There’s an old saying along the lines of “everyone trusts the news until they talk about your job.” Basically, the news is focused on getting info out quickly. Every station is rushing to be the first to break a story. So the people writing the teleprompter usually only have a few minutes (at best) to research anything before it goes live in front of the anchor. This means that you’re only ever going to get the most surface level info, even when the talking heads claim to be doing deep dives on a topic. It also means they’re going to be misleading or blatantly wrong a lot of the time, because they’re basically just parroting the top google result regardless of accuracy.

ChickenLadyLovesLife@lemmy.world · 2 months ago

One of my academic areas of expertise way back in the day (late '80s and early '90s) were the so-called “Mitochondrial Eve” and “Out of Africa” hypotheses. The absolute mangling of this shit by journalists even at the time was migraine-inducing and it’s gotten much worse in the decades since then. It hasn’t helped that subsequent generations of scholars have mangled the whole deal even worse. The only advice I can offer people is that if the article (scholastic or popular) contains the word “Neanderthal” anywhere, just toss it.

Doctor_Satan@lemmy.world · 2 months ago

Science journalism in a nutshell.

Credit to Saturday Morning Breakfast Cereal

ᴍᴜᴛɪʟᴀᴛɪᴏɴᴡᴀᴠᴇ @lemmy.dbzer0.com · 2 months ago

I’m curious. Are you saying neanderthal didn’t exist, or was just homo sapiens? Or did you mean in the context of mitochondrial Eve?

ChickenLadyLovesLife@lemmy.world · 2 months ago

Are you saying neanderthal didn’t exist, or was just homo sapiens? Or did you mean in the context of mitochondrial Eve?

All of these things, actually. The measured, physiological differences between “homo sapiens” and “neanderthal” (the air quotes here meaning “so-called”) fossils are much smaller than the differences found among contemporary humans, so the premise that “neanderthals” represent(ed) a separate species - in the sense of a reproductively isolated gene pool since gone extinct - is unsupported by fossil evidence. Of course nobody actually makes that claim anymore, since it’s now commonly reported that contemporary humans possess x% of neanderthal DNA (and thus cannot be said to be “extinct”). Of course nobody originally (when Mitochondrial Eve was first mooted) made any claims whatsoever about neanderthals: the term “neanderthal” was imported into the debate over the age and location of the last common mtDNA ancestor years later, after it was noticed that the age estimates of neanderthal remains happened to roughly match the age estimates of the genetic last common ancestor. And this was also after the term “neanderthal” had previously gone into the same general category in Anthropology as “Piltdown Man”.

Most ironically, articles on the subject today now claim a correspondence between the fossil and genetic evidence, despite the fact that the very first articles (out of Allan Wilson’s lab and published in Nature and Science in the mid-1980s) drew their entire impact and notoriety from the fact that the genetic evidence (which supposedly gave 100,000 years ago and then 200,000 years ago as the age of the last common ancestor) completely contradicted the fossil evidence (which shows upright bipedal hominids spreading out of Africa more than a million and half years ago). To me, the weirdest thing is that academic articles on the subject now almost never cite these two seminal articles at all, and most authors seem genuinely unaware of them.

AnUnusualRelic@lemmy.world · edit-2 2 months ago

~~Sientists~~ Scientists confirm it: we are living in a simulation!

ᴍᴜᴛɪʟᴀᴛɪᴏɴᴡᴀᴠᴇ @lemmy.dbzer0.com · 2 months ago

You fucking know it man. Deep in your heart you know it. The thing is, it doesn’t matter.

UnderpantsWeevil@lemmy.world · 2 months ago

There’s an old saying along the lines of “everyone trusts the news until they talk about your job.”

This is something of a selection bias. Generally speaking, if you don’t trust a news broadcast then you won’t watch it. So of course you’re going to be predisposed to trust the news sources you do listen to. Until the news source bumps up against some of your prior info/intuition, at which point you start experiencing skepticism.

This means that you’re only ever going to get the most surface level info, even when the talking heads claim to be doing deep dives on a topic.

Investigative journalism has historically been a big part of the industry. You do get a few punchy “If it bleeds, it leads” hit pieces up front, but the Main Story tends to be the result of some more extensive investigation and coverage. I remember my home town of Houston had Marvin Zindler, a legendary beat reporter who would regularly put out interconnected 10-15 minute segments that offered continuous coverage on local events. This was after a stint at a municipal Consumer Fraud Prevention division that turned up numerous health code violations and sales frauds (he was allegedly let go by an incoming sheriff with ties to the local used car lobby, after Zindler exposed one too many odometer scams).

But investigative journalism costs money. And its not “business friendly” from a conservative corporate perspective, which can cut into advertising revenues. So it is often the first line of business to be cut when a local print or broadcast outlet gets bought up and turned over for syndication.

SirSamuel@lemmy.world · 2 months ago

First off, the beauty of these two posts being beside each other is palpable.

Second, as you can see on the picture, it’s more like 60%

morrowind@lemmy.ml · 2 months ago

No it’s not. If you actually read the study, it’s about AI search engines correctly finding and citing the source of a given quote, not general correctness, and not just the plain model

SirSamuel@lemmy.world · 2 months ago

Read the study? Why would i do that when there’s an infographic right there?

(thank you for the clarification, i actually appreciate it)

DudeImMacGyver@kbin.earth · 2 months ago

40% seems low

melpomenesclevage@lemmy.dbzer0.com · 2 months ago

that depends on what topic you know and how well you know it.

taladar@sh.itjust.works · edit-2 2 months ago

LLMs are actually pretty good for looking up words by their definition. But that is just about the only topic I can think of where they are correct even close to 80% of the time.

melpomenesclevage@lemmy.dbzer0.com · 2 months ago

yeah. some things I’d be shocked if they were correct 1% of the time. some things, like that, I might expect them to be correct about 80%, yeah.

jsomae@lemmy.ml · 2 months ago

ChatGPT is a tool. Use it for tasks where the cost of verifying the output is correct is less than the cost of doing it by hand.

qarbone@lemmy.world · 2 months ago

Honestly, I’ve found it best for quickly reformatting text and other content. It should live and die as a clerical tool.

ArchRecord@lemm.ee · 2 months ago

Which is exactly why every time I see big tech companies making another stupid implementation of it, it pisses me off.

LLMs like ChatGPT are fundamentally word probability machines. They predict the probability of words based on context (or if not given context, just the general probability) when given notes, for instance, they have all the context and knowledge, and all they have to do it predict the most statistically probable way of formatting the existing data into a better structure. Literally the perfect use case for the technology.

Even in similar contexts that don’t immediately seem like “text reformatting,” it’s extremely handy. For instance, Linkwarden can auto-tag your bookmarks, based on a predetermined list you set, using the context of each page fed into a model running via Ollama. Great feature, very useful.

Yet somehow, every tech company manages to use it in every way except that when developing products with it. It’s so discouraging to see.

tacobellhop@midwest.social · 2 months ago

Youre still doing it by hand to verify in any scientific capacity. I only use ChatGPT for philosophical hypotheticals involving the far future. We’re both wrong but it’s fun for the back and forth.

jsomae@lemmy.ml · edit-2 2 months ago

It is not true in general that verifying output for a science-related prompt requires doing it by hand, where “doing it by hand” means putting in the effort to answer the prompt manually without using AI.

tacobellhop@midwest.social · 2 months ago

You can get pretty in the weeds with conversions on ChatGPT in the chemistry world or even just basic lab work where a small miscalculation at scale can cost thousands of dollars or invite lawsuits.

I check against actual calibrated equipment as a verification final step.

jsomae@lemmy.ml · 2 months ago

I said not true in general. I don’t know much about chemistry. It may be more true in chemistry.

Coding is different. In many situations it can be cheap to test or eyeball the output.

Crucially, in nearly any subject, it can give you leads. Nobody expects every lead to pan out. But leads are hard to find.

tacobellhop@midwest.social · 2 months ago

I imagine ChatGPT and code is a lot like air and water.

Both parts are in the other part. Meaning llm is probably more native at learning reading and writing code than it is at interpreting engineering standards worldwide and allocation the exact thread pitch for a bolt you need to order thousands of. Go and thread one to verify.

jsomae@lemmy.ml · 2 months ago

This is possibly true due to the bias of the people who made it. But I reject the notion that because ChatGPT is made of code per se that it must understand code better than other subjects. Are humans good at biology for this reason?

tacobellhop@midwest.social · 2 months ago

You might know better than me. If you ask ChatGPT to write the code for itself I have no way to verify it. You would.

RabbitBBQ@lemmy.world · 2 months ago

If the standard is replicating human level intelligence and behavior, making up shit just to get you to go away about 40% of the time kind of checks out. In fact, I bet it hallucinates less and is wrong less often than most people you work with

Devanismyname@lemmy.ca · 2 months ago

And it just keeps improving over time. People shit all over ai to make themselves feel better because scary shit is happening.

bier@feddit.nl · 2 months ago

My kid sometimes makes up shit and completely presents it as facts. It made me realize how many made up facts I learned from other kids.

foxlore · 2 months ago

Talking with an AI model is like talking with that one friend, that is always high that thinks they know everything. But they have a wide enough interest set that they can actually piece together an idea, most of the time wrong, about any subject.

dagger_punch@lemmy.world · 2 months ago

Isn’t this called “the Joe Rogan experience”?

enbipanic@lemmy.blahaj.zone · 2 months ago

I am sorry to say I can frequently be this friend…

PartiallyApplied@lemmy.world · edit-2 2 months ago

I feel this hard with the New York Times.

99% of the time, I feel like it covers subjects adequately. It might be a bit further right than me, but for a general US source, I feel it’s rather representative.

Then they write a story about something happening to low income US people, and it’s just social and logical salad. They report, it appears as though they analytically look at data, instead of talking to people. Statisticians will tell you, and this is subtle: conclusions made at one level of detail cannot be generalized to another level of detail. Looking at data without talking with people is fallacious for social issues. The NYT needs to understand this, but meanwhile they are horrifically insensitive bordering on destructive at times.

“The jackboot only jumps down on people standing up”

Hozier, “Jackboot Jump”

Then I read the next story and I take it as credible without much critical thought or evidence. Bias is strange.

CancerMancer@sh.itjust.works · 2 months ago

There is a name for this: Gell-Mann amnesia effect

PartiallyApplied@lemmy.world · 2 months ago

“Wet sidewalks cause rain”

Pretty much. I never really thought about the causal link being entirely reversed, moreso that the chain of reasoning being broken or mediated by some factor they missed, which yes definitely happens, but now I can definitely think of instances where it’s totally flipped.

Very interesting read, thanks for sharing!

Lady Butterfly @lazysoci.al · 2 months ago

Can you give me an example of conclusions on one level of detail can’t be generalised to another level? I can’t quite understand it

PartiallyApplied@lemmy.world · edit-2 2 months ago

Perhaps the textbook example is the Simpson’s Paradox.

This article goes through a couple cases where naively and statically conclusions are supported, but when you correctly separate the data, those conclusions reverse themselves.

Another relevant issue is Aggregation Bias. This article has an example where conclusions about a population hold inversely with individuals of that population.

And the last one I can think of is MAUP, which deals with the fact that statistics are very sensitive in whatever process is used to divvy up a space. This is commonly referenced in spatial statistics but has more broad implications I believe.

This is not to say that you can never generalize, and indeed, often a big goal of statistics is to answer questions about populations using only information from a subset of individuals in that population.

All Models Are Wrong, Some are Useful

George Box

The argument I was making is that the NYT will authoritatively make conclusions without taking into account the individual, looking only at the population level, and not only is that oftentimes dubious, sometimes it’s actively detrimental. They don’t seem to me to prove their due diligence in mitigating the risk that comes with such dubious assumptions, hence the cynic in me left that Hozier quote.

Lady Butterfly @lazysoci.al · 2 months ago

That’s really interesting and I really appreciate you writing that out

Hikermick@lemmy.world · 2 months ago

I did a google search to find out how much i pay for water, the water department where I live bills by the MCF (1,000 cubic feet). The AI Overview told me an MCF was one million cubic feet. It’s a unit of measurement. It’s not subjective, not an opinion and AI still got it wrong.

TonyTonyChopper@mander.xyz · 2 months ago

Everywhere else in the world a big M means million.

Hikermick@lemmy.world · 2 months ago

I think in this case it’s Roman numeral M

SaharaMaleikuhm@feddit.org · 2 months ago

Americans really using ANYTHING but metric, huh?

AnUnusualRelic@lemmy.world · 2 months ago

The only thing that would make more sense would be if the bill was in cuneiform.

TonyTonyChopper@mander.xyz · 2 months ago

💀

Ilovethebomb@lemm.ee · 2 months ago

Yeah, shouldn’t that be Kcf, Kilo cubic foot?

AnUnusualRelic@lemmy.world · 2 months ago

Kilo is a small k as there wasn’t a person named that.

meliaesc@lemmy.world · 2 months ago

Except languages like French (mille)

Lemmy_2019@lemmy.one · 2 months ago

And Irish – míle.

AnUnusualRelic@lemmy.world · 2 months ago

Shouldn’t it be kcf? Or tcf if you’re desperate to avoid standard prefixes?

sugar_in_your_tea@sh.itjust.works · 2 months ago

Yeah, that’s an odd one. My city does water by the gallon, which is much more reasonable.

TranslateErr0r@lemmy.world · 2 months ago

I just think you need an abbrevations chart.

RedSnt 👓♂️🖥️@feddit.dk · edit-2 2 months ago

I’ve been using o3-mini mostly for ffmpeg command lines. And a bit of sed. And it hasn’t been terrible, it’s a good way to learn stuff I can’t decipher from the man pages. Not sure what else it’s good for tbh, but at least I can test and understand what it’s doing before running the code.

Nalivai@lemmy.world · 2 months ago

In my experience plain old googling still better.

henfredemars@infosec.pub · 2 months ago

I wonder if AI got better or if Google results got worse.

SaharaMaleikuhm@feddit.org · 2 months ago

Bit of the first, lots of the second.

RedSnt 👓♂️🖥️@feddit.dk · 2 months ago

True, in many cases I’m still searching around because the explanations from humans aren’t as simplified as the LLM. I’ll often have to be precise in my prompting to get the answers I want which one can’t be if they don’t know what to ask.

Nalivai@lemmy.world · 2 months ago

And that’s how you learn, and learning includes knowing how to check if the info you’re getting is correct.
LLM confidently gives you easy to digest bite, which is plain wrong 40 to 60% of the time, and even if you’re lucky it will be worse for you.

RedSnt 👓♂️🖥️@feddit.dk · 1 month ago

I’m in the kiddie pool, so I do look things up or ask what stuff does. Even though I looked at the man page for printf (printf.3 I believe), there was nothing about %*s for example, and searching for these things outside of asking LLM’s is some times too hard to filter down to the correct answer. I’m on 2 lines of code per hour, so I’m not exactly rushing.
Shell scripting is quite annoying to be sure. Thinking of learning python instead.

Nalivai@lemmy.world · 1 month ago

Come on, I just googled printf bash and the first link gave me very comprehensive page on how it works and what parameters are and how to use them. It was 3 pages on my phone.
Please, don’t get what I am about to say the wrong way, but if this was too complicated to you, this is your problem, not anything else. This is how people learn, there is no cheat code to it, you need to learn how to find the information and how to absorb it, and no robot will ever do it for you.
Bash is confusing mess, sure, but using random words genrtator to chew it for you will make things worse for you. It’s very possible that you’re on 2 lines per hour precisely because you’re using LLM.

Legume5534@lemm.ee · 2 months ago

Are you me? I’ve been doing the exact same thing this week. How creepy.

rapchee@lemmy.world · 2 months ago

we just had to create a new instance for coder7ZybCtRwMc, we’ll merge it back soon

faythofdragons@slrpnk.net · 2 months ago

Totally didn’t misread that as ‘ffmpreg’ nope.

RedSnt 👓♂️🖥️@feddit.dk · 2 months ago

I’m not judging. The LLM might though.

Zachariah@lemmy.world · 2 months ago

This, but for tech bros.

Korhaka@sopuli.xyz · 2 months ago

I just use it to write emails, so I declare the facts to the LLM and tell it to write an email based on that and the context of the email. Works pretty well but doesn’t really sound like something I wrote, it adds too much emotion.

jjjalljs@ttrpg.network · 2 months ago

That sounds like more work than just writing the email to me

sugar_in_your_tea@sh.itjust.works · 2 months ago

Yeah, that has been my experience so far. LLMs take as much or more work vs the way I normally do things.

uniquethrowagay@feddit.org · 2 months ago

This is what LLMs should be used for. People treat them like search engines and encyclopedias, which they definitely aren’t

🇰 🌀 🇱 🇦 🇳 🇦 🇰 🇮 @pawb.social · 2 months ago

Most of my searches have to do with video games, and I have yet to see any of those AI generated answers be accurate. But I mean, when the source of the AI’s info is coming from a Fandom wiki, it was already wading in shit before it ever generated a response.

henfredemars@infosec.pub · 2 months ago

I’ve tried it a few times with Dwarf Fortress, and it was always horribly wrong hallucinated instructions on how to do something.

balderdash@lemmy.zip · 2 months ago

Deepseek is pretty good tbh. The answers sometimes leave out information in a way that is misleading, but targeted follow up questions can clarify.

snooggums@lemmy.world · edit-2 2 months ago

Like leaving out what happened in Tiananmen Square in 1989?

heavydust@sh.itjust.works · 2 months ago

You must be more respectful of all cultures and opinions.

JusticeForPorygon@lemmy.blahaj.zone · 2 months ago

The amount of people who don’t realize this is satire reminds me of old Reddit

snooggums@lemmy.world · edit-2 2 months ago

Is it though? I really can’t tell.

Poe’s law has been working overtime recently.

Edut: saw a comment further down that it is a default deepseek response for censored content, so yeah a joke. People who don’t have that context aren’t going to get the joke.

Remember_the_tooth@lemmy.world · 2 months ago

It got me, for whatever that’s worth.

SpaceNoodle@lemmy.world · 2 months ago

Not everybody has heard every joke, buddy.

Geometrinen_Gepardi@sopuli.xyz · 2 months ago

In my opinion it should have been the politburo that was pureed under tank tracks and hosed down into the sewers instead of those students.

InvertedParallax@lemm.ee · 2 months ago

It really is so convenient, there are so many CPC members, but they all happen to be near a conveniently placed wall that is more than enough.

Remember_the_tooth@lemmy.world · 2 months ago

Is this a reference I’m not getting? Otherwise, I feel like censorship of massacre is not moraly acceptable regardless of culture. I’ll leave this here so this doesn’t get mistaken for nationalism:

https://en.m.wikipedia.org/wiki/List_of_massacres_in_the_United_States

It’s by no means a comprehensive list, but more of a primer. We do not forget these kinds of things in the hope that we may prevent future occurrences.

heavydust@sh.itjust.works · 2 months ago

It’s a fucking joke FFS. It’s the standard response from Deepseek.

Remember_the_tooth@lemmy.world · 2 months ago

Oh, gotcha. Yeah, I’m not on board with that. Thanks for clarifying. I thought you were being sincere for a moment. This is good satire. Carry on, please.

snooggums@lemmy.world · 2 months ago

Thank you, that provides context that was missing for the joke to land.

SpaceNoodle@lemmy.world · 2 months ago

How dare they ask!

ChickenLadyLovesLife@lemmy.world · 2 months ago

https://en.m.wikipedia.org/wiki/List_of_massacres_in_the_United_States

Huh, I used to make a joke about how there’s never been a “Bloody Monday” in history. I learn something new every day …

Ricky Rigatoni@lemm.ee · 2 months ago

Ah dun wanna 😠

InvertedParallax@lemm.ee · 2 months ago

Are we calling the communist party of China and their history of genocide and general evil, some kind of culture now?

Can’t believe how hostile people are against nazis, we should have respected their cultural use of gas chambers.

4am@lemm.ee · 2 months ago

Communism was never the problem, authoritarianism is the problem

InvertedParallax@lemm.ee · 2 months ago

The cpc is and has always been the definition of authoritarianism , and now it’s hyeprcapitalist authoritarianism.

SkyeStarfall@lemmy.blahaj.zone · 2 months ago

You can get an uncensored local version running if you got the hardware at least

RandomVideos · 2 months ago

It censors 1989 China. If you ask it to not say the year, it will work

aceshigh@lemmy.world · 2 months ago

I use chatgpt as a suggestion. Like an aid to whatever it is that I’m doing. It either helps me or it doesn’t, but I always have my critical thinking hat on.

BlackPenguins@lemmy.world · 2 months ago

Same. It’s an idea generator. I asked what kinda pie should I should make. I saw one I liked and then googled a real recipe.

I needed a SQL query for work. It gave me different methods of optimization. I then googled those methods, implemented, and tested it.

lowside@lemmy.world · 2 months ago

One thing I have found it to be useful for is changing the tone if what I write.

I tend to write very clinicaly because my job involves a lot of that style of writing. I have started asked chat gpt to rephrase what i write in a softer tone.

Not for everything, but for example when Im texting my girlfriend who is feeling insecure. It has helped me a lot! I always read thrugh it to make sure it did not change any of the meaning or add anything, but so far it has been pretty good at changing the tone.

Also use it to rephrase emails at work to make it sound more professional.

taxiiiii@lemmy.world · 2 months ago

I do that in reverse, lol. Except I’m also not a native speaker. “Rephrase this, it should sound more scientific”.