Of course DeepSeek lied about its training costs, as we had strongly suspected. SemiAnalysis has been following DeepSeek for the past several months. High Flyer, DeepSeek’s owner, was buying Nvidia…
After experimentation with models with clusters of thousands of GPUs, High Flyer made an investment in 10,000 A100 GPUs in 2021 before any export restrictions. That paid off. As High-Flyer improved, they realized that it was time to spin off “DeepSeek” in May 2023 with the goal of pursuing further AI capabilities with more focus.
So where is the lie?
your post is asking a lot of questions already answered by your posting
standard “fuck off programming.dev” ban with a side of who the fuck cares. deepseek isn’t the good guys, you weird fucks don’t have to go to a nitpick war defending them, there’s no good guys in LLMs and generative AI. all these people are grifters, all of them are gaming the benchmarks they designed to be gamed, nobody’s getting good results out of this fucking mediocre technology.
I hate LLMs, never really used them, and I dislike people who do for anything.
But I still like that DeepSeek wiped some of the bubble on the stock market, and feel like the disinfo around it is just OpenAI/MS/nvidia FUD to pretend they still matter.
No, it’s not. OpenAI doesn’t spend all that money on R&D, they spent majority of it on the actual training (hardware, electricity).
And that’s (supposedly) only $6M for Deepseek.
So where is the lie?
shot:
chaser:
citation:
your post is asking a lot of questions already answered by your posting
They did not answer anything, only alluded.
Just because they bought GPUs like everyone else doesn’t mean they could not train it cheaper.
standard “fuck off programming.dev” ban with a side of who the fuck cares. deepseek isn’t the good guys, you weird fucks don’t have to go to a nitpick war defending them, there’s no good guys in LLMs and generative AI. all these people are grifters, all of them are gaming the benchmarks they designed to be gamed, nobody’s getting good results out of this fucking mediocre technology.
I’m not arguing against any of that.
I hate LLMs, never really used them, and I dislike people who do for anything.
But I still like that DeepSeek wiped some of the bubble on the stock market, and feel like the disinfo around it is just OpenAI/MS/nvidia FUD to pretend they still matter.
Nothing more.