Imo the NPU is still niche for average consumer usage (compared to something like CUDA/Tensor cores). Just like the TPU, the NPU is just another hardware accelerator for specific tasks, most of which the average user can already handle with a CPU/GPU.
I would like to know how well the NPUs would work on more specialized tasks.
AI upscaling of a 5-minute SD video takes about 30 minutes (sometimes longer) on my 3080.
I’ve seen reviews reference the 3080 having ~238 TOPS, which implies that current-gen NPUs (typically in the ~40–50 TOPS range) would take significantly longer to upscale.
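To make that implication concrete, here’s a back-of-envelope sketch. It naively assumes upscaling time scales inversely with TOPS (real throughput also depends on precision, memory bandwidth, and software support, so treat the NPU figure as a rough guess, not a measurement):

```python
# Naive scaling estimate: time_npu = time_gpu * (gpu_tops / npu_tops).
# All numbers are assumptions taken from the discussion above.
gpu_tops = 238      # figure quoted in reviews for an RTX 3080
npu_tops = 45       # assumed current-gen consumer NPU, ~40-50 TOPS class
gpu_minutes = 30    # observed time to upscale a 5-minute SD clip on the 3080

est_npu_minutes = gpu_minutes * gpu_tops / npu_tops
print(f"Estimated NPU upscaling time: {est_npu_minutes:.0f} minutes")
# Roughly 159 minutes under these assumptions, i.e. ~5x slower.
```

So even ignoring software maturity, the raw TOPS gap alone suggests a current-gen NPU would take hours for the same clip.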
I would be willing to get a dedicated NPU if it offered significantly faster AI upscaling relative to high-end GPUs. Having software and an ecosystem provided by the NPU manufacturer would also be nice.
Since you seem fascinated by NPUs: here is one of the most recent publications comparing NPU, GPU, and CPU performance for an edge-computing use case with a YOLOv5 model. And here’s another benchmark provided by Google for its TPU product (Google Coral).
Given the circumstances, it’s unlikely that a current-generation NPU can handle heavy workloads like video upscaling or video generation, but for something like edge computing they offer fairly performant solutions at affordable prices (you can get NPUs on single-board computers built around Rockchip SoCs, especially the RK3588). If you’re using a Raspberry Pi 5 or another SBC, you can add a Google Coral (M.2 or USB version).
Thanks for those links. Will take a look.
I’m honestly surprised we don’t see more “power user” benchmarks measuring AI compute for use cases like video upscaling and local LLMs (my next project: I use ChatGPT as an elaborate spell checker, but I want to move away from corporate cloud solutions).
Cheers!