• FlexibleToast@lemmy.world
    1 month ago

    A 120B-parameter model is small compared to the models running in datacenters. However, this does seem like the current “Moore’s Law” for AI: finding ever more efficient ways to run larger-parameter models.