The Lottery Ticket Hypothesis: finding sparse trainable NNs with 90% less params [2018]

Gsus4@mander.xyz · edit-2 25 days ago

Gsus4@mander.xyz · 25 days ago

A superficial search returned:

2020: https://github.com/rahulvigneswaran/Lottery-Ticket-Hypothesis-in-Pytorch

2024: https://arxiv.org/pdf/2403.04861

2025: https://github.com/gabrielolympie/moe-pruner

But yeah, in hindsight, I’ve been hearing about this stuff since 2019, so it is not that new, given everything else. I added the paper date to the title.

afk_strats@lemmy.world · 25 days ago

Working pruning techniques have been tested and seem at least good at keeping transformer MoE models coherent. https://doi.org/10.48550/arXiv.2510.13999

There are several working examples of REAP-pruned models on Hugging Face, and that method seems very good.

The OP paper suggests a technique which starts with arbitrarily structured experts that are pruned during training. I don’t 100% understand it, but I still don’t think I’ve seen this exact technique, which might be even more efficient.
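
For anyone else trying to picture it: as far as I can tell, the core loop in the OP paper is iterative magnitude pruning with a reset. You train, drop the smallest-magnitude surviving weights, put the survivors back to their original initial values, and repeat. Here is a rough pure-Python sketch of just that loop. The `train` callback, the flat weight list, and all the function names are made up for illustration; this is not the paper's code or any library's API.

```python
def magnitude_mask(weights, mask, prune_fraction):
    """Return a new mask that drops the smallest-magnitude surviving weights."""
    alive = [i for i, m in enumerate(mask) if m]
    n_prune = int(len(alive) * prune_fraction)
    # Sort surviving indices by |weight|, smallest first.
    alive.sort(key=lambda i: abs(weights[i]))
    new_mask = list(mask)
    for i in alive[:n_prune]:
        new_mask[i] = 0
    return new_mask


def rewind(initial_weights, mask):
    """Reset surviving weights to their initial values; pruned ones become 0."""
    return [w0 * m for w0, m in zip(initial_weights, mask)]


def find_ticket(initial_weights, train, rounds, prune_fraction):
    """Train, prune by magnitude, rewind to init, repeat.

    `train(weights, mask)` is a hypothetical callback that returns trained
    weights while keeping pruned positions at zero.
    """
    mask = [1] * len(initial_weights)
    for _ in range(rounds):
        trained = train(rewind(initial_weights, mask), mask)
        mask = magnitude_mask(trained, mask, prune_fraction)
    return mask, rewind(initial_weights, mask)
```

The returned mask plus the rewound initial weights is the "winning ticket" you would then train from scratch. A real run would do this per layer on tensors, with an actual training loop in place of the callback.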

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks