The Lottery Ticket Hypothesis: finding sparse trainable NNs with 90% less params [2018]

Gsus4@mander.xyz · edit-2 11 days ago

The Lottery Ticket Hypothesis: finding sparse trainable NNs with 90% less params [2018]

afk_strats@lemmy.world · 11 days ago

Submitted in 2018. Does anyone know of any working implementations?

howrar@lemmy.ca · 11 days ago

I don’t know about implementation, but a lot of theoretical work I’ve been seeing with regards to LLMs and other deep learning models appear to confirm the central claim of this paper.

The most recent one I remember reading was this: https://arxiv.org/abs/2306.00978

Gsus4@mander.xyz · 11 days ago

A superficial search returned:

2020: https://github.com/rahulvigneswaran/Lottery-Ticket-Hypothesis-in-Pytorch

2024: https://arxiv.org/pdf/2403.04861

2025: https://github.com/gabrielolympie/moe-pruner

But yeah, in hindsight, I’ve been hearing about this stuff since 2019, it is not that new, given everything else. I added the paper date to the title.

afk_strats@lemmy.world · 11 days ago

Working pruning techniques are tested and seem at least good at maintaining coherent transformer MOE models. https://doi.org/10.48550/arXiv.2510.13999

There are several working examples of REAP pruned models HuggingFace and that method seems very good.

The op paper suggests a technique which starts with an arbitrary structured expers pruned during training. I’m not 100% understanding it, but I still don’t think I’ve seen this exact technique which might be even more efficient

The Lottery Ticket Hypothesis: finding sparse trainable NNs with 90% less params [2018]

The Lottery Ticket Hypothesis: finding sparse trainable NNs with 90% less params [2018]

The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks