• howrar@lemmy.ca
    2 months ago

    I don’t know about implementation, but a lot of the theoretical work I’ve been seeing on LLMs and other deep learning models appears to confirm the central claim of this paper.

    The most recent one I remember reading was this: https://arxiv.org/abs/2306.00978