• cheddar
    5 months ago

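    Typically you need about 1 GB of graphics RAM for each billion parameters (i.e. one byte per parameter, which assumes 8-bit weights; 16-bit weights need roughly twice that). This is a 405B-parameter model, so that's around 405 GB of VRAM for the weights alone.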
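
To make the arithmetic concrete, here is a rough sketch of that rule of thumb. It counts weight storage only (no KV cache, activations, or framework overhead), and the function name and precision table are made up for illustration, not taken from any library.

```python
# Back-of-the-envelope VRAM estimate for model weights only.
# Assumption: 1 GB = 1e9 bytes, and nothing but the weights is counted.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billions: float, precision: str = "int8") -> float:
    """Approximate weight memory in GB for a model of the given size."""
    # (params_billions * 1e9 params) * (bytes per param) / (1e9 bytes per GB)
    return params_billions * BYTES_PER_PARAM[precision]


if __name__ == "__main__":
    for precision in BYTES_PER_PARAM:
        print(f"405B @ {precision}: ~{weight_memory_gb(405, precision):g} GB")
```

At one byte per parameter this gives 405 GB for a 405B model, which matches the rule of thumb; real-world usage will be higher once context and runtime overhead are included.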