• Mojave@lemmy.world
    1 day ago

    DeepSeek claimed the model training took 2,788 thousand H800 GPU hours, which, at a cost of $2/GPU hour, comes out to a mere $5.576 million.
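
    A quick sanity check of that arithmetic, as a minimal sketch. The variable names are only illustrative, and the $2/GPU hour rate is the rental price assumed in the claim, not a measured cost:

```python
# Figures as quoted in the claim above.
gpu_hours = 2_788_000          # "2,788 thousand" H800 GPU hours
rate_usd_per_gpu_hour = 2      # assumed rental price from the claim

# Cost of the final training run only, under those assumptions.
final_run_cost_usd = gpu_hours * rate_usd_per_gpu_hour
print(f"${final_run_cost_usd / 1e6:.3f} million")  # prints "$5.576 million"
```

    So the number is internally consistent; whether it is plausible depends entirely on what that one run is taken to cover.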

    That seems impossibly low.

    DeepSeek is clear that these costs are only for the final training run, and exclude all other expenses.