• brucethemoose@lemmy.world
    3 days ago

    I dunno. From my more isolated perspective on GitHub and small LLM testing circles, I see a lot of 3090s, 4090s, sometimes arrays of 3060s/3090s or old P40s or MI50s, which people got basically for the purpose of experimentation and development because they can’t drop (or at least justify) $5K.

    They would 100% drop that money on at least one 7900 48GB instead (as the sheer capacity is worth it over the speed hit and finickiness), and then do a whole bunch of bugfixing/testing on them. I know I would. Hence the Framework Strix Halo thing is sold out even though it’s… rather compute-lite compared to a 3090+ GPU.

    It seems like a tiny market, but a lot of the frameworks/features/models being developed by humble open source devs filter up to the enterprise space. You’d absolutely see more enterprise use once the toolkits were hammered out on desktops… But they aren’t, because AMD gives us no incentive to do so. A 7900 is just not worth the trouble over a 3090/4090 if its VRAM capacity is the same, and this (more or less) extends up and down the price ranges.