Is anyone actually surprised by this?

  • AbouBenAdhem@lemmy.world
    link
    fedilink
    English
    arrow-up
    77
    arrow-down
    1
    ·
    edit-2
    19 hours ago

    Anyone using DeepSeek as a service the same way proprietary LLMs like ChatGPT are used is missing the point. The game-changer isn’t that a Chinese company like DeepSeek can compete with OpenAI and its ilk—it’s that, thanks to DeepSeek, any organization with a few million dollars to train and host their own model can now compete with OpenAI.

    • naeap@sopuli.xyz
      link
      fedilink
      arrow-up
      4
      ·
      2 days ago

      I’d like to look into that, how can I train an existing model further?

      I’m only playing around with ollama, but like to do a bit more - mostly just to fulfill my needs to understand things - but have no idea where to start

        • naeap@sopuli.xyz
          link
          fedilink
          arrow-up
          2
          ·
          edit-2
          1 day ago

          Python is not a problem
          SW Dev is my job. Just never had real contact with AI before, besides playing around a bit.

          Thank you very much for the link!!

          Edit: thank you very much again, that was pretty much exactly what I was looking for.
          Don’t know how I missed to checkout huggingface. Thought of it always just as a github for models and didn’t bother checking for docs…
          But that’s a great intro with simple tools/tutorials to get a grip on it, thanks!