merari42@lemmy.worldtoTechnology@lemmy.ml•‘Sputnik moment’: $1tn wiped off US stocks after Chinese firm unveils AI chatbot
1·
2 days agodoesn’t deepseek work on that though with their janus models?
doesn’t deepseek work on that though with their janus models?
And twitching like crazy from his ketamine habit at the inauguration.
Didn’t deepseek solve some of the data wall problems by creating good chain of thought data with an intermediate RL model. That approach should work with the tried and tested scaling laws just using much more compute.