We’re a tiny team @deepseek-ai pushing our limits in AGI exploration.
Starting next week, we’ll open-source 5 repos – one daily drop – not because we’ve made grand claims,
but simply as developers sharing our small-but-sincere progress with full transparency.
These are humble building blocks of our online service: documented, deployed and battle-tested in production.
No vaporware, just code that moved our tiny moonshot forward.
Why? Because every line shared becomes collective momentum that accelerates the journey.
Daily unlocks begin soon. No ivory towers – just pure garage-energy and community-driven innovation 🔧
Stay tuned – let’s geek out in the open together.
28 Comments
ipsum2
This is more exciting to me than OpenAI's 12 days of Christmas
mindwok
This team is truly something special.
codelion
This is great to see! Open-sourcing infrastructure tools can really accelerate innovation in the AI space. I've found that having access to well-documented repos makes it much easier to experiment and build on existing work. Are there any specific areas these repos focus on, like distributed training or model serving?
antupis
Kinda interesting to see where the moat is in AI space. Good base models can always distilled when you have access to API. System prompts can get leaked, and UI tricks can be copied. In the end, the moat might be in the hardware and vertical integration.
rvz
I really like this definition of "AGI": When everyone (yes everyone) benefits from very powerful AI models released for free and it is not gate-kept by one company and it costs $0 to use commercially or for research and you can do whatever you want with it.
Unlike the other counterpart which believes that "AGI" means: "raising billions of dollars to achieve $100BN of profits to their investors". (Which is complete nonsense).
While not totally "open source" by the strictest definition, it is at least better than having no model released with no mention of the architecture on the system card or paper and just vague comments about the 'performance'.
Ladies and gentlemen, this is closer towards being an better "Open AI". Unlike the other alleged $157BN "non-profit" scam.
I think you know which one really is beneficial to humanity and is the real "Open AI".
sgt
Speaking of DeepSeek, anyone here used SambaNova – are they reliable?
swyx
odds on r1.5/r2 release?
voxelizer
I wonder if they are just shorting Nvidia…
mythz
Looking forward to it! I'll generally make an effort to use Open Models over proprietary alternatives when the use-case permits as Open Models getting better and more popular encourages more models to become open as well – a requisite for a future to be able to build self-hosted solutions that's not beholden to the control of mega corps and AI monopolies.
thundergolfer
“Pure garage-energy” is a great phrase.
Most interested to see their inference stack, hope that’s one of the 5. I think most people are running R1 on a single H200 node but Deepseek had much lower RAM per GPU for their inference and so had some cluster based MoE deployment.
suraci
I always consider open-sourcing to be a great social experiment. It may fail one day, but its effects will remain and benefit everyone.
oefrha
> Starting next week, we'll open-source 5 repos – one daily drop
Probably counts as announcement of announcement? Let’s wait for the actual repo drops before discussing them, especially because there are no details about what will be open sourced other than
> These are humble building blocks of our online service: documented, deployed and battle-tested in production.
deyiao
I really admire their mindset of striving for the betterment of humanity.
There was a time when OpenAI, Anthropic, and even Musk used to talk with that same lofty vision. But now, they've all shifted to competing for national interests instead, which is honestly quite disappointing.
csomar
> These are humble building blocks of our online service: documented, deployed and battle-tested in production. No vaporware, just code that moved our tiny moonshot forward.
My not-so-innocent guess is that they are looking to crowd-source their online platform (the front-end essentially) in order to reduce costs. Still acceptable though as they made the model open weight and partially re-producible.
vinhnx
Deep respect for DeepSeek and what they've done regarding all the innovations and researches they have been putting out in-the-open.
"Because every line shared becomes collective momentum that accelerates the journey. Daily unlocks begin soon. No ivory towers – just pure garage-energy and community-driven innovation" is a great phase.
yobid20
Deepseek showing US ai engineers are overpaid and many worthless lol keep it comin!!
macns
> Why? Because every line shared becomes collective momentum that accelerates the journey.
Truly admireable on their part and a great paradigm for others. Reasons for this doesn't really matter to me but I can't help but wonder if somehow they were obliged or otherwise indebted to follow this route.
dhdjruf
Long live llms I hope they infest every part of the internet with low level comments. Both the clear , deep, and dark.
Imagine no more human interactions just a permanent flood of meaningless thoughtless word salad.
I think the Chinese are perfect to introduce such a product very inline with what they usually produce.
Get ready for web3.o
Mr_Bees69
R1 is a better o1, this is a better devdays.
t24uo2i34j324l
Deepseek seems to be having huge PR wins as the "oh shucks" modest boy genius, while the Americans seem like pouty jerks.
Amodei's / Hassabis' comments in particular came off as so arrogant and annoying.
sidcool
This may be my cynical take, but this cannot be out of good will or noble intentions. There has to be an ulterior motive.
bigcat12345678
No turning back…
abdellah123
DeepSeek seems like Hisoka helping Gon and Killua … just for a more challenging battle at some point xD
ein0p
Beatings will continue until openness improves, apparently. Kudos to Deepseek, about time someone spilled some significant beans.
andy_ppp
How do the valuations of foundation model companies compete with them being firmly open sourced by Facebook and DeepSeek? It seems likely that building these models will not produce hundreds of billions in value given China and Facebook are giving them away largely for free.
wizardsbot
DeepSeek’s claim of focusing on humanity sounds noble, but their history as a spin-off from a hedge fund raises questions. If they’re truly altruistic, why not be more transparent about their funding and business goals?
RandyOrion
Well, although R1-671b is way too expensive for me to host, given their past open source (or weight) contributions, I DO have high expectation of them.
Each and every contribution to open source community will be helpful. Thanks DeepSeek!
maxglute
> geek out in the open together.
Some people seem butthurt that others are attributing altruism/nobility to smart rich kids that just want to have fun.
AFAIK Liang focus/personal interest is AGI – Deepseek appears simply to be side project of young rich quant who'd rather spend his millions doing AGI than buying a yacht – like there are multiple yachts and car collections worth more than aggregate $$$ that has been dumped into Deepseek (even if you believe the stupid semianalysis claim that they have 1B+ of chips – which they don't / didn't). Maybe just so happens actions that align with said interest can be attributed as altruistic by some. But no need to project other motivations, or other ulterior motives.