DeepSeek Open Infra: Open-Sourcing 5 AI Repos in 5 Days by ahsmha_

By HackTech, February 21, 2025

We’re a tiny team @deepseek-ai pushing our limits in AGI exploration.

Starting next week, we’ll open-source 5 repos – one daily drop – not because we’ve made grand claims,
but simply as developers sharing our small-but-sincere progress with full transparency.

These are humble building blocks of our online service: documented, deployed and battle-tested in production.
No vaporware, just code that moved our tiny moonshot forward.

Why? Because every line shared becomes collective momentum that accelerates the journey.
Daily unlocks begin soon. No ivory towers – just pure garage-energy and community-driven innovation 🔧

Stay tuned – let’s geek out in the open together.

2024 AI Infrastructure Paper (SC24)

Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

📄 Paper Link
📄 Arxiv Paper Link

28 Comments

ipsum2

Posted February 21, 2025 at 5:46 am

This is more exciting to me than OpenAI's 12 days of Christmas

mindwok

Posted February 21, 2025 at 5:49 am

This team is truly something special.

codelion

Posted February 21, 2025 at 5:51 am

This is great to see! Open-sourcing infrastructure tools can really accelerate innovation in the AI space. I've found that having access to well-documented repos makes it much easier to experiment and build on existing work. Are there any specific areas these repos focus on, like distributed training or model serving?

antupis

Posted February 21, 2025 at 5:53 am

Kinda interesting to see where the moat is in the AI space. Good base models can always be distilled when you have access to the API. System prompts can get leaked, and UI tricks can be copied. In the end, the moat might be in the hardware and vertical integration.

rvz

Posted February 21, 2025 at 5:54 am

I really like this definition of "AGI": When everyone (yes everyone) benefits from very powerful AI models released for free and it is not gate-kept by one company and it costs $0 to use commercially or for research and you can do whatever you want with it.

Unlike the other counterpart which believes that "AGI" means: "raising billions of dollars to achieve $100BN of profits to their investors". (Which is complete nonsense).

While not totally "open source" by the strictest definition, it is at least better than having no model released with no mention of the architecture on the system card or paper and just vague comments about the 'performance'.

Ladies and gentlemen, this is closer to being a better "Open AI". Unlike the other alleged $157BN "non-profit" scam.

I think you know which one really is beneficial to humanity and is the real "Open AI".

sgt

Posted February 21, 2025 at 6:00 am

Speaking of DeepSeek, anyone here used SambaNova – are they reliable?

swyx

Posted February 21, 2025 at 6:04 am

odds on r1.5/r2 release?

voxelizer

Posted February 21, 2025 at 6:11 am

I wonder if they are just shorting Nvidia…

mythz

Posted February 21, 2025 at 6:12 am

Looking forward to it! I'll generally make an effort to use Open Models over proprietary alternatives when the use case permits, as Open Models getting better and more popular encourages more models to become open as well – a requisite for a future where we can build self-hosted solutions that aren't beholden to the control of mega corps and AI monopolies.

thundergolfer

Posted February 21, 2025 at 6:14 am

“Pure garage-energy” is a great phrase.

Most interested to see their inference stack, hope that’s one of the 5. I think most people are running R1 on a single H200 node, but Deepseek had much lower RAM per GPU for their inference and so had some cluster-based MoE deployment.

suraci

Posted February 21, 2025 at 6:18 am

I always consider open-sourcing to be a great social experiment. It may fail one day, but its effects will remain and benefit everyone.

oefrha

Posted February 21, 2025 at 6:21 am

> Starting next week, we'll open-source 5 repos – one daily drop

Probably counts as announcement of announcement? Let’s wait for the actual repo drops before discussing them, especially because there are no details about what will be open sourced other than

> These are humble building blocks of our online service: documented, deployed and battle-tested in production.

deyiao

Posted February 21, 2025 at 6:26 am

I really admire their mindset of striving for the betterment of humanity.
There was a time when OpenAI, Anthropic, and even Musk used to talk with that same lofty vision. But now, they've all shifted to competing for national interests instead, which is honestly quite disappointing.

csomar

Posted February 21, 2025 at 6:34 am

> These are humble building blocks of our online service: documented, deployed and battle-tested in production. No vaporware, just code that moved our tiny moonshot forward.

My not-so-innocent guess is that they are looking to crowd-source their online platform (the front-end essentially) in order to reduce costs. Still acceptable though, as they made the model open weight and partially reproducible.

vinhnx

Posted February 21, 2025 at 6:35 am

Deep respect for DeepSeek and what they've done regarding all the innovations and research they have been putting out in the open.

"Because every line shared becomes collective momentum that accelerates the journey. Daily unlocks begin soon. No ivory towers – just pure garage-energy and community-driven innovation" is a great phrase.

yobid20

Posted February 21, 2025 at 6:42 am

Deepseek showing US ai engineers are overpaid and many worthless lol keep it comin!!

macns

Posted February 21, 2025 at 6:45 am

> Why? Because every line shared becomes collective momentum that accelerates the journey.

Truly admirable on their part and a great paradigm for others. The reasons for this don't really matter to me, but I can't help but wonder if somehow they were obliged or otherwise indebted to follow this route.

dhdjruf

Posted February 21, 2025 at 6:45 am

Long live LLMs, I hope they infest every part of the internet with low-level comments. The clear, the deep, and the dark alike.

Imagine no more human interactions just a permanent flood of meaningless thoughtless word salad.

I think the Chinese are perfect to introduce such a product, very much in line with what they usually produce.

Get ready for web3.o

Mr_Bees69

Posted February 21, 2025 at 6:57 am

R1 is a better o1, this is a better devdays.

t24uo2i34j324l

Posted February 21, 2025 at 7:01 am

Deepseek seems to be having huge PR wins as the "oh shucks" modest boy genius, while the Americans seem like pouty jerks.

Amodei's / Hassabis' comments in particular came off as so arrogant and annoying.

sidcool

Posted February 21, 2025 at 7:14 am

This may be my cynical take, but this cannot be out of good will or noble intentions. There has to be an ulterior motive.

bigcat12345678

Posted February 21, 2025 at 7:21 am

No turning back…

abdellah123

Posted February 21, 2025 at 7:23 am

DeepSeek seems like Hisoka helping Gon and Killua … just for a more challenging battle at some point xD

ein0p

Posted February 21, 2025 at 7:28 am

Beatings will continue until openness improves, apparently. Kudos to Deepseek, about time someone spilled some significant beans.

andy_ppp

Posted February 21, 2025 at 7:31 am

How do the valuations of foundation model companies compete with them being firmly open sourced by Facebook and DeepSeek? It seems likely that building these models will not produce hundreds of billions in value given China and Facebook are giving them away largely for free.

wizardsbot

Posted February 21, 2025 at 7:43 am

DeepSeek’s claim of focusing on humanity sounds noble, but their history as a spin-off from a hedge fund raises questions. If they’re truly altruistic, why not be more transparent about their funding and business goals?

RandyOrion

Posted February 21, 2025 at 8:02 am

Well, although R1-671b is way too expensive for me to host, given their past open source (or weight) contributions, I DO have high expectations of them.

Each and every contribution to the open source community will be helpful. Thanks DeepSeek!

maxglute

Posted February 21, 2025 at 8:23 am

> geek out in the open together.

Some people seem butthurt that others are attributing altruism/nobility to smart rich kids that just want to have fun.

AFAIK Liang's focus/personal interest is AGI – Deepseek appears simply to be the side project of a young rich quant who'd rather spend his millions doing AGI than buying a yacht – like there are multiple yachts and car collections worth more than the aggregate $$$ that has been dumped into Deepseek (even if you believe the stupid semianalysis claim that they have 1B+ of chips – which they don't / didn't). Maybe it just so happens that actions that align with said interest can be attributed as altruistic by some. But no need to project other motivations, or other ulterior motives.
