Skip to content Skip to footer
0 items - $0.00 0

DeepSeek Open Infra: Open-Sourcing 5 AI Repos in 5 Days by ahsmha_

DeepSeek Open Infra: Open-Sourcing 5 AI Repos in 5 Days by ahsmha_

DeepSeek Open Infra: Open-Sourcing 5 AI Repos in 5 Days by ahsmha_

28 Comments

  • Post Author
    ipsum2
    Posted February 21, 2025 at 5:46 am

    This is more exciting to me than OpenAI's 12 days of Christmas

  • Post Author
    mindwok
    Posted February 21, 2025 at 5:49 am

    This team is truly something special.

  • Post Author
    codelion
    Posted February 21, 2025 at 5:51 am

    This is great to see! Open-sourcing infrastructure tools can really accelerate innovation in the AI space. I've found that having access to well-documented repos makes it much easier to experiment and build on existing work. Are there any specific areas these repos focus on, like distributed training or model serving?

  • Post Author
    antupis
    Posted February 21, 2025 at 5:53 am

    Kinda interesting to see where the moat is in AI space. Good base models can always distilled when you have access to API. System prompts can get leaked, and UI tricks can be copied. In the end, the moat might be in the hardware and vertical integration.

  • Post Author
    rvz
    Posted February 21, 2025 at 5:54 am

    I really like this definition of "AGI": When everyone (yes everyone) benefits from very powerful AI models released for free and it is not gate-kept by one company and it costs $0 to use commercially or for research and you can do whatever you want with it.

    Unlike the other counterpart which believes that "AGI" means: "raising billions of dollars to achieve $100BN of profits to their investors". (Which is complete nonsense).

    While not totally "open source" by the strictest definition, it is at least better than having no model released with no mention of the architecture on the system card or paper and just vague comments about the 'performance'.

    Ladies and gentlemen, this is closer towards being an better "Open AI". Unlike the other alleged $157BN "non-profit" scam.

    I think you know which one really is beneficial to humanity and is the real "Open AI".

  • Post Author
    sgt
    Posted February 21, 2025 at 6:00 am

    Speaking of DeepSeek, anyone here used SambaNova – are they reliable?

  • Post Author
    swyx
    Posted February 21, 2025 at 6:04 am

    odds on r1.5/r2 release?

  • Post Author
    voxelizer
    Posted February 21, 2025 at 6:11 am

    I wonder if they are just shorting Nvidia…

  • Post Author
    mythz
    Posted February 21, 2025 at 6:12 am

    Looking forward to it! I'll generally make an effort to use Open Models over proprietary alternatives when the use-case permits as Open Models getting better and more popular encourages more models to become open as well – a requisite for a future to be able to build self-hosted solutions that's not beholden to the control of mega corps and AI monopolies.

  • Post Author
    thundergolfer
    Posted February 21, 2025 at 6:14 am

    “Pure garage-energy” is a great phrase.

    Most interested to see their inference stack, hope that’s one of the 5. I think most people are running R1 on a single H200 node but Deepseek had much lower RAM per GPU for their inference and so had some cluster based MoE deployment.

  • Post Author
    suraci
    Posted February 21, 2025 at 6:18 am

    I always consider open-sourcing to be a great social experiment. It may fail one day, but its effects will remain and benefit everyone.

  • Post Author
    oefrha
    Posted February 21, 2025 at 6:21 am

    > Starting next week, we'll open-source 5 repos – one daily drop

    Probably counts as announcement of announcement? Let’s wait for the actual repo drops before discussing them, especially because there are no details about what will be open sourced other than

    > These are humble building blocks of our online service: documented, deployed and battle-tested in production.

  • Post Author
    deyiao
    Posted February 21, 2025 at 6:26 am

    I really admire their mindset of striving for the betterment of humanity.
    There was a time when OpenAI, Anthropic, and even Musk used to talk with that same lofty vision. But now, they've all shifted to competing for national interests instead, which is honestly quite disappointing.

  • Post Author
    csomar
    Posted February 21, 2025 at 6:34 am

    > These are humble building blocks of our online service: documented, deployed and battle-tested in production. No vaporware, just code that moved our tiny moonshot forward.

    My not-so-innocent guess is that they are looking to crowd-source their online platform (the front-end essentially) in order to reduce costs. Still acceptable though as they made the model open weight and partially re-producible.

  • Post Author
    vinhnx
    Posted February 21, 2025 at 6:35 am

    Deep respect for DeepSeek and what they've done regarding all the innovations and researches they have been putting out in-the-open.

    "Because every line shared becomes collective momentum that accelerates the journey. Daily unlocks begin soon. No ivory towers – just pure garage-energy and community-driven innovation" is a great phase.

  • Post Author
    yobid20
    Posted February 21, 2025 at 6:42 am

    Deepseek showing US ai engineers are overpaid and many worthless lol keep it comin!!

  • Post Author
    macns
    Posted February 21, 2025 at 6:45 am

    > Why? Because every line shared becomes collective momentum that accelerates the journey.

    Truly admireable on their part and a great paradigm for others. Reasons for this doesn't really matter to me but I can't help but wonder if somehow they were obliged or otherwise indebted to follow this route.

  • Post Author
    dhdjruf
    Posted February 21, 2025 at 6:45 am

    Long live llms I hope they infest every part of the internet with low level comments. Both the clear , deep, and dark.

    Imagine no more human interactions just a permanent flood of meaningless thoughtless word salad.

    I think the Chinese are perfect to introduce such a product very inline with what they usually produce.

    Get ready for web3.o

  • Post Author
    Mr_Bees69
    Posted February 21, 2025 at 6:57 am

    R1 is a better o1, this is a better devdays.

  • Post Author
    t24uo2i34j324l
    Posted February 21, 2025 at 7:01 am

    Deepseek seems to be having huge PR wins as the "oh shucks" modest boy genius, while the Americans seem like pouty jerks.

    Amodei's / Hassabis' comments in particular came off as so arrogant and annoying.

  • Post Author
    sidcool
    Posted February 21, 2025 at 7:14 am

    This may be my cynical take, but this cannot be out of good will or noble intentions. There has to be an ulterior motive.

  • Post Author
    bigcat12345678
    Posted February 21, 2025 at 7:21 am

    No turning back…

  • Post Author
    abdellah123
    Posted February 21, 2025 at 7:23 am

    DeepSeek seems like Hisoka helping Gon and Killua … just for a more challenging battle at some point xD

  • Post Author
    ein0p
    Posted February 21, 2025 at 7:28 am

    Beatings will continue until openness improves, apparently. Kudos to Deepseek, about time someone spilled some significant beans.

  • Post Author
    andy_ppp
    Posted February 21, 2025 at 7:31 am

    How do the valuations of foundation model companies compete with them being firmly open sourced by Facebook and DeepSeek? It seems likely that building these models will not produce hundreds of billions in value given China and Facebook are giving them away largely for free.

  • Post Author
    wizardsbot
    Posted February 21, 2025 at 7:43 am

    DeepSeek’s claim of focusing on humanity sounds noble, but their history as a spin-off from a hedge fund raises questions. If they’re truly altruistic, why not be more transparent about their funding and business goals?

  • Post Author
    RandyOrion
    Posted February 21, 2025 at 8:02 am

    Well, although R1-671b is way too expensive for me to host, given their past open source (or weight) contributions, I DO have high expectation of them.

    Each and every contribution to open source community will be helpful. Thanks DeepSeek!

  • Post Author
    maxglute
    Posted February 21, 2025 at 8:23 am

    > geek out in the open together.

    Some people seem butthurt that others are attributing altruism/nobility to smart rich kids that just want to have fun.

    AFAIK Liang focus/personal interest is AGI – Deepseek appears simply to be side project of young rich quant who'd rather spend his millions doing AGI than buying a yacht – like there are multiple yachts and car collections worth more than aggregate $$$ that has been dumped into Deepseek (even if you believe the stupid semianalysis claim that they have 1B+ of chips – which they don't / didn't). Maybe just so happens actions that align with said interest can be attributed as altruistic by some. But no need to project other motivations, or other ulterior motives.

Leave a comment

In the Shadows of Innovation”

© 2025 HackTech.info. All Rights Reserved.

Sign Up to Our Newsletter

Be the first to know the latest updates

Whoops, you're not connected to Mailchimp. You need to enter a valid Mailchimp API key.