Skip to content Skip to footer

AI Meets WinDBG by thunderbong

8 Comments

  • Post Author
    Zebfross
    Posted May 5, 2025 at 5:30 am

    Considering AI is trained on the average human experience, I have a hard time believing it would be able to make any significant difference in this area. The best experience I’ve had debugging at this level was using Microsoft’s time travel debugger which allows stepping forward and back.

  • Post Author
    cadamsdotcom
    Posted May 5, 2025 at 5:31 am

    Author built an MCP server for windbg: https://github.com/svnscha/mcp-windbg

    Knows plenty of arcane commands in addition to the common ones, which is really cool & lets it do amazing things for you, the user.

    To the author: most of your audience knows what MCP is, may I suggest adding a tl;dr to help people quickly understand what you've done?

  • Post Author
    indigodaddy
    Posted May 5, 2025 at 5:43 am

    My word, that's one of the most beautiful sites I've ever encountered on mobile.

  • Post Author
    revskill
    Posted May 5, 2025 at 5:45 am

    [flagged]

  • Post Author
    lowleveldesign
    Posted May 5, 2025 at 6:04 am

    I do a lot of Windows troubleshooting and still thinking about incorporating AI in my work. The posted project looks interesting and it's impressive how fast it was created. Since it's using MCP it should be possible to bind it with local models. I wonder how performant and effective it would be. When working in the debugger, you should be careful with what you send to the external servers (for example, Copilot). Process memory may contain unencrypted passwords, usernames, domain configuration, IP addresses, etc. Also, I don’t think that vibe-debugging will work without knowing what eax registry is or how to navigate stack/heap. It will solve some obvious problems, such as most exceptions, but for anything more demanding (bugs in application logic, race conditions, etc.), you will still need to get your hands dirty.

    I am actually more interested in improving the debugger interface. For example, AI assistant could help me create breakpoint commands that nicely print function parameters when you only partly know the function signature and do not have symbols. I used Claude/Gemini for such tasks and they were pretty good at it.

    As a side note, I recall Kevin Gosse also implemented a WinDbg extension [1][2] which used OpenAI API to interpret the debugger command output.

    [1] https://x.com/KooKiz/status/1641565024765214720

    [2] https://github.com/kevingosse/windbg-extensions

  • Post Author
    danielovichdk
    Posted May 5, 2025 at 6:25 am

    Claiming to use WinDBG for debugging a crash dump and the only commands I can find in the MCP code are these ? I am not trying to be a dick here, but how does this really work under the covers ? Is the MCP learning windbg ? Is there a model that knows windbg ? I am asking becuase I have no idea.

            results["info"] = session.send_command(".lastevent")
            results["exception"] = session.send_command("!analyze -v")
            results["modules"] = session.send_command("lm")
            results["threads"] = session.send_command("~")
    

    You cannot debug a crash dump only with these 4 commands, all the time.

  • Post Author
    anougaret
    Posted May 5, 2025 at 7:13 am

    this is pretty cool but ultimately it won't be enough to debug real bugs that are nested deep within business logic or happening because of long chains of events across multiple services/layers of the stack

    imo what AI needs to debug is either:

    – train with RL to use breakpoints + debugger or to do print debugging, but that'll suck because chains of action are super freaking long and also we know how it goes with AI memory currently, it's not great

    – a sort of omniscient debugger always on that can inform the AI of all that the program/services did (sentry-like observability but on steroids). And then the AI would just search within that and find the root cause

    none of the two approaches are going to be easy to make happen but imo if we all spend 10+ hours every week debugging that's worth the shot

    that's why currently I'm working on approach 2. I made a time travel debugger/observability engine for JS/Python and I'm currently working on plugging it into AI context the most efficiently possible so it debugs even super long sequences of actions in dev & prod hopefully one day

    it's super WIP and not self-hostable yet but if you want to check it out: https://ariana.dev/

  • Post Author
    Tepix
    Posted May 5, 2025 at 7:16 am

    Sounds really neat!

    How does it compare to using the Ghidra MCP server?

Leave a comment

In the Shadows of Innovation”

© 2025 HackTech.info. All Rights Reserved.

Sign Up to Our Newsletter

Be the first to know the latest updates

Whoops, you're not connected to Mailchimp. You need to enter a valid Mailchimp API key.