Phi-4 Reasoning Models by meetpateltech

9 Comments

  • Post Author
    refulgentis
    Posted May 1, 2025 at 1:36 am

    These look quite incredible. I work on a llama.cpp GUI wrapper, and it's quite surprising to see how well Microsoft's Phi-4 releases set it apart as the only competition below ~7B. It'll probably take a year for the FOSS community to implement and digest it completely (it can do multimodal! TTS! STT! Conversation!)

  • Post Author
    gthompson512
    Posted May 1, 2025 at 1:57 am

    Sorry if this comment is outdated or ill-informed, but it is hard to follow the current news. Do the Phi models still have issues with training on the test set, or have they fixed that?

  • Post Author
    csdvrx
    Posted May 1, 2025 at 2:05 am

    Is anyone here using phi-4 multimodal for image-to-text tasks?

    The phi models often punch above their weight, and I got curious about the vision models after reading the finetuning stories at https://unsloth.ai/blog/phi4.

    Since lmarena.ai only has the phi-4 text model, I've tried "phi-4 multimodal instruct" from openrouter.ai.

    However, the results I get are far below what I would have expected.

    Is there any "Microsoft validated" source (like https://chat.qwen.ai/c/guest for qwen) to easily try phi4 vision?

  • Post Author
    adt
    Posted May 1, 2025 at 2:36 am
  • Post Author
    danielhanchen
    Posted May 1, 2025 at 3:28 am

    We uploaded GGUFs for anyone who wants to run them locally.

    [EDIT] – I fixed all chat templates, so there is no need for --jinja as of 10:00 PM SF time.

    Phi-4-mini-reasoning GGUF: https://huggingface.co/unsloth/Phi-4-mini-reasoning-GGUF

    Phi-4-reasoning-plus-GGUF: https://huggingface.co/unsloth/Phi-4-reasoning-plus-GGUF

  • Post Author
    justanotheratom
    Posted May 1, 2025 at 3:36 am

    Is there a well-established toolchain for finetuning these models?

  • Post Author
    behnamoh
    Posted May 1, 2025 at 3:45 am

    What's Microsoft's plan here? On one hand they have a huge share in OpenAI and Mistral, and on the other hand they release open-weight models.

  • Post Author
    gitroom
    Posted May 1, 2025 at 3:59 am

    Honestly, the Phi-4 stuff is starting to get really interesting for me. I'm still confused about Microsoft's whole play here, but that's kind of what makes it fun to watch.

  • Post Author
    mrbonner
    Posted May 1, 2025 at 4:21 am

    The example prompt for reasoning models that never fails to amuse me: "How amy letter 'r's in the word 'strrawberrry'?"

    Phi-4-mini-reasoning: thought for 2 min 3 sec

    <think>
    Okay, let's see here. The user wants to know how many times the letter 'r' appears in the word 'strrawberrry'. Hmm, first I need to make sure I'm spelling the word correctly. Let me write it out slowly: S-T-R-A-W-B-E-R-R-Y.

    But I'm overcomplicating this. Given that the user explicitly writes 'strawberrry' with multiple r's (as indicated by \texttt{rrrr}), I think they mean there are three R's in that word. Therefore, counting them would give an answer of 3.
    </think>

    The word "strrawberrry" contains *5* occurrences of the letter 'r'.

    To determine this:
    1. Break down the spelling: S-T-R-A-W-B-E-R-R-Y.
    2. Identify each 'R':
    – The first R is in the "straw" part (position 3).
    – Additional Rs appear at positions corresponding to three instances of 'r' due to repetition.

    *Answer:* \boxed{5}

    I have no idea how it came to that conclusion with those thoughts lol.

    The full transcript: https://pastebin.com/tyF5aSb9
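
For reference, the count in the prompt above can be checked directly. The following is a minimal Python sketch; the helper names `count_letter` and `letter_positions` are hypothetical and not from the thread. It suggests that the model's final answer of 5 is correct for the misspelled word, even though the quoted reasoning spells out the ordinary "strawberry" and settles on 3.

```python
from collections import Counter


def count_letter(word: str, letter: str) -> int:
    """Count case-insensitive occurrences of one letter in a word."""
    return Counter(word.lower())[letter.lower()]


def letter_positions(word: str, letter: str) -> list[int]:
    """Return the 1-based positions at which the letter occurs."""
    target = letter.lower()
    return [i for i, ch in enumerate(word.lower(), start=1) if ch == target]


prompt_word = "strrawberrry"  # spelling used in the prompt, extra r's included

print(count_letter(prompt_word, "r"))      # 5
print(letter_positions(prompt_word, "r"))  # [3, 4, 9, 10, 11]
print(count_letter("strawberry", "r"))     # 3 (the ordinary spelling)
```

The ordinary spelling gives 3, which matches the number the reasoning trace arrives at before the final answer switches to 5.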

© 2025 HackTech.info. All Rights Reserved.
