New AI text diffusion models break speed barriers by pulling words from noise by WithinReason

By HackTech, February 28, 2025

These diffusion models reportedly match or approach the benchmark performance of similarly sized conventional models while running as fast or faster. LLaDA’s researchers report that their 8-billion-parameter model performs similarly to LLaMA3 8B across various benchmarks, with competitive results on tasks like MMLU, ARC, and GSM8K.

Mercury, however, claims dramatic speed improvements. Its Mercury Coder Mini scores 88.0 percent on HumanEval and 77.1 percent on MBPP, comparable to GPT-4o Mini, while reportedly operating at 1,109 tokens per second against GPT-4o Mini’s 59 tokens per second. That is roughly a 19x speed advantage (1,109 / 59 ≈ 18.8) with similar performance on coding benchmarks.
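To make the roughly 19x figure concrete, here is a minimal Python sketch of the arithmetic behind it. The two throughput numbers are the ones reported above; the helper names and the 1,000-token response length are assumptions chosen purely for illustration, not anything taken from Mercury's documentation.

```python
def speedup(fast_tps: float, slow_tps: float) -> float:
    """Return how many times higher the first throughput is than the second."""
    if slow_tps <= 0:
        raise ValueError("slow_tps must be positive")
    return fast_tps / slow_tps


def seconds_for(tokens: int, tps: float) -> float:
    """Return the time needed to emit `tokens` at a steady `tps` tokens per second."""
    return tokens / tps


# Throughput figures as reported in the article.
mercury_tps = 1109
gpt4o_mini_tps = 59

ratio = speedup(mercury_tps, gpt4o_mini_tps)
print(f"Speed advantage: {ratio:.1f}x")

# Hypothetical 1,000-token response (an assumed length, for illustration only).
response_tokens = 1000
print(f"Mercury Coder Mini: {seconds_for(response_tokens, mercury_tps):.2f} s")
print(f"GPT-4o Mini: {seconds_for(response_tokens, gpt4o_mini_tps):.2f} s")
```

Under these assumptions, a response that would take about 17 seconds at 59 tokens per second would take under a second at 1,109 tokens per second. Real-world latency also depends on factors such as time to first token, which this sketch ignores.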

Mercury’s documentation states its models run “at over 1,000 tokens/sec on Nvidia H100s, a speed previously possible only using custom chips” from specialized hardware providers like Groq,

Tags: diffusion Models
