Reshare: Introducing Mercury, the first commercial-scale diffusion large language model

lqdev☀02/26/2025

https://www.inceptionlabs.ai/news

We are announcing the Mercury family of diffusion large language models (dLLMs), a new generation of LLMs that push the frontier of fast, high-quality text generation.

Mercury is up to 10x faster than frontier speed-optimized LLMs. Our models run at over 1000 tokens/sec on NVIDIA H100s, a speed previously possible only using custom chips.

Permalink: /feed/mercury-first-commercial-scale-llm/

Tags: #diffusion #llm #ai

Back to feed

Send me a message or webmention