Product Launchdiffusion llmreasoninginception labsopenai compatible

Inception Labs Launches Mercury 2 Diffusion LLM

thenewstack.io

|March 2, 2026

9.1

Relevance Score

Inception Labs Launches Mercury 2 Diffusion LLM

Last week Inception Labs launched Mercury 2, a diffusion-based large language model that generates over 1,000 tokens per second and delivers five to ten times lower end-to-end latency than speed-optimized autoregressive models, CEO Stefano Ermon told The New Stack. Mercury 2 is available via an OpenAI-compatible API, with AWS Bedrock integration coming soon, targeting faster, cheaper inference for reasoning workloads.

Inception Labs Launches Mercury 2 Diffusion LLM

More AI & Data Science News

US Military Uses Anthropic Claude For Operations

Israel Kills Iran's Supreme Leader Ayatollah Khamenei

Union Minister Promotes AI-Led Transformation For Punjab

India And Canada Sign Strategic Energy Partnership

Scoring Rationale

Sources