
How DiffusionGemma accelerates text generation with parallel refinement
Google’s new DiffusionGemma model drafts a block of 256 tokens at once and refines the whole block in parallel, correcting its own earlier choices as it goes, instead of generating one token at a time. Early benchmarks show speedups of up to 6x, though quality trade-offs remain.
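
To make the idea of parallel refinement concrete, here is a minimal toy sketch in Python. It is not DiffusionGemma's actual algorithm or API: `toy_denoiser`, `parallel_refine`, the `<mask>` placeholder, the confidence scores, and the choice of 8 refinement steps are all hypothetical stand-ins chosen for illustration. The sketch only shows the general shape of a masked-refinement loop, in which every position is proposed in a single call per step and low-confidence positions are re-masked so they can be revised later.

```python
import random

MASK = "<mask>"  # hypothetical placeholder for a not-yet-committed position


def toy_denoiser(tokens, target, rng):
    """Hypothetical stand-in for a model: one call proposes a
    (token, confidence) pair for every position at once."""
    known = sum(t != MASK for t in tokens) / len(tokens)
    proposals = []
    for i in range(len(tokens)):
        # Assumption: more committed context makes a correct proposal likelier.
        correct = rng.random() < 0.5 + 0.5 * known
        token = target[i] if correct else "?"
        # Assumption: wrong proposals tend to receive lower confidence.
        confidence = rng.random() * (1.0 if correct else 0.6)
        proposals.append((token, confidence))
    return proposals


def parallel_refine(target, steps=8, seed=0):
    """Fill len(target) positions in `steps` passes instead of one per token."""
    rng = random.Random(seed)
    n = len(target)
    tokens = [MASK] * n
    for step in range(1, steps + 1):
        proposals = toy_denoiser(tokens, target, rng)  # all positions in parallel
        keep = n * step // steps  # commit a growing share of positions each step
        ranked = sorted(range(n), key=lambda i: proposals[i][1], reverse=True)
        kept = set(ranked[:keep])
        # Re-mask everything outside the top-`keep` set, so a token chosen in an
        # earlier step can still be dropped and replaced (the self-correction).
        tokens = [proposals[i][0] if i in kept else MASK for i in range(n)]
    return tokens


if __name__ == "__main__":
    target = [f"tok{i}" for i in range(256)]
    out = parallel_refine(target, steps=8)
    print(sum(a == b for a, b in zip(out, target)), "of", len(target), "positions match")
```

In this toy, the block of 256 positions is filled after 8 calls to the denoiser rather than 256 sequential ones. The step count is arbitrary, and the sketch says nothing about real-world speed or output quality, which depend on the cost of each pass and on how well the model revises its own drafts.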