Balinares 34 days ago | on: Introspective Diffusion Language Models

Isn't that exactly how draft models speed up inference, though? Validating a batch of tokens is significantly faster than generating them.
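
For anyone who hasn't seen the trick spelled out, here is a rough toy sketch of greedy draft-and-verify decoding. Everything in it is made up for illustration: `target_next` and `draft_next` are hypothetical stand-ins for a large and a small model, and `verify_batch` loops over positions only because this is a toy. In a real transformer that check would be a single parallel forward pass over all proposed positions, which is where the saving comes from.

```python
from typing import Callable, List, Tuple

Token = int
NextToken = Callable[[List[Token]], Token]


def target_next(ctx: List[Token]) -> Token:
    """Hypothetical 'large' model: deterministic greedy next token."""
    return (sum(ctx) + len(ctx)) % 7


def draft_next(ctx: List[Token]) -> Token:
    """Hypothetical 'small' model: agrees with the target most of the time."""
    t = target_next(ctx)
    # Assumption for the demo: the draft is wrong at every 4th context length.
    return (t + 1) % 7 if len(ctx) % 4 == 0 else t


def verify_batch(target: NextToken, ctx: List[Token],
                 proposal: List[Token]) -> List[Token]:
    """Target's greedy choice at each of the len(proposal) + 1 positions.

    Stand-in for ONE batched forward pass of the target model.
    """
    return [target(ctx + proposal[:i]) for i in range(len(proposal) + 1)]


def speculative_decode(target: NextToken, draft: NextToken,
                       prompt: List[Token], n_new: int,
                       k: int = 4) -> Tuple[List[Token], int]:
    """Greedy speculative decoding; returns (tokens, target pass count)."""
    out = list(prompt)
    target_passes = 0
    while len(out) - len(prompt) < n_new:
        # 1. The draft proposes k tokens sequentially (cheap).
        proposal: List[Token] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))
        # 2. The target checks all of them at once (one expensive pass).
        checked = verify_batch(target, out, proposal)
        target_passes += 1
        # 3. Keep the longest prefix the target agrees with.
        n_ok = 0
        while n_ok < k and proposal[n_ok] == checked[n_ok]:
            n_ok += 1
        out.extend(proposal[:n_ok])
        # 4. The same pass also yields the target's own next token.
        out.append(checked[n_ok])
    return out[:len(prompt) + n_new], target_passes


def greedy_decode(target: NextToken, prompt: List[Token],
                  n_new: int) -> List[Token]:
    """Baseline: one target pass per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(target(out))
    return out


if __name__ == "__main__":
    tokens, passes = speculative_decode(target_next, draft_next, [1, 2, 3], 12)
    print("tokens:", tokens)
    print("target passes:", passes, "vs 12 for plain greedy decoding")
```

The output matches plain greedy decoding with the target token for token; the only thing that changes is how many times the expensive model has to run. Sampling-based variants accept or reject proposals probabilistically instead of by exact match, which this sketch does not attempt.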