Muse: a Text-to-Image Generation AI Tool

Muse is a text-to-image Transformer model developed by Google Research. Its authors report that it achieves state-of-the-art image generation quality while being significantly more efficient than diffusion and autoregressive models. The model is trained on a masked modeling task in a discrete token space: given text embeddings from a pre-trained large language model, it learns to predict randomly masked image tokens.
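To make the masked modeling task concrete, here is a minimal sketch of how a training example could be prepared. It is not Muse's actual code: the `MASK_ID` sentinel, the `mask_tokens` helper, the 4x4 token grid, the codebook size of 8192, and the fixed 50% mask ratio are all assumptions chosen for illustration, and the text-conditioned transformer itself is left out.

```python
import random

MASK_ID = -1  # hypothetical sentinel marking a masked position


def mask_tokens(tokens, mask_ratio, rng):
    """Replace a random subset of image tokens with MASK_ID.

    Returns the corrupted sequence and the sorted list of masked positions.
    """
    n_mask = max(1, round(mask_ratio * len(tokens)))
    positions = sorted(rng.sample(range(len(tokens)), n_mask))
    corrupted = list(tokens)
    for p in positions:
        corrupted[p] = MASK_ID
    return corrupted, positions


rng = random.Random(0)

# Toy "image": a 4x4 grid of discrete token ids (codebook size is illustrative).
image_tokens = [rng.randrange(8192) for _ in range(16)]

corrupted, masked_positions = mask_tokens(image_tokens, mask_ratio=0.5, rng=rng)

# Training targets are the original tokens at the masked positions only;
# a model would see `corrupted` plus the text embeddings and predict `targets`.
targets = [image_tokens[p] for p in masked_positions]
```

In this sketch the loss would be computed only over `masked_positions`, which is what distinguishes masked modeling from reconstructing the whole sequence.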

Muse's efficiency has two sources. Because it operates on discrete tokens, it is more efficient than pixel-space diffusion models such as Imagen and DALL-E 2. Because it decodes many tokens in parallel rather than one at a time, it is more efficient than autoregressive models such as Parti.
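The benefit of parallel decoding can be illustrated with a toy sketch that counts model calls. The code below assumes a confidence-based iterative scheme with a cosine masking schedule; that choice, the `toy_predict` stand-in (which returns random guesses instead of running a transformer), and the numbers 256 and 8 are assumptions for illustration, not details taken from this page.

```python
import math
import random

MASK_ID = -1  # hypothetical sentinel marking a still-undecided position


def toy_predict(tokens, rng):
    """Stand-in for the model: one (token, confidence) guess per position."""
    return [(rng.randrange(8192), rng.random()) for _ in tokens]


def parallel_decode(num_tokens, num_steps, rng):
    """Fill all positions in `num_steps` model calls, most confident first."""
    tokens = [MASK_ID] * num_tokens
    forward_passes = 0
    for step in range(1, num_steps + 1):
        preds = toy_predict(tokens, rng)  # one call predicts every position
        forward_passes += 1
        if step == num_steps:
            keep_masked = 0  # final step commits everything that is left
        else:
            # Cosine schedule: how many tokens stay masked after this step.
            ratio = math.cos(math.pi / 2 * step / num_steps)
            keep_masked = math.floor(num_tokens * ratio)
        masked = [i for i, t in enumerate(tokens) if t == MASK_ID]
        masked.sort(key=lambda i: preds[i][1], reverse=True)
        n_commit = max(0, len(masked) - keep_masked)
        for i in masked[:n_commit]:
            tokens[i] = preds[i][0]
    return tokens, forward_passes


decoded, passes = parallel_decode(num_tokens=256, num_steps=8, rng=random.Random(0))
```

Under these assumptions, 256 tokens are produced in 8 model calls, whereas a strictly token-by-token autoregressive decoder would need one call per token, 256 in total.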

If you want to learn more about Muse, you can read about it in this blog.

Categories: Computer Science, Machine Learning

