The AI imagery competition is getting personal. Google this week unveiled a new challenger to OpenAI’s vaunted DALL-E 2 text-to-image generator, and took shots at its rival’s efforts. Both models convert text prompts into pictures. But Google’s researchers claim their system provides “unprecedented photorealism and deep language understanding.”

Human raters preferred Imagen over DALL-E 2 for both sample quality and image-text alignment. Credit: Saharia et al.

The cringingly named Imagen system uses a large pre-trained language model as a text encoder. A cascade of diffusion models then turns the user’s words into pictures. In tests, the Google team said Imagen “significantly outperformed” DALL-E 2. Imagen particularly…
This story continues at The Next Web