Grok's image generation abilities with a new model, code-named Aurora. Aurora is an autoregressive mixture-of-experts network trained to predict the next token from interleaved text and image data
Luma Photon and Photon Flash. The most creative, intelligent and personalizable image generation models built on a new groundbreaking architecture that delivers ultra high quality and 10x higher cost efficiency.
HiDream-I1 is a new open-source image generative foundation model with 17B parameters that achieves state-of-the-art image generation quality within seconds.
SwD, a scale-wise distillation framework of diffusion models (DMs), which effectively employs next-scale prediction ideas for diffusion-based few-step generators
BAGEL, the open-source Unified Multimodal Model , offering comparable functionality to proprietary systems like GPT-4o and Gemini 2.0. By ByteDance-Seed
An image customization framework designed to support a wide range of tasks while facilitating seamless integration of multiple conditions. By Bytedance
A unified multimodal model that combines the reasoning and instruction following strength of autoregressive models with the generative power of diffusion models
Krea 1 offers accurate skin textures, dynamic camera angles, and expressive color. Discover striking visuals in an exceptionally artistic latent space.
Cosmos-Predict2 is a core WFM model for Physical AI, focused on future state prediction via advanced world modeling. It supports two key tasks: text-to-image and video-to-world generation.
The Juggernaut Flux Series has been designed to be a drop-in replacement for Flux Schnell, Flux Dev, and Flux 1.1 Pro, offering enhanced sharpness, colors, skin tone, reduced blur, and overall improved visual aesthetics.
Bytedance UXO Team. USO, a Unified Style-Subject Optimized customization model; UMO, a Unified Multi-identity Optimization framework; UNO, A Universal Customization Method for Both Single and Multi-Subject Conditioning
Vidu Q2 image generation is built to match the latest generation of top image platforms in quality, while pushing further on full function and consistency.
Nucleus Image introduces the first sparse Mixture-of-Experts architecture to diffusion-based image generation — activating only 2B of 17B total parameters per forward pass.