AI Image Models — Full Overview

21 companies · 29 models in our quiz · 92+ versions documented

Same Prompt. 29 Models.

Choose a prompt and see how every model responds to the exact same input. Models from the same company appear side by side.

Prompt

A cat curled up asleep on an open book, next to it a steaming cup of tea, cozy living room atmosphere.

[Image gallery: each of the following models rendered the prompt above: Imagen 4, Imagen 3 Pro (Nano Banana Pro), FLUX.2 [max], Seedream 4.5, Stable Diffusion 3.5 Large Turbo, Stable Diffusion 1.5, Kling 3.0 Omni, Kolors, Qwen Image, Z-Image, Hunyuan Image 3, Microsoft Designer, Ideogram V3, HiDream I1, Juggernaut Flux Pro, OpenArt Photorealistic, Flux Realism, Deliberate, Reve.]

Image Duel · Free to play

Can you spot the AI?

You've seen how these models compare — now put yourself to the test. Pick any model, get 10 rounds of real vs. AI, and find out if you can beat it.

⭐ Big Players

ChatGPT Images 2.0

OpenAI

OpenAI's most capable image model (April 2026). Available in Instant and Thinking variants — the Thinking mode reasons through layout and context before generating. Best-in-class text rendering, multilingual support, and capable of full magazine layouts and comic book pages. Replaces DALL-E 3 and GPT Image 1.x in the API.

Thinking mode · Best-in-class text in images · Magazine & layout generation

FLUX.2 [max]

Black Forest Labs

The premium tier of Black Forest Labs' FLUX.2 series. Maximum quality variant with improved photorealism, superior facial detail, accurate anatomy, and enhanced text rendering — pushing the FLUX architecture to its current ceiling.

Maximum photorealism · Superior facial detail · Enhanced text in images

Seedream 4.5

ByteDance / BytePlus

ByteDance's image generation model, marketed under the BytePlus enterprise brand. Builds on ByteDance's deep video and media processing expertise to produce cinematic, high-quality imagery with excellent composition and color accuracy.

Cinematic color grading · Enterprise API · Strong composition

Imagen 3 Pro (Nano Banana Pro)

Google

Nano Banana Pro, listed in the version history on this page as Gemini 3 Pro Image, is Google's 2025 image generation and editing model built on the Gemini 3 architecture. Designed for professional design workflows, it combines high-fidelity image synthesis with precise instruction-following for complex editing tasks.

Professional design workflows · Image generation & editing · Gemini 3 architecture

Model Cards

GPT Image 1.5

OpenAI

OpenAI's second-generation GPT Image model. Improved consistency, photorealism, and editing over GPT Image 1. Deprecated in May 2026 in favor of ChatGPT Images 2.0.

DALL-E 3

OpenAI

OpenAI's widely adopted image model, integrated into ChatGPT and Microsoft products. Excels at following complex text instructions and generating photorealistic scenes. Its images often have a slightly 'polished' look — almost too perfect.

Imagen 2

Google

Google DeepMind's second-generation text-to-image model, widely integrated across Google products including Google Slides, Workspace, and Search Generative Experience. Substantially improved photorealism and text rendering over Imagen 1, with a focus on diverse, high-quality imagery for productivity applications.

Imagen 4

Google

Google DeepMind's fourth-generation text-to-image model. Substantially improved photorealism, fine-grained detail, and text accuracy over previous versions. Particularly strong in landscape photography, architecture, and product visualization.

Stable Diffusion 3.5 Large Turbo

Stability AI

A fast, distilled variant of Stability AI's SD 3.5 architecture. Generates high-quality images in fewer sampling steps without sacrificing significant quality. Improved typography, compositional understanding, and human anatomy over SD 2.x.

Stable Diffusion 1.5

Stability AI

The open-source model that democratized AI image generation. Because it's fully open, hundreds of fine-tuned versions exist. Quality varies widely — some versions rival top proprietary models, others show clear artifacts especially in hands and faces.

FLUX.1.1 [pro]

Black Forest Labs

Black Forest Labs' enhanced commercial API model, succeeding FLUX.1 Pro with 6× faster generation and improved image quality. Stronger prompt adherence, better detail rendering, and improved color accuracy — accessible via the BFL API and partnered platforms.

FLUX.1 [dev]

Black Forest Labs

Black Forest Labs' open-weights model for non-commercial use. Guidance-distilled from FLUX.1 [pro], it produces output close to the Pro variant in quality while being optimized for local deployment and research.

FLUX.1 [schnell]

Black Forest Labs

Black Forest Labs' open-source (Apache 2.0) fast generation model, designed for local development and personal use. Produces results in 1–4 steps at the cost of some fine detail and photorealism compared to Dev or Pro variants.

Kling 3.0 Omni

Kuaishou

Kuaishou's multimodal image-and-video model with a unified architecture. Exceptional at consistent character generation, style-faithful rendering, and maintaining visual identity across multiple outputs.

Kolors

Kuaishou

Open-source image generation model from Kuaishou's AI lab. Based on an enhanced SDXL architecture, recognized for vibrant color reproduction and strong aesthetic composition — particularly excelling in portrait photography with diverse skin tone rendering.

Qwen Image

Alibaba

Alibaba's image generation model from the Qwen AI family. Produces high-quality imagery with particular strengths in East Asian aesthetics, product photography, and diverse cultural representations. Part of Alibaba's broader multimodal AI platform.

Hunyuan Image 3

Tencent

Tencent's third-generation image model with significantly improved photorealism and detail rendering. Strong in cinematic compositions, portrait photography, and lifestyle imagery. Part of Tencent's Hunyuan multimodal AI platform.

Microsoft Designer

Microsoft

Microsoft Designer uses OpenAI's GPT-Image and DALL-E models with Microsoft's own safety filters and style guidance on top. Available across Windows, Edge, and Microsoft 365 — bringing AI image generation to a massive mainstream audience.

Ideogram V3

Ideogram

Ideogram's third-generation model, from the Toronto-based startup founded by former Google Brain researchers. Recognized as one of the strongest models for rendering legible, stylistically integrated text within images, which makes it a go-to for graphic design, posters, and typographic artwork.

HiDream I1

HiDream.ai

HiDream.ai's flagship image generation model, recognized for exceptional detail in fashion, beauty, and lifestyle photography. Strong color accuracy, style consistency, and skin tone rendering make it popular among professional creative applications.

Juggernaut Flux Pro

RunDiffusion

RunDiffusion's photorealistic fine-tune of the FLUX architecture. A community favorite for portrait photography and commercial imagery, known for natural skin textures, accurate studio-quality lighting, and high-detail rendering.

OpenArt Photorealistic

OpenArt

OpenArt's flagship model trained specifically for maximum photorealism in portrait and product photography. Optimized for studio-quality lighting simulations, natural skin rendering, and commercial-grade output.

Flux Realism

XLabs AI

XLabs AI's FLUX fine-tune optimized specifically for photorealistic output. One of the most downloaded FLUX fine-tunes in the open-source community, with improved natural detail rendering, accurate color grading, and enhanced realism across portrait and landscape subjects.

Lucid Realism

Civitai Community

A popular Stable Diffusion fine-tune from the Civitai community focused on hyperrealistic portrait photography. Combines multiple checkpoint merges to achieve natural skin tones, accurate facial anatomy, and cinematic lighting without the characteristic 'plastic' look of base SD models.

FLUX 2 LoRA Realism

Community / LoRA Gallery

A community realism LoRA (Low-Rank Adaptation) applied on top of the FLUX.2 architecture, sourced from the LoRA Gallery collection. Fine-tunes the base model toward hyperrealistic photography — enhancing skin detail, natural lighting, and scene authenticity without full retraining.
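
To make "without full retraining" concrete, here is a minimal sketch of the low-rank update behind LoRA, assuming the common formulation W' = W + (alpha / r) · B·A. The matrix sizes, values, and the helper names `matmul` and `lora_weight` are hypothetical and chosen only for illustration; they do not come from the FLUX.2 LoRA itself.

```python
# Toy LoRA sketch: the frozen base weight W is never modified in place;
# only the small matrices A (r x d_in) and B (d_out x r) would be trained.

def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def lora_weight(W, A, B, alpha, r):
    """Return W + (alpha / r) * (B @ A), the effective weight with the adapter applied."""
    delta = matmul(B, A)
    scale = alpha / r
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

# Hypothetical sizes: a 4x4 identity base weight adapted with rank r = 1.
W = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
A = [[0.5, 0.0, 0.0, 0.0]]          # r x d_in  = 1 x 4
B = [[1.0], [0.0], [0.0], [2.0]]    # d_out x r = 4 x 1
W_adapted = lora_weight(W, A, B, alpha=2.0, r=1)

full_params = 4 * 4                 # entries in W
lora_params = 1 * 4 + 4 * 1         # entries in A plus entries in B
```

Even in this toy case the adapter stores 8 numbers instead of 16. At realistic layer sizes the gap is far larger, which is why LoRA files are small compared with full checkpoints.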

Deliberate

XpucT

A community fine-tune of Stable Diffusion 1.5, created by XpucT on Civitai. One of the most downloaded checkpoints in the open-source SD ecosystem. Deliberate (v1–v3) is optimized for photorealism, cinematic compositions, and anatomical accuracy — achieving high-quality output with minimal prompt engineering.

Reve

Reve AI

Reve AI's image generation model, known for strong prompt adherence and richly detailed output. Produces high-quality images with a distinctive visual style that blends photorealism with artistic polish — versatile across photography, illustration, and conceptual art.

Z-Image

Alibaba

Z-Image (造相) is an open-source image generation model family by Alibaba's Tongyi Lab. It uses a novel S3-DiT (Scalable Single-Stream Diffusion Transformer) architecture that processes text and image in a unified single stream — enabling efficient, high-quality generation with strong compositional control and instruction following.

Titan Image Generator v1

Amazon

Amazon's first-generation text-to-image model, part of the Amazon Bedrock foundation model platform. Designed for enterprise use cases including product visualization, marketing assets, and content generation at scale. Features built-in safety controls and watermarking capabilities.

All Companies & Version History

21 companies · 92+ model versions · release dates and benchmark links

OpenAI

USA openai.com ↗

DALL-E

Pioneering text-to-image model integrated into ChatGPT and Microsoft products

Jan 2021

DALL-E 1 Jan 2021

First version — introduced the concept of text-to-image generation to the public. 12B parameter transformer model.

Apr 2022

DALL-E 2 Apr 2022

Major quality leap. Introduced inpainting and outpainting. 4× higher resolution than DALL-E 1. Used CLIP image embeddings.

FID score study (arXiv)

Oct 2023

DALL-E 3 Oct 2023

Dramatically improved prompt adherence. Deeply integrated into ChatGPT. Strong text rendering in images.

T2I-CompBench evaluation

GPT Image

Next-gen image generation built into the GPT-4o architecture

Mar 2025

GPT Image 1 Mar 2025

First native image generation model integrated into GPT-4o. Best-in-class text rendering, strong instruction following.

May 2025

GPT Image 1.5 May 2025

Improved consistency, photorealism, and editing capabilities over GPT Image 1.

Apr 2026

ChatGPT Images 2.0 (GPT Image 2) Apr 2026

Major generational leap. Introduced Instant and Thinking variants — Thinking mode researches context before generating. Best-in-class text rendering, multilingual support, full magazine/comic layout generation. Replaces DALL-E 3 and GPT Image 1.x in the API (deprecated May 12, 2026).

OpenAI announcement

Google DeepMind

USA deepmind.google ↗

Imagen

Google's flagship diffusion model — consistently top-ranked photorealism

May 2022

Imagen 1 May 2022

Introduced by Google Brain. Outperformed DALL-E 2 on COCO FID at launch.

Imagen paper (arXiv)
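
For readers unfamiliar with the metric, FID (Fréchet Inception Distance) compares the statistics of Inception-network features for real and generated images. In its standard form, with mean μ and covariance Σ of those features for the real set (r) and the generated set (g):

```latex
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2
  + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2\left( \Sigma_r \Sigma_g \right)^{1/2} \right)
```

Lower is better; a score of 0 would mean the two feature distributions have identical mean and covariance.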

Dec 2023

Imagen 2 Dec 2023

Substantially improved photorealism, text rendering, and multilingual support. Launched in Google Bard and Vertex AI.

Aug 2024

Imagen 3 Aug 2024

The highest-quality Imagen model at the time of its launch. Improved detail, lighting, and artifact reduction.

Imagen 3 paper (arXiv)

May 2025

Imagen 4 May 2025

Next-gen text-to-image with best-in-class landscape, architecture, and product photography. Integrated into Gemini.

Gemini Image (Nano Banana)

Image generation built natively into the Gemini architecture

Mar 2025

Gemini 2.5 Flash Image (Nano Banana) Mar 2025

Fast image generation + editing model based on Gemini 2.5 Flash. Integrated into Gemini apps.

Jun 2025

Gemini 3 Pro Image (Nano Banana Pro) Jun 2025

Professional-grade image generation and editing. Precise instruction following, near-seamless inpainting.

Stability AI

UK / USA stability.ai ↗

Stable Diffusion

The open-source model that democratized AI image generation

Aug 2022

Stable Diffusion 1.4 Aug 2022 Open Source

First public release. Latent diffusion model — runs on consumer GPUs. Sparked the open-source AI art movement.

LDM paper (arXiv)

Oct 2022

Stable Diffusion 1.5 Oct 2022 Open Source

Improved training. Still the most widely used SD checkpoint — thousands of community fine-tunes built on it.

Nov 2022

Stable Diffusion 2.0 Nov 2022 Open Source

Higher resolution (768px default), new text encoder (OpenCLIP), new depth model.

Dec 2022

Stable Diffusion 2.1 Dec 2022 Open Source

Improved NSFW filtering approach. Less over-filtering than 2.0, better prompt adherence.

Jul 2023

Stable Diffusion XL (SDXL) Jul 2023 Open Source

Major architecture jump — 3.5B parameters. Native 1024px output. Two-stage pipeline with refiner model.

SDXL paper (arXiv)

Jun 2024

Stable Diffusion 3 Jun 2024 Open Weights

Multimodal Diffusion Transformer (MMDiT) architecture. Major improvement in text rendering and composition.

SD3 paper (arXiv)

Jun 2024

Stable Diffusion 3 Medium Jun 2024 Open Weights

2B parameter variant of SD 3. Lighter weight, runs on consumer hardware. Same MMDiT architecture as SD 3 but optimized for accessibility.

Oct 2024

Stable Diffusion 3.5 Medium Oct 2024 Open Weights

2.5B parameter model. Balanced between quality and speed — ideal for local use on consumer GPUs with less than 8 GB VRAM.

Oct 2024

Stable Diffusion 3.5 Large Oct 2024 Open Weights

8B parameter model. Best quality in the SD 3.x family.

Oct 2024

Stable Diffusion 3.5 Large Turbo Oct 2024 Open Weights

Distilled 4-step variant. Near SD 3.5 Large quality at a fraction of inference time.

SD Community Fine-tunes

The most widely used community checkpoints built on Stable Diffusion

Jan 2023

Deliberate (XpucT) Jan 2023 Open Source

One of the most downloaded SD 1.5 fine-tunes on Civitai. Optimized for photorealism, cinematic compositions, and anatomical accuracy — achieves high quality with minimal prompt engineering.

Deliberate on Civitai

Jun 2023

Lucid Realism Jun 2023 Open Source

Community checkpoint merge focused on hyperrealistic portrait photography. Combines multiple SD models to achieve natural skin tones and accurate facial anatomy without the plastic look of base SD.

Lucid Realism on Civitai

Midjourney

USA midjourney.com ↗

Midjourney

Artistic aesthetic, Discord-native — the model that went viral

Feb 2022

v1 Feb 2022

Initial alpha release via Discord.

Apr 2022

v2 Apr 2022

Improved coherence and style.

Jul 2022

v3 Jul 2022

Better quality, more artistic control.

Nov 2022

v4 Nov 2022

New proprietary model architecture. Major quality jump. More coherent scenes.

Mar 2023

v5 Mar 2023

Photorealistic quality leap. Detailed hands and faces. Introduced --ar and --style flags.

May 2023

v5.1 May 2023

Improved default aesthetics, more accurate with simple prompts.

Jun 2023

v5.2 Jun 2023

Sharpened images, improved aesthetics, introduced zoom-out feature.

Dec 2023

v6 Dec 2023

Much more literal prompt following, improved text in images, better photorealism.

Jul 2024

v6.1 Jul 2024

Refinement of v6 — improved image quality, better human anatomy, sharper details.

Apr 2025

v7 Apr 2025

Major architecture change. New personalization system, significantly faster generation, improved realism.

Black Forest Labs

Germany blackforestlabs.ai ↗

FLUX

State-of-the-art open model from former Stability AI researchers

Aug 2024

FLUX.1 [schnell] Aug 2024 Open Source

Apache 2.0 open-source, 4-step distilled model. Fast generation, consumer-ready.

Aug 2024

FLUX.1 [dev] Aug 2024 Open Weights

Guidance-distilled, 12B parameter Flow Matching model. Better quality than schnell. Non-commercial license.

FLUX.1 paper (arXiv)
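
As a rough illustration of why flow-matching models can sample in very few steps, the sketch below integrates a velocity field from noise to a target with fixed-size Euler steps. It is a toy under stated assumptions: the "model" is an oracle that already knows the target (a real model is a trained network that predicts the velocity), the path is a straight line, and the names `velocity` and `euler_sample` are hypothetical rather than part of any FLUX API.

```python
# Toy rectified-flow sampler. Time runs from t = 0 (noise) to t = 1 (data).

def velocity(x, t, target):
    """Velocity of the straight-line path from x to target: (target - x) / (1 - t)."""
    return [(g - xi) / (1.0 - t) for xi, g in zip(x, target)]

def euler_sample(noise, target, steps):
    """Integrate dx/dt = v(x, t) from t = 0 to t = 1 using `steps` Euler steps."""
    x = list(noise)
    dt = 1.0 / steps
    for i in range(steps):
        t = i * dt
        v = velocity(x, t, target)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x

noise = [0.0, 2.0, -1.0]
target = [1.0, 0.5, 3.0]
one_step = euler_sample(noise, target, steps=1)
four_steps = euler_sample(noise, target, steps=4)
```

Because the path here is perfectly straight, one Euler step lands on the same point as four. Learned velocity fields are only approximately straight, which is one reason distilled variants still use a handful of steps rather than exactly one.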

Aug 2024

FLUX.1 [pro] Aug 2024

API-only commercial model. Highest quality in FLUX.1 family.

Oct 2024

FLUX.1.1 [pro] Oct 2024

Improved version of FLUX.1 pro — 6× faster, better prompt adherence and quality.

Feb 2025

FLUX.2 [dev] Feb 2025 Open Weights

32B parameter model — generation, editing, and multi-reference image combining in one. Open weights for research and non-commercial use. 226k+ downloads on Hugging Face.

FLUX.2 [dev] on Hugging Face

Feb 2025

FLUX.2 [pro] Feb 2025

Commercial API tier. Production-grade generation and editing with 4MP output and multi-reference control.

Mar 2025

FLUX.2 [flex] Mar 2025

Flexible tier between dev and pro — balances quality and cost for scalable production workflows.

Apr 2025

FLUX.2 [max] Apr 2025

Premium tier. Superior facial detail, enhanced text rendering, maximum photorealism.

Jan 2026

FLUX.2 [klein] 4B & 9B Jan 2026 Open Weights

Smallest and fastest FLUX models to date — sub-second inference on capable hardware. Available in 4B and 9B parameter versions. Open weights.

FLUX.2 [klein] on Hugging Face

FLUX Community Fine-tunes

The most important open-source fine-tunes built on the FLUX architecture

Aug 2024

Flux Realism (XLabs AI) Aug 2024 Open Source

One of the first and most downloaded FLUX fine-tunes. Optimized for photorealistic output — improved natural color grading, skin detail, and landscape rendering. Fully open source.

XLabs AI on Hugging Face

Sept 2024

Hyper-FLUX-8Steps (ByteDance) Sept 2024 Open Source

Consistency distillation of FLUX.1 [dev] down to 8 sampling steps with near-identical quality. Makes local FLUX generation significantly faster on consumer hardware.

Hyper-SD on Hugging Face

Sept 2024

PuLID-FLUX (ToTheBeginning) Sept 2024 Open Source

Face identity preservation for FLUX. Generate consistent portraits of a specific person without fine-tuning — just provide a reference photo. Popular for character consistency workflows.

PuLID on GitHub

Oct 2024

Juggernaut Flux Pro (RunDiffusion) Oct 2024 Open Source

Community-favorite portrait fine-tune. Natural skin textures, studio-quality lighting, and sharp hair detail. Widely regarded as the best FLUX fine-tune for commercial portrait work.

Juggernaut on Civitai

Oct 2024

XFlux / FLUX-Controlnet (XLabs AI) Oct 2024 Open Source

ControlNet implementation for FLUX — enables structural guidance via depth maps, canny edges, and pose skeletons. Brings the precise layout control from SD/SDXL to the FLUX architecture.

XLabs-AI/x-flux on GitHub

Nov 2024

FLUX.1-Turbo-Alpha (Alimama/Alibaba) Nov 2024 Open Source

Adversarial distillation of FLUX.1 [dev] to just 8 steps with minimal quality loss. Developed by Alibaba's Alimama team — one of the fastest high-quality FLUX variants.

FLUX-Turbo on Hugging Face

Mar 2025

FLUX 2 LoRA Realism (LoRA Gallery) Mar 2025 Open Source

Realism LoRA for FLUX.2 from the community LoRA Gallery collection. Enhances skin detail, natural lighting, and photographic authenticity on top of the FLUX.2 base model.

Adobe

USA www.adobe.com/products/firefly.html ↗

Adobe Firefly

Commercially safe, trained on licensed content — built into Creative Cloud

May 2023

Firefly 1 May 2023

First commercially licensed AI image generator. Trained on Adobe Stock + Creative Commons. Integrated into Photoshop.

Oct 2023

Firefly 2 Oct 2023

Improved photorealism and detail. Photo settings for lighting and depth of field.

Apr 2024

Firefly 3 Apr 2024

Major quality improvement. Introduced Structure Reference and Style Reference features.

May 2025

Firefly 4 May 2025

Advanced multi-entity generation, improved realism, better Creative Cloud integration.

Meta AI

USA ai.meta.com ↗

Emu

Meta's image generation model — available inside Instagram and WhatsApp

Sept 2023

Emu Sept 2023

Meta's first image generation model. Fine-tuned on curated high-quality data. Integrated into Instagram.

Emu paper (arXiv)

Nov 2023

Emu Edit Nov 2023

Instruction-based image editing model. Precise local and global edits via text commands.

Emu Edit paper (arXiv)

Jan 2024

Emu 2 Jan 2024

A 37B-parameter generative multimodal model with strong few-shot visual generation. Despite the shared name, Emu2 was published by BAAI (Beijing Academy of Artificial Intelligence), not Meta.

Emu 2 paper (arXiv)

ByteDance / BytePlus

China www.byteplus.com ↗

Seedream

Cinematic, enterprise-grade image generation from the makers of TikTok

Sept 2024

Seedream 3.0 Sept 2024

Strong composition and color accuracy. Designed for professional and enterprise workflows.

Mar 2025

Seedream 4.5 Mar 2025

Improved cinematic quality, better text support, enhanced style fidelity.

Kuaishou

China klingai.com ↗

Kolors

Open-source model on an enhanced SDXL-style architecture, with vivid colors and strong portrait quality

Jul 2024

Kolors 1.0 Jul 2024 Open Source

Open-source release. Built on enhanced SDXL. Recognized for vibrant colors and diverse skin tone rendering.

Kolors paper (arXiv)

Kling Omni

Unified image + video generation model

Jun 2024

Kling 1.0 (Image) Jun 2024

Image generation component of Kling. Consistent character generation and style fidelity.

Feb 2025

Kling 3.0 Omni Feb 2025

Unified image and video architecture. Exceptional character consistency across outputs.

Alibaba (Tongyi Lab)

China tongyi.aliyun.com ↗

Wanx (通义万象)

Alibaba's flagship image model with strong cultural diversity

Jan 2024

Wanx 2.0 Jan 2024

High-quality generation with East Asian aesthetic strengths. Available via Alibaba Cloud API.

Sept 2024

Qwen Image (Wanx 2.5) Sept 2024

Integrated into the Qwen multimodal model family. Strong product photography and cultural representations.

Z-Image (造相)

Open-source S3-DiT architecture — unified text and image stream

Jan 2025

Z-Image 1.0 Jan 2025 Open Source

Novel S3-DiT (Scalable Single-Stream Diffusion Transformer). Efficient unified text+image processing.

Tencent

China hunyuan.tencent.com ↗

Hunyuan Image

Cinematic image generation from Tencent's Hunyuan platform

Jan 2024

Hunyuan Image 1 Jan 2024

First public release. Strong in portrait and lifestyle imagery.

Jul 2024

Hunyuan Image 2 Jul 2024

Improved realism and cinematic composition.

Jan 2025

Hunyuan Image 3 Jan 2025

Best-in-class portrait photorealism from Tencent. Warm color grading, studio-quality lighting.

Ideogram

Canada (ex-Google Brain) ideogram.ai ↗

Ideogram

Best-in-class text rendering — go-to for graphic design and typography

Aug 2023

Ideogram 1.0 Aug 2023

Launch. Immediately recognized for best text rendering of any image model at the time.

Nov 2024

Ideogram 2.0 Nov 2024

Significantly improved photorealism while maintaining text advantages. New Style and Color Palette features.

ELO benchmark (Ideogram blog)

Mar 2025

Ideogram 3.0 Mar 2025

Further photorealism improvements. Strongest text-in-image model available. New canvas editing features.

Microsoft

USA designer.microsoft.com ↗

Microsoft Designer / Copilot Image

DALL-E and GPT Image powered — built into Windows, Edge, and Microsoft 365

Oct 2023

Designer (DALL-E 3 powered) Oct 2023

Microsoft Designer launched with DALL-E 3 backend. Integrated into Bing Image Creator and Edge.

Apr 2025

Copilot Image (GPT Image 1) Apr 2025

Upgraded to GPT Image 1 backend. Integrated across Microsoft 365, Windows Copilot, and Designer.

HiDream.ai

China hidream.ai ↗

HiDream I-Series

Fashion and beauty specialist — open-source with commercial options

Mar 2025

HiDream I1 Fast Mar 2025 Open Source

Fast variant. 17B parameter model, 4-step distillation.

Mar 2025

HiDream I1 Dev Mar 2025 Open Weights

Development/research variant. Non-commercial license.

Mar 2025

HiDream I1 Full Mar 2025 Open Source

Full quality model. Exceptional skin tone accuracy, fashion and beauty photography.

HiDream paper (arXiv)

xAI

USA x.ai ↗

Aurora / Grok Image

Elon Musk's AI lab — extremely realistic output with minimal safety filtering

Dec 2024

Aurora (Grok-2 Image) Dec 2024

xAI's first in-house image model, integrated into Grok on X (formerly Twitter). It replaced the FLUX.1 model from Black Forest Labs that Grok had used for image generation until then. Known for photorealistic output, especially of people and social scenes, and for notably less restrictive safety filters.

Jun 2025

Grok-3 Image Jun 2025

Next-generation model. Continues the Aurora lineage with improved photorealism and native Grok-3 multimodal integration.

Recraft

UK recraft.ai ↗

Recraft

The first model to master graphic design — text, logos, and vector art

Oct 2023

Recraft v1 Oct 2023

Early release focused on vector-style and design-oriented generation.

Jun 2024

Recraft v2 Jun 2024

Improved style consistency, introduced SVG output support.

Nov 2024

Recraft v3 Nov 2024

#1 on Hugging Face text-to-image leaderboard at launch. Industry-leading text rendering, logo generation, and vector graphic output. The go-to model for professional graphic designers.

Hugging Face T2I Leaderboard
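
Arena-style leaderboards usually derive their rankings from pairwise human votes using Elo-style ratings. The sketch below shows the classic Elo update as an illustration only: the K-factor of 32, the 400-point scale, and the function names are assumptions, and the actual leaderboard may compute its scores differently.

```python
def expected_score(r_a, r_b):
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400.0))

def update(r_a, r_b, a_won, k=32.0):
    """Return the new (rating_a, rating_b) after one head-to-head vote."""
    e_a = expected_score(r_a, r_b)
    s_a = 1.0 if a_won else 0.0
    new_a = r_a + k * (s_a - e_a)
    new_b = r_b + k * ((1.0 - s_a) - (1.0 - e_a))
    return new_a, new_b

# Two equally rated models; a voter prefers model A's image.
model_a, model_b = update(1000.0, 1000.0, a_won=True)
```

With equal starting ratings the winner gains exactly what the loser gives up, so the total rating across the pool stays constant.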

Leonardo AI (Canva)

Australia leonardo.ai ↗

Leonardo Phoenix / Kino

Cinematic Hollywood-style imagery — one of the world's largest AI art platforms

Feb 2024

Kino XL Feb 2024

Cinematic-style SDXL fine-tune. Strong in dramatic lighting, movie-like scenes, and character consistency.

Sept 2024

Leonardo Phoenix Sept 2024

Leonardo's first proprietary foundation model. Improved text rendering, prompt adherence, and photorealism. Millions of daily generations across the platform.

Jan 2025

Leonardo Phoenix 1.0 Jan 2025

Refined flagship model post-Canva acquisition. Cinematic quality with better anatomy and enhanced style controls.

Apple

USA apple.com/apple-intelligence ↗

Apple Intelligence Image

Deeply integrated into iOS/macOS — the AI model most people encounter daily

Dec 2024

Image Playground (iOS 18.2) Dec 2024

First public Apple image generation feature. Clean, illustrative style — Animation, Illustration, and Sketch modes. Runs fully on-device. Available on iPhone 15 Pro and later.

Dec 2024

Image Wand (iPadOS 18.2) Dec 2024

Sketch-to-image feature in Apple Notes. Turns rough hand-drawn sketches into polished illustrations. On-device, privacy-first.

Dec 2024

Genmoji (iOS 18.2) Dec 2024

AI-generated custom emoji from text descriptions. Integrated into keyboard and Messages.

Sept 2025

Image Playground v2 (iOS 26) Sept 2025

Expanded style options and improved quality, released with iOS 26 (Apple skipped the iOS 19 name). Broader device support.

OpenArt

USA openart.ai ↗

OpenArt Photorealistic

Studio-quality portrait and product photography model

Jun 2024

OpenArt Photorealistic Jun 2024

OpenArt's flagship model trained for maximum photorealism in portrait and product photography. Optimized for studio lighting simulation, natural skin rendering, and commercial-grade output.

Amazon

USA aws.amazon.com/bedrock/titan ↗

Titan Image Generator

Enterprise image generation on Amazon Bedrock with built-in safety

Nov 2023

Titan Image Generator v1 Nov 2023

Amazon's first image generation model on Bedrock. Designed for enterprise workflows — product visualization, marketing assets, content generation at scale. Features built-in watermarking.

Aug 2024

Titan Image Generator v2 Aug 2024

Improved quality, new image conditioning features including background removal and outpainting. Better instruction following.