Text to Image AI Model & Provider Leaderboard

Analysis and comparison of Text to Image generation models & API providers. Artificial Analysis has analyzed text to image models and hosting providers across quality, generation time, and price. For further details, see our methodology page.

Text to image models & providers compared: Playground v2.5, Stable Diffusion 3 Medium, Stable Diffusion XL 1.0, SDXL Lightning, Amazon Titan G1 (Standard), DALLE 2, DALLE 3 HD, DALLE 3, Midjourney v6, Midjourney v6.1, Amazon Titan G1 v2 (Standard), Playground v3 (beta), Ideogram v2, FLUX.1 [schnell], Stable Diffusion 1.5, Ideogram v2 Turbo, Ideogram v1, FLUX1.1 [pro], FLUX.1 [pro], and FLUX.1 [dev].

Highlights

Quality ELO
Quality ELO: ELO score of model in Image Arena, Higher is better
Generation Time
Generation time: Seconds to generate 1 image, Lower is better
Price
Price: USD per 1000 image generations, Lower is better

Summary Analysis

Quality vs. Price

Quality ELO: Relative ELO score of the models as determined by >100,000 responses from users in Artificial Analysis' Image Arena. Some models may not be shown due to not yet having enough votes.
Price: Price per 1k images generated by the model. For detail on how we calculate price per image for providers price based on inference time or steps, see our methodology page.
Note on Midjourney: Midjourney does not have an API. We benchmark Midjourney as an API using ImagineAPI which serves as a tool to access the Midjourney Discord.

Quality vs. Generation Time

Quality ELO: Relative ELO score of the models as determined by >100,000 responses from users in Artificial Analysis' Image Arena. Some models may not be shown due to not yet having enough votes.
Generation Time: Median time the provider takes to generate an image over the past 14 days of measurements. This includes downloading the image from the provider where a URL is provided rather than an image response.
Note on Midjourney: Midjourney does not have an API. We benchmark Midjourney as an API using ImagineAPI which serves as a tool to access the Midjourney Discord.

Generation Time vs. Price

Generation Time: Median time the provider takes to generate an image over the past 14 days of measurements. This includes downloading the image from the provider where a URL is provided rather than an image response.
Price: Price per 1k images generated by the model. For detail on how we calculate price per image for providers price based on inference time or steps, see our methodology page.
Note on Midjourney: Midjourney does not have an API. We benchmark Midjourney as an API using ImagineAPI which serves as a tool to access the Midjourney Discord.

Quality ELO (Image Arena)

Quality ELO: ELO score of model in Image Arena, Higher is better
Quality ELO: Relative ELO score of the models as determined by >100,000 responses from users in Artificial Analysis' Image Arena. Some models may not be shown due to not yet having enough votes.

Arena Win Rate

Arena Win Rate: % Win rate in Image Arena, Higher is better
Win Rate: Proportion of time an image generated by the model was selected as preferred compared to the other image present in Artificial Analysis' Image Arena.

 Participate in the Image Arena to contribute to the crowdsourced quality evaluations

Generation Time

Generation Time

Generation time: Seconds to generate 1 image, Lower is better
Generation Time: Median time the provider takes to generate an image over the past 14 days of measurements. This includes downloading the image from the provider where a URL is provided rather than an image response.
Note on Midjourney: Midjourney does not have an API. We benchmark Midjourney as an API using ImagineAPI which serves as a tool to access the Midjourney Discord.

Generation Time, Variance

Generation time: Seconds to generate 1 image, Results by percentile, Lower is better
Median, Other points represent 5th, 25th, 75th, 95th Percentiles respectively
Generation Time: Median time the provider takes to generate an image over the past 14 days of measurements. This includes downloading the image from the provider where a URL is provided rather than an image response.
Boxplot: Shows variance of measurements
Picture of the author

Generation Time, Over Time

Generation time: Seconds to generate 1 image, Lower is better
Generation Time: Median time the provider takes to generate an image over the past 14 days of measurements. This includes downloading the image from the provider where a URL is provided rather than an image response.
Over time measurement: Median measurement per day, based on 4 measurements each day at different times. Labels represent start of week's measurements.

Price

Price: USD per 1000 image generations, Lower is better
Price: Price per 1k images generated by the model. For detail on how we calculate price per image for providers price based on inference time or steps, see our methodology page.
Summary of key metrics & further information
ProviderFurther
Details
FLUX1.1 [pro] logoBlack Forest Labs
FLUX1.1 [pro] logofal.ai
FLUX1.1 [pro] logoTogether.ai
FLUX.1 [pro] logoTogether.ai
FLUX.1 [pro] logofal.ai
Ideogram v2 logoIdeogram
Midjourney v6.1 logoMidjourney
FLUX.1 [dev] logofal.ai
Ideogram v2 Turbo logoIdeogram
Midjourney v6 logoMidjourney
Ideogram v1 logoIdeogram
FLUX.1 [schnell] logoOctoAI
FLUX.1 [schnell] logoTogether.ai
FLUX.1 [schnell] logofal.ai
Playground v3 (beta) logoPlayground AI
Playground v2.5 logofal.ai
Playground v2.5 logoFireworks
Playground v2.5 logoPlayground AI
DALLE 3 HD logoOpenAI
DALLE 3 logoOpenAI
Stable Diffusion 3 Medium logofal.ai
Stable Diffusion 3 Medium logoOctoAI
Amazon Titan G1 v2 (Standard) logoAmazon Bedrock
Amazon Titan G1 (Standard) logoAmazon Bedrock
SDXL Lightning logofal.ai
SDXL Lightning logoOctoAI
Stable Diffusion XL 1.0 logofal.ai
Stable Diffusion XL 1.0 logoAmazon Bedrock
Stable Diffusion XL 1.0 logoLepton AI
Stable Diffusion XL 1.0 logoFireworks
Stable Diffusion XL 1.0 logoTogether.ai
Stable Diffusion XL 1.0 logoOctoAI
DALLE 2 logoOpenAI
Stable Diffusion 1.5 logoOctoAI