Skip to content

App comparisons

9 min read

Midjourney vs. ChatGPT (formerly DALL·E 3): Which image generator is better? [2025]

By Harry Guinness · May 22, 2025
Hero image with the Midjourney and OpenAI logos

ChatGPT and Midjourney are two of the best AI image generators available. Both can take a text prompt and generate a matching image, no matter how weird or wild your request. While ChatGPT now uses the GPT-4o model instead of DALL·E 3, the DALL·E name is still so strong that I suspect lots of people haven't even realized the change. So if this is all news to you, don't worry—things have only changed for the better.

I've been testing both of these image generators, both professionally and personally, since they were released, and there's a lot to unpack. So let's dive in.

Table of contents:

How do ChatGPT and Midjourney work?

While DALL·E 3 and Midjourney were trained in similar ways, GPT-4o, which now powers ChatGPT, was trained a little differently.

Basically, Midjourney was trained on millions or billions of text-image pairs, which allows it to comprehend concepts like dogs, deerstalker hats, and dark moody lighting. GPT-4o is a full multimodal model that was also trained on text, audio, video, and more. It's a much bigger model, so it has a far deeper understanding of the world. Both models can parse what a prompt like "an impressionist oil painting of a Canadian man riding a moose through a forest of maple trees" is asking them to create, but ChatGPT is able to understand things with a lot more nuance.

When it comes to actually generating images, Midjourney uses a process called diffusion. It starts with a random field of noise and then, over a number of steps, edits it to better match its interpretation of your prompt. This is why you can get different results every time, even if you try the same prompt a second time: the randomness of the starting seed can totally change the end result. The process is kind of like looking up at a cloudy sky, finding a cloud that looks kind of like a dog, and then being able to snap your fingers to keep making it more and more dog-like. While there's more to it than that, it's not a bad way to think of things.

Four images of a dog in a cloud looking more and more like a dog each time

DALL·E 3 was a diffusion model, but GPT-4o reportedly also uses a process called visual autoregressive modeling. Instead of starting with a blank field of noise, it’s essentially able to come up with a rough draft and then improve things from there. Combined with its understanding of language, this makes ChatGPT incredibly powerful.

ChatGPT creating four clouds become more and more dog-like

In addition to the differences in models, there's a lot more that affects the final results. How each model interprets your prompt, the weight it puts on the various parameters, what other features you use, and the philosophies of the companies responsible for developing it all massively affect what the output will look like. 

Here's a take on "an impressionist oil painting of a Canadian man riding a moose through a forest of maple trees" from ChatGPT.

An image created by DALL-E using the prompt above

And here's one from Midjourney.

An image created by Midjourney based on the prompt above

I think they both look great, so the good news is both ChatGPT and Midjourney are solid options—though there are still a lot of differences to tease out.

ChatGPT vs. Midjourney at a glance

ChatGPT image generator and Midjourney both do similar things, but there are some big differences. Here's a short summary of the major distinctions, but read on for a more detailed breakdown. 

ChatGPT

Midjourney

Quality

⭐⭐⭐⭐⭐ Exceptional AI-generated images, with the best understanding of your prompts

⭐⭐⭐⭐ Solid AI-generated images that can take a bit of work to get right

Ease of use

⭐⭐⭐⭐⭐ Collaborate with a chatbot

⭐⭐⭐⭐ Surprisingly easy-to-use web app, though there's a learning curve to using all the advanced features

Power and control

⭐⭐⭐⭐⭐ ChatGPT makes editing simple

⭐⭐⭐⭐ Best-in-class customization, if you can figure out all the quirks

ChatGPT is easier to use

GPT-4o is available through ChatGPT. This makes it a doddle to use: you just ask ChatGPT to create whatever image you can imagine.

Using the image generator in ChatGPT

If you want to change anything, you just ask ChatGPT to make an adjustment.

Asking ChatGPT to adjust an image

Because of GPT-4o's deep understanding of the real world and ChatGPT's chat interface, you don't even have to be particularly careful with your prompts. You can just tell ChatGPT what you want and it will get it right most of the time.

ChatGPT editing an image further

The two downsides are that GPT-4o is very slow to generate images, and that it only generates one at a time. So, while the results are great and it's easy to use, iterating on an idea can take a lot of time.

Editing the image again in ChatGPT

Free ChatGPT users have limited access, while ChatGPT Plus subscribers may be subject to some rate limits, though in my testing, I could use it as much as I wanted to. It remains to be seen if it will roll out to the Bing Image Creator, Microsoft Paint, and other services that used the DALL·E 3 API.

Midjourney works much the same way, though it isn't as conversational and still has a few quirks. Originally, the only way you could use it was through Discord, a team chat app, but you can now also log in to a web app with a Google account. Midjourney occasionally offers free trials, but for the most part, you need to sign up for a paid plan to use it. (We'll look at pricing properly a little later on.)

Once you're in, you just enter what you want Midjourney to create in the Imagine bar, and it will generate four options. Simple—but editing requires diving deeper into Midjourney's options—which is where things get complicated.

The Imagine bar in Midjourney

Midjourney gives you more customization options

ChatGPT has three big features that allow you to control its results:

  • You can go back and forth with ChatGPT in the chat interface.

  • You can upload an image and use it as the base for something.

    Uploading an image to create a new image in ChatGPT
  • You can select an area of an image and replace it with AI-generated content.

    Editing a specific part of an image in ChatGPT
    The edited image (the dog's chain collar is replaced with a collar with its name)

All three of these work exceptionally well, but it's still a far more limited and less customizable feature set than Midjourney.

For example, to use the latest version of Midjourney's algorithm, you have to click through 200 pairs of images and select your favorite to fine-tune it to your preferences.

Fine-tuning an image in Midjourney

This means that by default, Midjourney is going to generate images tailored to your preferences. On top of that, here is an abbreviated list of the customization options available in Midjourney. You can:

  • Control how strongly Midjourney applies its default style, how weird it's allowed to be, and how much variety there is among your images.

    Controlling the styling in Midjourney

  • Select from different versions of the Midjourney model.

  • Use images as the basis of a prompt, as a style reference, as a character reference, and as an "omni-reference," which means it tries to include whatever is in the photo.

  • Take any generated image and create iterative variations with or without tweaking the prompt.

    More control options in Midjourney
  • Expand any generated image in any direction, change its aspect ratio, or zoom out. 

  • Create a personalized style by ranking images so Midjourney knows what you like. 

And honestly, this is barely scratching the surface. Dig into Midjourney's help docs, and you'll find loads more ways you can prompt, tweak, combine, and otherwise get creative with your images. The downside of this is that using these features is necessary to get the most out of Midjourney.

GPT-4o is now the better model

While both GPT-4o and Midjourney are capable of creating incredible images, GPT-4o is the better model for a couple of reasons.

For starters, GPT-4o can render accurate text, while Midjourney can't reliably do it.

ChatGPT rendering text
ChatGPT
Midjourney attempting to render text
Midjourney

GPT-4o also has a far better understanding of numbers, can keep track of multiple objects, and can position things in space. And especially for one-off prompts, it's more likely to get you the results you want.

ChatGPT rendering numbers and position correctly
ChatGPT
Midjourney having trouble rendering numbers and positions correctly
Midjourney

The catch is that if you use Midjourney to its fullest and figure out its quirks, you'll be able to reliably create AI images that better fit your vision. With ChatGPT, you get awesome results, but you can't really shape them. With Midjourney, you have to do more work, but you have a lot more control.

Pricing depends on your needs

ChatGPT's pricing is super simple: it costs $20/month as part of ChatGPT Plus for nearly unlimited use (though too many requests in quick succession will probably get you cut off) or you get limited use on the free plan.

Midjourney has no free option, but the Basic Plan starts at $10/month and entitles you to 200 minutes of GPU time. Which, of course, is where things get complicated. Midjourney says that's good for roughly 200 generations a month, but it totally depends on what you're getting it to do. If you create lots of variations and upscale them all to the maximum amount, you'll burn through those GPU hours faster than if you create lots of low-res images. 

Midjourney's pricing page

And to make things more complicated, starting with the $30/month Standard plan, you get more fast GPU hours, but you can generate unlimited images in Relax mode—which only runs when there's free GPU power.

Given the massive differences between the two apps, I'm incredibly reluctant to make any judgments based on price. The free ChatGPT plan is obviously the cheapest way to start generating images, but if you want to create a lot of images, the $10/month Midjourney plan is a good balance of features and price. And then, for $20/month, you can get ChatGPT Plus—which also has all of ChatGPT's other generative AI features.

Commercial use is complicated

If you're planning to use ChatGPT or Midjourney for commercial use, things get a bit complicated. Both models allow commercial use, but the full legal implications haven't really been explored. 

In a ruling in February 2023, the U.S. Copyright Office decided that images created by Midjourney, and by extension, other generative AIs, can't be copyrighted. They've reaffirmed this stance, and the courts have sided with their interpretation. This means you have limited protections if someone takes your images and uses them in ways you don't want them to. Technically, using someone else's image goes against Midjourney's terms of service, but that's not exactly a very strong legal shield if you're trying to build a brand, design a logo, or create character designs using the app. The worst that Midjourney is likely to be able to do is ban whoever takes your images.

From a technical standpoint, I'd probably recommend Midjourney if you want to somehow monetize your AI creations, simply because its model gives you more freedom and you can use things like character references and style references to generate images that look a bit more consistent.

Midjourney is still a bit weird

Midjourney used to be super transparent about how weird it was—you had to sign up through Discord, after all. Now that it's got a more sensible seeming website, it feels a bit more mainstream, though there is still an undercurrent of oddities.

For example, Midjourney is also a community. The Discord still exists, and you can see a reskinned version through the Chat section of the web app. And, unless you're on the $60/month Pro plan and activate Stealth Mode, all your images are automatically published to Midjourney's member gallery, where anyone can see them, download them, and copy your prompts. (This is where all the cool images you see as soon as you log into Midjourney come from.)

You can automate ChatGPT

ChaGPT connects to Zapier, which means you can use it directly from the apps you spend the most time in. For example, you can create images based on chat messages, database records, form responses, spreadsheet entries, or anything else—and send the image through to any other app you want. Learn more about how to automate AI image generation with Zapier, or get started with one of these pre-made workflows.

Create images with DALL.E based on Slack messages and send as a channel message

  • Slack logo
  • OpenAI (GPT-4, DALL-E, Whisper) logo
  • Slack logo
Slack + OpenAI (GPT-4, DALL-E, Whisper)
More details
    Activate this Zap to automatically create an image with DALL.E based on new Slack messages. The image it generates will then be sent as a channel message in Slack to effortlessly enhance your team communication.

    Generate images based on Google Form responses and save them in Google Sheets

    • Google Forms logo
    • OpenAI (GPT-4, DALL-E, Whisper) logo
    • Google Sheets logo
    Google Forms + OpenAI (GPT-4, DALL-E, Whisper) + Google Sheets
    More details
      Want to generate DALL.E images based on the content from Google Forms responses? Set up this Zap and seamlessly create any image that's represented inside a form submission. Once the image has been generated, it will automatically be added to a spreadsheet row in Google Sheets so you can easily find it.

      Generate OpenAI (DALL.E) images from new Airtable records and send as an email

      • Airtable logo
      • OpenAI (GPT-4, DALL-E, Whisper) logo
      • Gmail logo
      Airtable + OpenAI (GPT-4, DALL-E, Whisper) + Gmail
      More details
        Automatically generate unique and creative images using DALL.E based on the content from new Airtable records, and then send those images via Gmail to your desired recipients.

        Zapier is a no-code automation tool that lets you connect your apps into automated workflows, so that every person and every business can move forward at growth speed. Learn more about how it works.

        ChatGPT vs. Midjourney: Which should you use?

        The choice between ChatGPT and Midjourney should be relatively straightforward for most people:

        • If you want the best and easiest-to-use AI image generator currently available, go with ChatGPT. While it doesn't have as many customization options, you can still do a huge amount—and there's no learning curve. 

        • If you want a powerful option that allows you to create and customize your images, then Midjourney is the better choice. While it's harder to get the best from, its wide range of options and control are why it's still so popular with AI artists.

        Alternatively, you could also check out some of the other art generators—there are plenty to choose from.

        Related reading:

        This article was originally published in December 2023. The most recent update was in May 2025.

        Get productivity tips delivered straight to your inbox

        We’ll email you 1-3 times per week—and never share your information.

        Related articles

        Improve your productivity automatically. Use Zapier to get your apps working together.

        A Zap with the trigger 'When I get a new lead from Facebook,' and the action 'Notify my team in Slack'