ChatGPT and Midjourney are two of the best AI image generators available. Both can take a text prompt and generate a matching image, no matter how weird or wild your request. While ChatGPT now uses the GPT-4o model instead of DALL·E 3, the DALL·E name is still so strong that I suspect lots of people haven't even realized the change. So if this is all news to you, don't worry—things have only changed for the better.
I've been testing both of these image generators, both professionally and personally, since they were released, and there's a lot to unpack. So let's dive in.
Table of contents:
How do ChatGPT and Midjourney work?
While DALL·E 3 and Midjourney were trained in similar ways, GPT-4o, which now powers ChatGPT, was trained a little differently.
Basically, Midjourney was trained on millions or billions of text-image pairs, which allows it to comprehend concepts like dogs, deerstalker hats, and dark moody lighting. GPT-4o is a full multimodal model that was also trained on text, audio, video, and more. It's a much bigger model, so it has a far deeper understanding of the world. Both models can parse what a prompt like "an impressionist oil painting of a Canadian man riding a moose through a forest of maple trees" is asking them to create, but ChatGPT is able to understand things with a lot more nuance.
When it comes to actually generating images, Midjourney uses a process called diffusion. It starts with a random field of noise and then, over a number of steps, edits it to better match its interpretation of your prompt. This is why you can get different results every time, even if you try the same prompt a second time: the randomness of the starting seed can totally change the end result. The process is kind of like looking up at a cloudy sky, finding a cloud that looks kind of like a dog, and then being able to snap your fingers to keep making it more and more dog-like. While there's more to it than that, it's not a bad way to think of things.

DALL·E 3 was a diffusion model, but GPT-4o reportedly also uses a process called visual autoregressive modeling. Instead of starting with a blank field of noise, it’s essentially able to come up with a rough draft and then improve things from there. Combined with its understanding of language, this makes ChatGPT incredibly powerful.

In addition to the differences in models, there's a lot more that affects the final results. How each model interprets your prompt, the weight it puts on the various parameters, what other features you use, and the philosophies of the companies responsible for developing it all massively affect what the output will look like.
Here's a take on "an impressionist oil painting of a Canadian man riding a moose through a forest of maple trees" from ChatGPT.

And here's one from Midjourney.

I think they both look great, so the good news is both ChatGPT and Midjourney are solid options—though there are still a lot of differences to tease out.
ChatGPT vs. Midjourney at a glance
ChatGPT image generator and Midjourney both do similar things, but there are some big differences. Here's a short summary of the major distinctions, but read on for a more detailed breakdown.
| ChatGPT | Midjourney |
---|---|---|
Quality | ⭐⭐⭐⭐⭐ Exceptional AI-generated images, with the best understanding of your prompts | ⭐⭐⭐⭐ Solid AI-generated images that can take a bit of work to get right |
Ease of use | ⭐⭐⭐⭐⭐ Collaborate with a chatbot | ⭐⭐⭐⭐ Surprisingly easy-to-use web app, though there's a learning curve to using all the advanced features |
Power and control | ⭐⭐⭐⭐⭐ ChatGPT makes editing simple | ⭐⭐⭐⭐ Best-in-class customization, if you can figure out all the quirks |
ChatGPT is easier to use
GPT-4o is available through ChatGPT. This makes it a doddle to use: you just ask ChatGPT to create whatever image you can imagine.

If you want to change anything, you just ask ChatGPT to make an adjustment.

Because of GPT-4o's deep understanding of the real world and ChatGPT's chat interface, you don't even have to be particularly careful with your prompts. You can just tell ChatGPT what you want and it will get it right most of the time.

The two downsides are that GPT-4o is very slow to generate images, and that it only generates one at a time. So, while the results are great and it's easy to use, iterating on an idea can take a lot of time.

Free ChatGPT users have limited access, while ChatGPT Plus subscribers may be subject to some rate limits, though in my testing, I could use it as much as I wanted to. It remains to be seen if it will roll out to the Bing Image Creator, Microsoft Paint, and other services that used the DALL·E 3 API.
Midjourney works much the same way, though it isn't as conversational and still has a few quirks. Originally, the only way you could use it was through Discord, a team chat app, but you can now also log in to a web app with a Google account. Midjourney occasionally offers free trials, but for the most part, you need to sign up for a paid plan to use it. (We'll look at pricing properly a little later on.)
Once you're in, you just enter what you want Midjourney to create in the Imagine bar, and it will generate four options. Simple—but editing requires diving deeper into Midjourney's options—which is where things get complicated.

Midjourney gives you more customization options
ChatGPT has three big features that allow you to control its results:
You can go back and forth with ChatGPT in the chat interface.
You can upload an image and use it as the base for something.
You can select an area of an image and replace it with AI-generated content.
All three of these work exceptionally well, but it's still a far more limited and less customizable feature set than Midjourney.
For example, to use the latest version of Midjourney's algorithm, you have to click through 200 pairs of images and select your favorite to fine-tune it to your preferences.

This means that by default, Midjourney is going to generate images tailored to your preferences. On top of that, here is an abbreviated list of the customization options available in Midjourney. You can:
Control how strongly Midjourney applies its default style, how weird it's allowed to be, and how much variety there is among your images.
Select from different versions of the Midjourney model.
Use images as the basis of a prompt, as a style reference, as a character reference, and as an "omni-reference," which means it tries to include whatever is in the photo.
Take any generated image and create iterative variations with or without tweaking the prompt.
Expand any generated image in any direction, change its aspect ratio, or zoom out.
Create a personalized style by ranking images so Midjourney knows what you like.
And honestly, this is barely scratching the surface. Dig into Midjourney's help docs, and you'll find loads more ways you can prompt, tweak, combine, and otherwise get creative with your images. The downside of this is that using these features is necessary to get the most out of Midjourney.
GPT-4o is now the better model
While both GPT-4o and Midjourney are capable of creating incredible images, GPT-4o is the better model for a couple of reasons.
For starters, GPT-4o can render accurate text, while Midjourney can't reliably do it.


GPT-4o also has a far better understanding of numbers, can keep track of multiple objects, and can position things in space. And especially for one-off prompts, it's more likely to get you the results you want.


The catch is that if you use Midjourney to its fullest and figure out its quirks, you'll be able to reliably create AI images that better fit your vision. With ChatGPT, you get awesome results, but you can't really shape them. With Midjourney, you have to do more work, but you have a lot more control.
Pricing depends on your needs
ChatGPT's pricing is super simple: it costs $20/month as part of ChatGPT Plus for nearly unlimited use (though too many requests in quick succession will probably get you cut off) or you get limited use on the free plan.
Midjourney has no free option, but the Basic Plan starts at $10/month and entitles you to 200 minutes of GPU time. Which, of course, is where things get complicated. Midjourney says that's good for roughly 200 generations a month, but it totally depends on what you're getting it to do. If you create lots of variations and upscale them all to the maximum amount, you'll burn through those GPU hours faster than if you create lots of low-res images.

And to make things more complicated, starting with the $30/month Standard plan, you get more fast GPU hours, but you can generate unlimited images in Relax mode—which only runs when there's free GPU power.
Given the massive differences between the two apps, I'm incredibly reluctant to make any judgments based on price. The free ChatGPT plan is obviously the cheapest way to start generating images, but if you want to create a lot of images, the $10/month Midjourney plan is a good balance of features and price. And then, for $20/month, you can get ChatGPT Plus—which also has all of ChatGPT's other generative AI features.
Commercial use is complicated
If you're planning to use ChatGPT or Midjourney for commercial use, things get a bit complicated. Both models allow commercial use, but the full legal implications haven't really been explored.
In a ruling in February 2023, the U.S. Copyright Office decided that images created by Midjourney, and by extension, other generative AIs, can't be copyrighted. They've reaffirmed this stance, and the courts have sided with their interpretation. This means you have limited protections if someone takes your images and uses them in ways you don't want them to. Technically, using someone else's image goes against Midjourney's terms of service, but that's not exactly a very strong legal shield if you're trying to build a brand, design a logo, or create character designs using the app. The worst that Midjourney is likely to be able to do is ban whoever takes your images.
From a technical standpoint, I'd probably recommend Midjourney if you want to somehow monetize your AI creations, simply because its model gives you more freedom and you can use things like character references and style references to generate images that look a bit more consistent.
Midjourney is still a bit weird
Midjourney used to be super transparent about how weird it was—you had to sign up through Discord, after all. Now that it's got a more sensible seeming website, it feels a bit more mainstream, though there is still an undercurrent of oddities.
For example, Midjourney is also a community. The Discord still exists, and you can see a reskinned version through the Chat section of the web app. And, unless you're on the $60/month Pro plan and activate Stealth Mode, all your images are automatically published to Midjourney's member gallery, where anyone can see them, download them, and copy your prompts. (This is where all the cool images you see as soon as you log into Midjourney come from.)
You can automate ChatGPT
ChaGPT connects to Zapier, which means you can use it directly from the apps you spend the most time in. For example, you can create images based on chat messages, database records, form responses, spreadsheet entries, or anything else—and send the image through to any other app you want. Learn more about how to automate AI image generation with Zapier, or get started with one of these pre-made workflows.
More details
Zapier is a no-code automation tool that lets you connect your apps into automated workflows, so that every person and every business can move forward at growth speed. Learn more about how it works.
ChatGPT vs. Midjourney: Which should you use?
The choice between ChatGPT and Midjourney should be relatively straightforward for most people:
If you want the best and easiest-to-use AI image generator currently available, go with ChatGPT. While it doesn't have as many customization options, you can still do a huge amount—and there's no learning curve.
If you want a powerful option that allows you to create and customize your images, then Midjourney is the better choice. While it's harder to get the best from, its wide range of options and control are why it's still so popular with AI artists.
Alternatively, you could also check out some of the other art generators—there are plenty to choose from.
Related reading:
This article was originally published in December 2023. The most recent update was in May 2025.