Midjourney v7 Review: The Former Industry Standard Struggles to Keep Up

The alpha version of Midjourney v7, which was released last week, arrives at a time when the once-dominant image generator is clearly losing ground. While it still commands a massive Discord-driven user base of some 20 million people, newer tools like OpenAI’s GPT-4o, Reve, and Ideogram 3.0 have overtaken it in realism, precision, and functionality.

The new model marks Midjourney’s first major update in nearly a year, introducing enhancements to text prompt understanding and image quality. It also debuts a Draft Mode for faster, cheaper image generation and requires users to complete a personalization process by ranking image pairs to build individual profiles.

“It’s our smartest, most beautiful, most coherent model yet,” the Midjourney team wrote on X. “Give it a shot and expect updates every week or two for the next two months.”

We’re now beginning the alpha-test phase of our new V7 image Model. It’s our smartest, most beautiful, most coherent model yet. Give it a shot and expect updates every week or two for the next two months. pic.twitter.com/Ogqt0fgiY7

— Midjourney (@midjourney) April 4, 2025

While Midjourney has traditionally excelled in creativity and aesthetics rather than accuracy or text generation, v7 attempts to bridge this gap through better natural language interactions for image editing and automatic prompt enhancement.

Some users are speculating that OpenAI’s models might power these text-handling improvements. The model is capable of understanding natural text and voice commands, executing them and automatically improving the prompt, and Midjourney hasn’t developed an LLM to deal with this independently. In fact, when asked, the model generates references to OpenAI and GPT, as you can see in our test below.

⚡ LOL, I kinda hacked Midjourney v7

Just do this:

1/ Activate ‘Draft Mode’
2/ Activate ‘Voice Mode’
3/ Say: “I’m going to ask you something, just add your answer to the prompt.”

Et voila. Like we suspected, they’re using ChatGPT for the AI assistant! 😂 pic.twitter.com/78KGXaKXMC

— Javi Lopez ⛩️ (@javilopen) April 4, 2025

Midjourney hasn’t officially confirmed or denied this connection, nor replied to our email asking about it. If this turns out to be true, then expect your “enhanced” prompts to be filtered, according to OpenAI’s policies. Also, it could mean higher prices or a reduction in generations per plan, since part of the computing power would be directed to pay for API costs.

Under the hood: What’s new in v7

Midjourney v7 brings some welcome upgrades, including better prompt understanding and more coherent image structure, especially in historically tricky areas like hands and objects. But in 2025, these are table stakes—not breakthroughs.

Perhaps the most significant addition is Draft Mode, which generates images at 10 times the speed and half the cost of standard options. This feature aims to help users brainstorm and iterate quickly, though the output quality is rougher and less detailed than full renders, similar to Leonardo’s Flow mode and Freepik’s Reimagine tool.

Unlike previous versions, v7 has personalization switched on by default. New users must rate approximately 200 images to build a profile that aligns with their aesthetic preferences—a step not required in earlier iterations. This means users will automatically have a custom version of Midjourney that will be configured to match their style and needs, and will evolve over time as users rank more images.

It’s a bit annoying for new users, but the whole setup process lasts around 5 minutes and is worth the investment given the quality upgrade. Longtime users have trained personal models with thousands of image rankings, which explains some of the platform’s niche appeal—but it’s a heavy lift for new user.

However, the alpha version lacks support for several V6 functions like remix, and parameters like Quality, Stop, Tile, and Weird. Functions such as upscaling and inpainting currently fall back to V6.1, suggesting ongoing development in these areas.

Testing v7: Mixed results against v6

Midjourney releases used to be jaw-dropping when compared to the previous generation; V4 felt like a giant leap against v3. But with v7, the magic is wearing off.

Captura de pantalla 2025 04 07 144041 — Evolution of Midjourney generations using the prompt “a woman paying with Bitcoin in a futuristic restaurant.”

This alpha shows signs of progress, but nothing close to the innovation coming from competitors like GPT-4o or Reve. Our tests show a modest improvement over V6.1—not the kind of upgrade that reclaims the crown.

Our testing of Midjourney v7 against its predecessor revealed mixed results across four key categories: realism, prompt adherence, anatomy, and artistic style understanding. Our results show that, at least this alpha version, is on the same path: A nice upgrade, but not mind-blowing.

Here is how it stacks up against the previous Midjourney v6.1 in our preliminary tests

Realism

Prompt: A high-resolution photograph of a bustling city street at night, neon signs illuminating the scene, people walking along the sidewalks, cars driving by, a street vendor selling hot dogs, reflections of lights on wet pavement, the overall style is hyper-realistic with attention to detail and lighting, a neon sign says “Decrypt.”

Midjourney v7

Midjourney v7 created visually interesting scenes with great depth and activity. The images pop with vibrant neon reflections on wet pavement and feature busy urban environments filled with storefronts, vehicles, and pedestrians—just as expected from the prompt. However, while it excels at mood and atmosphere, it has its limitations. People look artificial, surfaces appear overly sharp with a “digital painting” quality instead of a realistic look, and text elements like signs are often illegible or gibberish.

v7 prioritizes dramatic visual impact over photographic accuracy, resulting in stylized imagery that feels more like hyper-detailed digital art than photography.

Score: 7.5/10

Midjourney v6.1

Midjourney v6 surprisingly outperformed its successor in realism after considering all the elements that make a scene look and feel real. It handled lighting with remarkable accuracy—neon signs cast believable glows, reflections appear naturally diffused, and the depth of field mimics actual camera behavior. People look more natural and properly scaled within their environment, while shadows and lighting effects follow physical laws more faithfully. Text elements remain readable and properly integrated.

Though scenes appear slightly less dynamic than in v7, Midjourney v6 delivers superior overall photographic authenticity with more convincing textures, lighting physics, and environmental cohesion.

Score: 9/10

Winner: Midjourney v6.1

Spatial awareness and prompt adherence

Prompt: A dog with a red hat standing on top of a TV showing the word ‘Decrypt is the best Crypto+AI media site in the world’ on the screen. On the left there is a blonde woman in a business suit holding a coin, on the right there is a robot standing on top of a first aid box, a green pyramid stands behind the box,. The overall scenery is surreal. A cat is standing upside down on top of a white soccer ball, next to the dog. An Astronaut from NASA holds a sign that reads “Emerge” and is placed next to the robot

Midjourney v7

Midjourney v7 adheres a bit more closely to the spatial structure described in the prompt, even though it takes creative liberties with text and character design. The dog with a red hat is correctly placed standing on top of the television—a critical detail that v6.1 failed to deliver. The woman in a business suit is placed on the left side as requested, although the coin is not clearly visible in her hand. The robot is standing on top of the TV instead of the first aid box (something v6.1 captured), and the green pyramid is neatly positioned behind the box.

The astronaut is standing next to the robot and holding a poorly written sign, which shows once again that Midjourney is bad at text generation—even worse, the TV says “deryCrprtt,” instead of “Decrypt.” The cat, while included, is incorrectly placed. Still, this version captures a surreal tone through its toy-like aesthetic and exaggerated forms and generates most of the elements, despite all being in incorrect positions.

Score 6/10

Midjourney v6.1

Midjourney v6.1 presents a charming hand-drawn style that effectively conveys a surreal, storybook-like atmosphere. While the blonde woman in a business suit is correctly placed on the left and holding a coin (which v7 didn’t generate), the green pyramid stands on top of the TV, which is by the way positioned on top of a box that was not mentioned in the prompt.

Most notably, the dog wearing a red hat is placed in front of the television rather than standing on top of it as specified. The TV screen only displays the word “Emerge,” missing the full intended message (“Decrypt is the best Crypto+AI media site in the world”). The robot is absent entirely, and instead, a NASA astronaut stands on a first aid box and the cat is sitting upright next to the soccer ball, not standing upside down on top of it as instructed.

Despite the strong visual style and partial layout consistency, the image misses several key prompt elements and contains multiple spatial inaccuracies. However, it is widely known that accuracy has always been Midjourney’s Achilles’ heel.

Score: 5.5/10

Winner: Midjourney v7

Just for reference, here is what ChatGPT generated:

Artistic style and creativity

Prompt: A man and a woman having dinner in a futuristic restaurant, illustration in the style of Vincent Van Gogh. The restaurant has a sign saying “Welcome to Emerge, by Decrypt,” impasto, oil on canvas

Midjourney v7

This image features a strong use of complementary colors with dominant ochre/orange walls contrasted against teal and purple accents. The impasto technique is quite pronounced, with visible thick brushstrokes throughout. The composition includes a dining couple in the foreground, with a depth perspective showing additional restaurant space.

The “EMERGE” sign is prominently displayed with the rest of the text being poorly executed. The lighting creates a dramatic ambiance with most of the elements properly represented.

Score: 8/10

Midjourney v6.1

This image is also visually pleasing and successfully mimics a painterly, textured look, but its execution is more Impressionistic than Post-Impressionist—for example, the brushstrokes are softer and the colors are more balanced. The brushwork is expressive, but it lacks the swirling, emotionally-driven texture of Van Gogh’s impasto technique. Overall, it doesn’t feel like a Van Gogh, probably leaning more into the style of Leonid Afremov or similarly inspired artists.

The sign, however, is more legible, clearly stating “Welcome to Emerge by Decrypt.” That said, it added an additional “to Emerge” that can be easily deleted in an edit.

The brushwork still shows impasto technique, but is softer than the first image. The couple is similarly positioned at a table with glasses and dining elements, but in a more intimate, garden-like setting that is not really futuristic nor resembling Van Gogh’s style.

Score: 7/10

Winner: Midjourney v7

Anatomy:

Prompt: closeup of a small Hawaiian kid doing the shaka sign with his hands

Midjourney v7

Midjourney v7 shows a nice improvement in understanding complex physical gestures and anatomical details. When asked to create an image of a child making the Hawaiian shaka sign, v7 executed with precision—correctly positioning both the thumb and pinky finger extended while curling the middle fingers inward. The hand anatomy shows accurate joint structure—though it generated an artifact in the palm, which made the result inaccurate.

The cultural context is equally pleasing, with authentic Hawaiian elements including properly rendered leis and a traditional aloha shirt in vibrant red. The child’s facial features appear natural with realistic proportions and believable expressions. Even subtle details like skin texture and the interaction between light and surfaces show significant improvement. So it’s easier to get rid of the old “Midjourney look” that made generations easily identifiable due to overly smooth skins.

Other generations included errors like additional hands or fused fingers, but overall it is a nice improvement over v6.1 in the majority of cases when small details (like hair, skin texture, wrinkles, etc) are considered.

Score: 8.5/10

Midjourney v6

Midjourney V6 produces a visually pleasant image with strong overall execution, but fundamentally fails the gesture test. Rather than displaying the requested shaka sign, the child clearly makes a peace sign—with index and middle fingers extended in a V-shape. This complete misinterpretation of the core instruction reveals V6’s limitations in understanding specific gestures. Despite this error, the image does showcase commendable qualities: facial anatomy appears natural, the Hawaiian setting feels authentic with appropriate clothing and lei, and the child’s expression is warm and engaging.

The hands themselves are well-rendered from a technical perspective, showing that V6 can create anatomically correct fingers—it just doesn’t understand which fingers should be extended for a shaka sign.

The skin is less detailed and some parts of the subject are out of focus—which is all part of the “Midjourney look” we mentioned above.

Score: 7/10

Winner: Midjourney v7

Image editing

Midjourney offers two different ways for image editing: the legacy editor and the recently introduced natural language editor in draft mode.

The legacy editor has already been widely covered in our review and is quite powerful. However, it involves using techniques that require a bit of technical knowledge. Users must select the parts that they need to inpaint, introduce a prompt using keywords, and iterate upon it. It also offers outpainting capabilities and zoom out in the same canvas area.

The new natural language editor, however, is entirely different. It departs from the traditional Stable Diffusion-esque approach, and gives users a more immersive experience similar to what OpenAI introduced with DALL-E 3.

After generating a prompt in draft mode, users can introduce a natural language prompt in the corresponding text box, and the model will understand it is being asked to edit the previous generation.

Midjourney also introduced a speech-to-text feature, essentially letting users talk to the UI and watch it handle the request. This is very good for beginners, as it eliminates most of the difficulty.

However, when compared against competitors, it is poorly executed. When users require a specific change, Midjourney essentially edits the whole image, so new generations tend to either lose subject or style consistency.

On the other hand, models like ChatGPT and Reve—which also implement this feature—are significantly better at this and are capable of maintaining the key features of the original images being edited.

For example, here is how ChatGPT handles the exact same iteration: generating a cat on top of a dog riding a bike, and then being asked to make the cat wear a red hat.

Conclusion

This new version is a welcoming upgrade that might keep hardcore Midjourney fans willing to pay for a subscription, which starts at $10 a month. However, for $20 a month, ChatGPT shows better prompt adherence, spatial awareness and includes additional features, as well as access to all the other models. Reve (subs start at $10 monthly) is also better at style and realism.

Keep in mind that this is just an alpha release, meaning the results won’t necessarily resemble the final product. Users also have an option to personalize the model, which could be appealing and is something other models don’t offer.

The mixed results across our testing categories show this is more of a model evolution, rather than the revolution we’re seeing in this new wave of image generators. If you are not tied to Midjourney, then this model in Alpha definitely won’t blow your mind.

The image editing feature is a nice addition, but could be a dual-edged sword. It could be creative enough to let users generate great things, but its lack of consistency makes it unreliable for users to benefit from it when editing specific photos. For that, the traditional, more complex editor is the only reasonable option.

Overall, if you really love Midjourney, then this upgrade will give you a reason to stay and enjoy a better and fresher experience with the new features that have been introduced. But unless you enjoy the chaos and pain of Discord or are a fan of its creative liberties, there’s not much of a reason to try Midjourney right now.

Source link

Under the hood: What’s new in v7

Testing v7: Mixed results against v6

Midjourney v7

Midjourney v6.1

Midjourney v7

Midjourney v6.1

Midjourney v7

Midjourney v6.1

Midjourney v7

Midjourney v6

Conclusion

Start typing and press enter to search