Midjourney, the bootstrapped startup viewed by many AI power users as the “gold standard” of AI image generation since its launch in 2022, has now released the much-anticipated, most advanced version of its generator model, Midjourney v7.
The headline feature is a new way to prompt the model to create images.
Previously, users were limited to typing in text prompts and attaching other images to help guide generations (the model could incorporate a variety of user-uploaded and attached images, including other Midjourney generations, to influence the style and subjects of new generations).
Now, the user can simply speak aloud to Midjourney’s alpha website (alpha.midjourney.com), provided they have a microphone in, on, or attached to their computer (or are using a networked device with audio input, such as headphones or a smartphone), and the model will listen and conjure up its own text prompts based on the user’s spoken descriptions, generating images from them.
It’s unclear whether Midjourney created a new voice input (speech-to-text) model from scratch or is using a fine-tuned or out-of-the-box version of one from another provider such as ElevenLabs or OpenAI. I asked Midjourney founder David Holz on X, but he has yet to answer.
Using Draft Mode and conversational Voice Input to prompt in a flow state
Going hand-in-hand with this input method is a new “Draft Mode” that generates images more rapidly than Midjourney v6.1, the immediately preceding version, often in less than a minute or even 30 seconds in some cases.
While the images are initially of lower quality than v6.1, the user can click on the “enhance” or “vary” buttons located to the right of each generation to re-render the draft at full quality.

The idea is that the user will be happy to use both together (in fact, “Draft Mode” must be turned on to activate audio input) to enter a more seamless flow state of creative drafting with the model. That means spending less time refining the exact language of prompts and more time seeing new generations, reacting to them in real time, and adjusting them as needed, more naturally and rapidly, simply by speaking thoughts aloud to the model.
“Make this look more detailed, darker, lighter, more realistic, more kinetic, more vibrant,” etc. are some of the instructions the user could provide through the new audio interface in response to generations to produce new, adjusted ones that better match their creative vision.
Getting started with Midjourney v7
To enter these modes, starting with the new “Draft” feature, the user must first jump through one new hurdle: Midjourney’s personalization feature.
While this feature was introduced previously on Midjourney v6 back in June 2024, it was optional, allowing the user to create a personal “style” that could be applied to all generations going forward by rating 200 pairs of images (selecting which one the user liked best) through the Midjourney website. The user could then toggle on a style that matched the images they liked best during the pairwise rating process.
Now, Midjourney v7 requires users to generate a new v7-specific personalized style before using the model at all.

Once the user does that, they’ll land on the familiar Midjourney Alpha website dashboard, where they can click “Create” from the left side rail to open the creation tab.

Then, in the prompt entry bar at the top, the user can click on the new “P” button to the right of the bar to activate their personalization mode.

Midjourney founder and leader David Holz confirmed to VentureBeat on X that older personalization styles from v6 can be selected, but not the separate “moodboards” (styles made up of user-uploaded image collections), though Midjourney’s X account separately stated that feature will be returning soon as well. However, I didn’t see the option to select my older v6 style.
Nonetheless, the user can then click on the new “Draft Mode” button to the right of the Personalization button (also further to the right of the text prompt entry box) to activate this faster image generation mode.

Once that’s been selected with the cursor, it will turn orange to indicate it’s turned on, and then a new button with a microphone icon should appear to the right of this one. This is the voice prompting mode, which the user can once again click on to activate.

Once the user has pressed this microphone button to enter the voice prompting mode, they should see the microphone icon change from white to orange to indicate it’s engaged, and a waveform line will appear to the right of it that should begin undulating in time with the user’s speech.


The model will then be able to hear you and should also hear when you finish speaking. In practice, I sometimes got an error message saying “Realtime API disconnected,” but stopping and restarting the voice entry mode and refreshing the webpage usually cleared it quickly.
After a few seconds of speaking, Midjourney will begin flashing some keyword windows below the prompt entry textbox at the top and also generate a full text prompt to the right as it generates a new set of four images based on what the user said.

The user can then further modify these new generations by speaking to the model again, toggling voice mode on and off as needed.
Here’s a quick demo video of me using it today to generate some sample imagery. You’ll see the process is far from perfect, but it is really fast and does allow for more of an uninterrupted state of prompting, refining, and receiving images from the model.
More new features…but also many missing features and limitations from v6/6.1
Midjourney v7 is launching with two operational modes: Turbo and Relax. Turbo Mode provides high performance at twice the cost of a standard v6 job, while Draft Mode costs half as much (in terms of jobs). A standard-speed mode is currently in development and will be released once optimized.
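To make the pricing concrete, here is a minimal sketch of the arithmetic in Python. It measures everything in units of one standard v6 job. Reading Draft Mode’s “half as much” as half of a standard v6 job is an assumption, and the names `COST_MULTIPLIER`, `job_cost`, and `draft_jobs_per_turbo_job` are hypothetical, not part of any Midjourney tool or API.

```python
# Relative job costs, in units of one standard v6 job.
# Turbo's 2x figure comes from the article; treating Draft's "half as much"
# as half of a standard v6 job is an assumption made for this sketch.
COST_MULTIPLIER = {
    "turbo": 2.0,
    "draft": 0.5,
}


def job_cost(mode: str, jobs: int) -> float:
    """Return the cost of `jobs` generations, in standard-v6-job units."""
    return COST_MULTIPLIER[mode] * jobs


def draft_jobs_per_turbo_job() -> float:
    """Return how many Draft jobs cost the same as one Turbo job."""
    return COST_MULTIPLIER["turbo"] / COST_MULTIPLIER["draft"]


if __name__ == "__main__":
    print(job_cost("turbo", 10))       # 20.0
    print(job_cost("draft", 10))       # 5.0
    print(draft_jobs_per_turbo_job())  # 4.0
```

Under that reading, a user could run roughly four Draft generations for the price of one Turbo generation, which fits the pitch of drafting quickly and re-rendering only the keepers at full quality.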
At launch, features such as upscaling, inpainting, and retexturing will temporarily rely on the v6 model. Midjourney plans to transition these capabilities to v7 in future updates.
The company is committing to regular development over the next two months, with updates scheduled every one to two weeks. A major upcoming addition will be a new character and object reference system designed specifically for v7. On older versions of Midjourney, these features were invoked by appending arcane suffixes such as --cref (for character) and --sref (for style) to the end of a user’s text prompt.
Midjourney plans to engage its community through public sharing spaces and feedback channels, and it will host a roadmap ranking session to help prioritize future development efforts.
Midjourney emphasizes that v7 is a totally new model with its own strengths and challenges. Users are encouraged to experiment with different prompt styles and report their experiences to help refine the platform.
Initial reaction is mixed…far from the near-unanimous praise of prior Midjourney releases
While most of the older Midjourney releases were met with overwhelming excitement and adulation, the initial reception to v7 is decidedly more mixed.
Although Midjourney was careful to call this an “alpha” release in its blog and on social media, many users were still expecting a larger jump in image quality and prompt adherence (how well the image generations matched the user’s specific instructions in text or audio). They were also hoping for improved human anatomical understanding (particularly of hands, a common AI image generation challenge) and text generation (also something image models have struggled with, though Ideogram and OpenAI’s native GPT-4o image generator appear to get it far more consistently accurate than Midjourney v7, based on initial user reports).
“Gotta say it: kinda disappointed.
OpenAI set the bar sky-high. talk to your image gen like it’s your bro? Mind = blown. MJ7 looks “more realistic”. but did we really need that?
MJ + Magnific already nailed it. Might pause my sub tbh.”
“The problem is v7 doesn’t really feel like v7. It feels more like v6.2,” posted Magnific AI founder Javi Lopez on X, citing the incremental-seeming nature of the updates.
Indeed, Ethan Mollick, the University of Pennsylvania Wharton School of Business professor and AI influencer, also chimed in to say: “I like their new releases but the problem with the new v7 (right) released today is that v6 (left) was already really good.”
“Identical prompts from v6 are worse in v7,” wrote self-described “AI maximalist” David Shapiro on X.
“All the old Favorites that are getting way too old,” said artist and musician @CaptainHaHaa: “Hands, Text still an issue, no cref, srefs have gone wack. But its okay because you can talk to it while it disappoints you.”
Others were more forgiving and delighted with their initial test generations on v7, with AI power user Dreaming Tulpa saying on X it had “better image quality” and was “super creative.”
Similarly, AI artist and designer Tatiana Tsiguleva voiced that Midjourney v7 was a “Huge jump in quality!”
It’s still early days for Midjourney v7, however, and the initial reaction could swing in either direction, toward adulation or frustration with the new model and design features. For now, it’s available to anyone with a Midjourney account to begin using.