Not on the VC path. Not even on the max-profit path. Just on the "Have fun doing cool research" path.
I was a mod on MJ for its first few years and got to know MJ's founder through discussions there. He already had "enough" money for himself from his prior sale of Leap Motion to do whatever he wanted. And, he decided what he wanted was to do cool research with fun people. So, he started MJ. Now he has far more money than before and what he wants to do with it is to have more fun doing more cool research.
Great for him, but since you mention research and fun: I have to say I'm not aware that MJ has published any research whatsoever.
And on the topic of fun, while it's certainly highly subjective, I remember that moderation in the MJ tool was at one point so strict that you could not generate an image containing a "treasure chest", since they censored the word "chest".
I'm happy that state of the art models are now developed by actors who publish comprehensive technical reports and open-weights.
1. Real-time world models for the "holodeck". They have to be fast, high quality, and inexpensive to serve to lots of users. They started on this two years ago, before "world model" hype was even a thing.
2. Some kind of hardware to support this.
David Holz talks about this on Twitter occasionally.
Midjourney still has incredible revenue. It's still the best looking image model, even if it's hard to prompt, can't edit, and has artifacting. Every generation looks like it came out of a magazine, which is something the other leading commercial models lack.
They have image and video models that are nowhere near SOTA on prompt adherence or image editing but pretty good on the artistic side. They lean into features like reference images (so objects or characters keep a consistent look), biasing the model toward your style preferences, and moodboards for generating a consistent style.
A lot of people started realizing that it didn’t really matter how pretty the resulting image was if it completely failed to adhere to the prompt.
Even something like Flux.1 Dev which can be run entirely locally and was released back in August of 2024 has significantly better prompt understanding.
Yeah, though there is the same issue the other way round: great prompt understanding doesn't matter much when the result has an awfully ugly AI-fake look to it.
That's definitely true, and the medium also really makes a big difference as well (photorealism, digital painting, watercolor, etc.).
Though in some cases, it is a bit easier to fix visual artifacts (using second-pass refiners, Img2Img, ultimate upscale, stylistic LoRAs, etc.) than a fundamental coherency problem.
I was disappointed when Imagen 4 (and therefore also Nano Banana Pro, which clearly uses Imagen 4 internally to some degree) had a significantly stronger tendency to drift from photorealism to AI fake aesthetics than Imagen 3. This suggests there is a tradeoff between prompt following and avoiding slop style. Perhaps this is also part of the reason why Midjourney isn't good at prompt following.
How is it a problem? There simply doesn't seem to be a moat or secret sauce. Who cares which of these models is SOTA? In two months there will be a new model.
Right, but that's a short term moat. If they pause on their incredible levels of spending for even 6 months, someone else will take over having spent only a tiny fraction of what they did. They might get taken over anyway.
By reverse engineering, sheer stupidity from the competition, corporate espionage, "stealing" engineers, and sometimes a stroke of genius; the same as it's always been.
They still have a niche. Their style references feature is their key differentiator now, but I find I can usually just drop some images of an MJ style into Gemini and get it to give me a text prompt that works just as well as MJ srefs.
The pace of commoditization in image generation is wild. Every 3-4 months the SOTA shifts, and last quarter's breakthrough becomes a commodity API.
What's interesting is that the bottleneck is no longer the model — it's the person directing it. Knowing what to ask for and recognizing when the output is good enough matters more than which model you use. Same pattern we're seeing in code generation.
There is a decent chance there will be no clear consensus... Maybe people doing custom LoRAs etc. should publish for the three most common models. Or maybe the tooling will make switching models in a workflow pain-free, as has kind of happened with LLMs.
I'm happy the models are becoming commodity, but we still have a long way to go.
I want the ability to lean into any image and tweak it like clay.
I've been building open source software to orchestrate the frontier editing models (skip to halfway down), but it would be nice if the models were built around the software manipulation workflows: