I wonder if the key behind the quality of the MidJourney models, and this models...

CuriouslyC · on Aug 1, 2024

Midjourney unquestionably has heavy data set curation and uses RLHF from users.

You don't have to speculate on this as you can see that custom models for SDXL for instance perform vastly better than vanilla SDXL at the same number of parameters. It's all data set and tagging.

spywaregorilla · on Aug 1, 2024

custom models perform vastly better at the tasks they are finetuned to do

CuriouslyC · on Aug 1, 2024

That is technically true, but when the base model is wasting parameter information on poorly tagged, watermarked stock art and other garbage images, it's not really a meaningful distinction. Better data makes for better models, nobody cares about how well a model outputs trash.

spywaregorilla · on Aug 1, 2024

Ok, but you're severely misrepresenting the importance of things. Base SDXL is a fine model. Base SDXL is going to be much better than a materially smaller model that you've retrained with "good data".

cma · on Aug 2, 2024

SDXL used RLHF too

BoredPositron · on Aug 1, 2024

It's the quality of the image text pair not the image alone but midjourney is not a model it's a suite of models that work in conjunction. They have an llm in the front to optimize the user prompts, they use SAM models, controlnet models for poses that are in high demand and so much more. That's why you can't really compare foundation models anymore because there are none.

jncfhnb · on Aug 1, 2024

No, it’s definitely the size. Tiny LLMs are shit. Stable Diffusion 3’s problem is not that that its training set was wildly different, it’s that it’s just too small (because the one released so far is not the full size).

You can get better results with better data, for sure. And better architecture, for sure. But raw size is really important the difference in quality for models, all else held equal, is HUGE and obvious if you play with them.

pzo · on Aug 1, 2024

I would agree - midjourney is getting a free labour since many of their generations are not in secret mode (require pro/mega subscription) so prompts and outputs are visible to everyone. Midjourney rewards users to rating those generations. I wouldn't be surprised if there are some bots on their discord that are scraping those data for training their own models.

ilkke · on Aug 3, 2024

Are the prompts of pro users secret to Midjourney?