Readit News logoReadit News
amanzi · 2 years ago
Even the good AI image generators struggle with details like fingers, and ear lobes, and other small but important details.

Also, I found this quote interesting: "...Stability AI’s insistence on censoring adult content from SD3’s training data". Da Vinci figured this out hundreds of years ago, that to draw accurate pictures of humans, you need to understand the human body.

kergonath · 2 years ago
And sometimes that body is nude. On one hand I hope that this will help some people understand that their standards of what is SFW or NSFW need to be adjusted. Also, why the hell are we using workplaces as a yard stick for what is acceptable in our private life? Why on earth is it a problem if kids see tits at home?

Anyway, I hope standards will change in the US faster than the rest of the world finishes shifting to those utterly stupid standards.

ben_w · 2 years ago
Indeed, what counts as safe and not-safe varies wildly by culture.

I live in Berlin, and outside my apartment there is a spinning cube of adverts; the face of that cube advertising for Dildo King is between the face advertising for family cargo bikes and the face advertising for Edeka (a supermarket). There are also several nudist beaches within the city limits.

From outside the USA, I sometimes hear things such as Florida wanting to treat cross-dressing as inherently sexualised and therefore criminal if done in the presence of minors. (Was that a true story? When I search for it, I get transgender issues, rather than cross-dressing, so I can't find out if Hillary Clinton's trousers are a literal fashion crime in Florida).

roenxi · 2 years ago
Although it'll be fascinating to see how things like the finger situation change as we get models in common use that also handle video well. One thing that becomes obvious quickly with Stable Diffusion is that it doesn't have a good concept of a 3D world model to work with; or any grasp of physics. It is practically impossible for a human artist with SD levels of competence to accidentally draw multiple limbs and that speaks to how it just doesn't get that humans are a 3D existence.

Once video data is involved though it seems likely that will change. And I reckon a side effect is that a lot of the trickier details will improve. It'll be a great experiment to figure out whether the models also need lessons in anatomy or whether they figure it out through pure observation.

lifeisstillgood · 2 years ago
I think the point is you need a naked human body - art classes use volunteers, midjourney presumably uses pornhub. I guess there is not a lot of naked human images out there that’s not porn.
semi-extrinsic · 2 years ago
At least since v5 and barring any prompt engineering, midjourney is a real prude. I would be very surprised if it's trained on porn. Try something like "women at the beach" and everyone is wearing wetsuits. Or the infamous "treasure chest is a banned term".

FWIW this is better for my purposes, with the older versions I recall trying to generate illustrations of female scientists for use in professional settings, and having to do a lot of tweaking to avoid ahem chest issues.

It seems obvious that midjourney is trained on copyrighted material though. I've seen the latest version generate straight-up "Tom Cruise in Top Gun 2" and similar.

TeMPOraL · 2 years ago
I imagine if they filtered out just the bits with some action going on, they'd still have approximately infinite feed of naked human bodies in all possible poses, which would be unoffensive out of context (or at least less offensive), and would be equivalent to art classes models for the purpose of training.
kergonath · 2 years ago
It really is not difficult to find breasts, bottoms and well proportioned limbs on the Internet. Even in underwear if you must because you’re afraid of something.
lovethevoid · 2 years ago
Good AI image generators handle those details well already. Even if you don't get it 100%, you can inpaint fix very quickly.

The current problem for them really isn't the details in isolation but rather cohesive details throughout the entire picture in one attempt. It's very lacking and requires a lot of manual input, filtering, reliance on multiple tools, etc.

kergonath · 2 years ago
> Good AI image generators handle those details well already. Even if you don't get it 100%, you can inpaint fix very quickly.

But that should not be the case. A human body is not more complicated than a horse’s or a cat’s, and those are usually much better. There really is a problem in our relation with our own bodies.

TeMPOraL · 2 years ago
Yeah, it's especially ironic given how big a role nude modelling plays in art schools - it's fundamental to drawing, painting and sculpting.
turtleyacht · 2 years ago
I've never seen a drawing book start from outside-in. We learn from the massed blocks, sketched curves, and mirrored proportions. The shapes, the skeleton, the muscles and skin.
8f2ab37a-ed6c · 2 years ago
Has genAI for images hit an asymptote in the last year or so? Can't quite tell if things have gotten noticeably better since around when DALLE 3 was launched, it all looks about the same quality.
elpocko · 2 years ago
Yeah, the mainstream stuff that you get to use while subjected to heavy surveillance and censorship is about the same. There's a whole universe underneath that, with no restrictions and basically unlimited creativity, if you have the required hardware.
8f2ab37a-ed6c · 2 years ago
Any pointers for where people working on the DIY side of image genai hang out? Any specific forums or subreddits?
haunter · 2 years ago
Thanks to the horseshoe [0] puritanism we will live in a safer world. It's all right, everything is all right, the struggle is finished.

0, https://en.wikipedia.org/wiki/Horseshoe_theory

spacecadet · 2 years ago
Oh man do I mention Horseshoe alot... Its all extremism either way.

Dead Comment

margorczynski · 2 years ago
Just what I needed - some nightmare fuel before I go to sleep. But putting that aside - how is Stability still going? The VC money didn't dry out yet?
bitwize · 2 years ago
"Suppose you get a salamander. You know, a salamander can regrow its limbs if you cut them off. So all these nasty biology students do things like, they'll cut a limb of the salamander -- a hand, the arm off, just between the shoulder and the elbow, and at the same time, cut the same one between the elbow and the hand, flip around that segment, and re-sew it back on. Nasty thing to do to a salamander. What do you think you get? Does anybody here know? What you get is three elbows. It grows two new elbows to make up for the fact that the wrong thing is connected to the wrong thing." --Gerald Sussman, "We Really Don't Know How To Compute!" (https://www.youtube.com/watch?v=HB5TrK7A4pI&t=2m49s)

These images look like someone spliced salamander regeneration DNA into a human, cut them into many pieces, and connected the wrong thing to the wrong thing all over the place. Maybe that's what the AI wants to do to us, AM style (I Have No Mouth and I Must Scream)?

elpocko · 2 years ago
SD 1.5, on the the other hand, with the right checkpoints generates flawless human bodies. Older tech, much better results.

Can finetuning fix this?

m463 · 2 years ago
> generates flawless human bodies

wonder what the right checkpoints are. I could have sworn sd 1.5 will sometimes generate 3 flawless arms.

elpocko · 2 years ago
Yeah, sometimes. But it can also do this, for example (NSFW!): https://files.catbox.moe/le4v1y.jpg