But do i find myself conflicted about dismissing it as a potential technical skill all together.
I have seen comfy-ui workflows that are build in a very complex way, some have the canvas devided in different zones, each having its own prompts. Some have no prompts and extract concepts like composition or color values from other files.
I compare these with collage-art which also exists from pre existing material to create something new.
Such tools take practice, there are choices to be made, there is a creative process but its mostly technological knowledge so if its about such it would be right to call it a technical skill.
The sad reality however, is how easy it is to remove parts of that complexity “because its to hard” and barebones it to simple prompt to output. At which point all technical skill fades and it becomes no different from the online generators you find.
All of that’s great and everything, but at the end of the day all of the commercial VLM art generators are trained on stolen art. That includes most of the VLMs that comfyui uses as a backend. They have their own cloud service now, that ties in with all the usual suspects.
So even if it has some potentially genuine artistic uses I have zero interest in using a commercial entity in any way to ‘generate’ art that they’ve taken elements for from artwork they stole from real artists. Its amoral.
If it’s all running locally on open source VLMs trained only on public data, then maybe - but that’s what… a tiny, tiny fraction of AI art? In the meantime I’m happy to dismiss it altogether as Ai slop.
How is that any different from “stealing” art in a collage, though? While courts have disagreed on the subject (in particular there’s a big difference between visual collage and music sampling with the latter being very restricted) there is a clear argument to be made that collage is a fair use of the original works, because the result is completely different.
Collage art retains the original components of the art, adding layers the viewer can explore and seek the source of, if desired.
VLMs on the other hand intentionally obscure the original works by sending them through filters and computer vision transformations to make the original work difficult to backtrace. This is no accident, its designed obfuscation.
The difference is intent - VLMs literally steal copies of art to generate their work for cynical tech bros. Classical collages take existing art and show it in a new light, with no intent to pass off the original source materials as their own creations.
The original developers of Stable Diffusion and similar models made absolutely no secret about the source data they used. Where are you getting this idea that they “intentionally obscure the original works… to make [them] difficult to backtrace.”? How would an image generation model even work in a way that made the original works obvious?
Literally steal
Copying digital art wasn’t “literally stealing” when the MPAA was suing Napster and it isn’t today.
For cynical tech bros
Stable Diffusion was originally developed by academics working at a University.
Your whole reply is pretending to know intent where none exists, so if that’s the only difference you can find between collage and AI art, it’s not good enough.
If you download a checkpoint from non trustworthy sources definitely and that is the majority of people, but also the majority that does not use the technical tools that deep nor cares about actual art (mostly porn if the largest distributor of models civitai is a reference).
The technical tool that allow actual creativity is called comfyui, and this is open source. I have yet to see anything that is even comparable. Other creative tools (like the krita plugin) use it as a backend.
I am willing to believe that someone with a soul for art and complex flows would also make their own models, which naturally allows much more creativity and is not that hard to do.
I think there’s a stark difference between crafting your own comfyui workflow, getting the right nodes and control nets and checkpoints and whatever, tweaking it until you get what you want, and someone telling an AI “make me a picture/video of X.”
The least AI-looking AI art is the kind that someone took effort to make their own. Just like any other tool.
Unfortunately, gen AI is a tool that gives relatively good results without any skill at all. So most people won’t bother to do the work to make it their own.
I think that, like nearly everything in life, there is nuance to this. But at the same time, we aren’t ready for the nuance because we’re being drowned by slop and it’s horrible.
That was a beautiful read.
But do i find myself conflicted about dismissing it as a potential technical skill all together.
I have seen comfy-ui workflows that are build in a very complex way, some have the canvas devided in different zones, each having its own prompts. Some have no prompts and extract concepts like composition or color values from other files.
I compare these with collage-art which also exists from pre existing material to create something new.
Such tools take practice, there are choices to be made, there is a creative process but its mostly technological knowledge so if its about such it would be right to call it a technical skill.
The sad reality however, is how easy it is to remove parts of that complexity “because its to hard” and barebones it to simple prompt to output. At which point all technical skill fades and it becomes no different from the online generators you find.
All of that’s great and everything, but at the end of the day all of the commercial VLM art generators are trained on stolen art. That includes most of the VLMs that comfyui uses as a backend. They have their own cloud service now, that ties in with all the usual suspects.
So even if it has some potentially genuine artistic uses I have zero interest in using a commercial entity in any way to ‘generate’ art that they’ve taken elements for from artwork they stole from real artists. Its amoral.
If it’s all running locally on open source VLMs trained only on public data, then maybe - but that’s what… a tiny, tiny fraction of AI art? In the meantime I’m happy to dismiss it altogether as Ai slop.
How is that any different from “stealing” art in a collage, though? While courts have disagreed on the subject (in particular there’s a big difference between visual collage and music sampling with the latter being very restricted) there is a clear argument to be made that collage is a fair use of the original works, because the result is completely different.
Collage art retains the original components of the art, adding layers the viewer can explore and seek the source of, if desired.
VLMs on the other hand intentionally obscure the original works by sending them through filters and computer vision transformations to make the original work difficult to backtrace. This is no accident, its designed obfuscation.
The difference is intent - VLMs literally steal copies of art to generate their work for cynical tech bros. Classical collages take existing art and show it in a new light, with no intent to pass off the original source materials as their own creations.
The original developers of Stable Diffusion and similar models made absolutely no secret about the source data they used. Where are you getting this idea that they “intentionally obscure the original works… to make [them] difficult to backtrace.”? How would an image generation model even work in a way that made the original works obvious?
Copying digital art wasn’t “literally stealing” when the MPAA was suing Napster and it isn’t today.
Stable Diffusion was originally developed by academics working at a University.
Your whole reply is pretending to know intent where none exists, so if that’s the only difference you can find between collage and AI art, it’s not good enough.
only a note: LLMs are for text
Thanks. I edited
If you download a checkpoint from non trustworthy sources definitely and that is the majority of people, but also the majority that does not use the technical tools that deep nor cares about actual art (mostly porn if the largest distributor of models civitai is a reference).
The technical tool that allow actual creativity is called comfyui, and this is open source. I have yet to see anything that is even comparable. Other creative tools (like the krita plugin) use it as a backend.
I am willing to believe that someone with a soul for art and complex flows would also make their own models, which naturally allows much more creativity and is not that hard to do.
I think there’s a stark difference between crafting your own comfyui workflow, getting the right nodes and control nets and checkpoints and whatever, tweaking it until you get what you want, and someone telling an AI “make me a picture/video of X.”
The least AI-looking AI art is the kind that someone took effort to make their own. Just like any other tool.
Unfortunately, gen AI is a tool that gives relatively good results without any skill at all. So most people won’t bother to do the work to make it their own.
I think that, like nearly everything in life, there is nuance to this. But at the same time, we aren’t ready for the nuance because we’re being drowned by slop and it’s horrible.