Stable Diffusion together with Controlnet. You basically feed it the text as a black and white image and provide it with a description of the picture of cats. It will then generate this output while using the black and white image as a base. It’s fairly simple to do but it can take a while to get a quality result such as this one.
How do they make these?!
Stable Diffusion together with Controlnet. You basically feed it the text as a black and white image and provide it with a description of the picture of cats. It will then generate this output while using the black and white image as a base. It’s fairly simple to do but it can take a while to get a quality result such as this one.
yoo i made a thing https://files.catbox.moe/egdau8.png