I haven't read much about how these systems work yet, so this is probably a novice question, but I'd be interested to hear more about how the algorithm handles text input and feeds it into the generator. Does the training process include a ton of tagged images or something, and does the model then learn to generate images that correspond reasonably to those tags?
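Roughly, yes: the standard recipe is to train on a large dataset of (image, caption) pairs, so the model learns a mapping from a text embedding to image pixels. DALL-E 2 specifically pairs a CLIP text encoder with a diffusion decoder, but here's a deliberately tiny sketch of the underlying idea (every name and size below is invented for illustration, and the simple MSE objective stands in for the real diffusion loss):

```python
import torch
import torch.nn as nn

# Toy sizes, all made up: vocab size, text embedding dim, flattened 8x8 image.
VOCAB, EMB, IMG = 100, 32, 8 * 8

class TinyTextToImage(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, EMB)   # "text encoder": token embeddings
        self.decode = nn.Sequential(            # "generator" conditioned on the text
            nn.Linear(EMB, 128), nn.ReLU(), nn.Linear(128, IMG)
        )

    def forward(self, tokens):
        text_vec = self.embed(tokens).mean(dim=1)  # one vector summarizing the caption
        return self.decode(text_vec)               # predict pixels from that vector

model = TinyTextToImage()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)

# Fake "captioned image" dataset: random token ids paired with random pixels.
captions = torch.randint(0, VOCAB, (64, 5))  # 64 captions, 5 tokens each
images = torch.rand(64, IMG)                 # 64 paired images

for step in range(100):
    pred = model(captions)
    # Push the generated pixels toward the image paired with each caption;
    # this is how the text-image correspondence gets learned.
    loss = nn.functional.mse_loss(pred, images)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

A real system swaps the mean-pooled embedding for a transformer text encoder and the MLP for a diffusion model, but the training signal is the same: captions paired with images.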
We are using existing models based on open-source research. We have other models that OpenAI does not offer (e.g. super resolution), but they're not in the app.
The reality, though, is that DALL-E 2 is still not open for access; open-source models are.
If you know how to get API access (besides the waitlist, of course), please let me know!