Cartoonish output is a problem across the board. If you explicitly ask Dall-E fo... | Hacker News


BugsJustFindMe on Sept 20, 2024 | on: New AI diffusion model approach solves the aspect ...

Cartoonish output is a problem across the board. If you explicitly ask Dall-E for a "photograph" of something, you will very often get a result that looks like a cartoonified illustration. Prompt writers resort to specifying exact camera models and lenses to try to constrain the process.

adamanonymous on Sept 20, 2024

There are fine-tuned models out there that can generate near-photorealistic results. The base SD models and those offered by the major AI service sites have a more stylized look to them. That's probably partly so they work on a wider array of prompts that may include non-photorealistic subjects, and partly for safety.
