I guess you can't read Japanese.

volkk · on Feb 15, 2024

maybe for now, only a matter of time before stuff like this is fixed

zogwarg · on Feb 16, 2024

And I guess you haven't actually been to Tokyo, the number of details which are subtly wrong is actually very high, and it isn't limited to text, heck detecting those flaws isn't even limited by knowledge of Japan:

- Uncanny texture and shape for the manhole cover

- Weirdly protruding yellow line in the middle of the road, where it doesn't make sense - Weird double side-curb on the right, which can't really be called steps.

- Very strange gait for the "protagonist", with the occasional leg swap.

- Not quite sensical geometry for the crosswalks, some of them leading nowhere (into the wet road, but not continuing further)

- Weird glowy inside behind the columns on the right.

- What was previously a crosswalk, becoming wet "streaks" on the road.

- No good reason for crosswalks being the thing visible in the reflection of the sunglasses.

- Absurd crosswalk orientation at the end. (90 degrees off)

- Massive difference in lighting between the beginning of the clip and the end, suggesting an impossible change of day.

Nothing suggests to me that these are easy artifacts to remove, given how the technology is described as "denoising" changes between frames.

This is probably disruptive to some forms of video production, but the high-end stuff I suspect will still use filming mostly ground in truth, this could highly impact how VFX and post-production is done, maybe.

padolsey · on Feb 16, 2024

With everything we've seen in the last couple years, do you sincerely believe that all of those points won't be solved pretty soon? There are many intermediary models that can be used to remove these kind of artefacts. Human motion can be identified and run through a pose/control-net filter, for example. If these generations are effectively one-shot without subsequent domain-specific adjustments, then we should expect for every single one of your identified flaws to be remedied pretty soon.

serf · on Feb 15, 2024

the world is getting increasingly surveilled as well, I guess the presumption is that eventually you'll just be able to cross reference a 'verified' recording of the scene against whatever media exists.

"We ran the vid against the nationally-ran Japanese scanners, turns out that there are no streets that look like this, nor individuals."

in other words I think that the sudden leap of usable AI into real life is going to cause another similar leap towards non-human verification of assets and media.