I wouldn't hate this, even if it's imperfect - the most common false positives would be brand-pushing content I don't want even if it's not actually paid advertising.
But you still have to identify features and use an algorithm that basically goes "users identify images with features A, B, and C as ads, and they identify images with features X, Y, and Z as not ads, so put positive weights on A, B, and C and negative weights on X, Y, and Z, and block anything with a negative enough score".