Alternatively, is there a heuristic that reliably classifies websites in to tiny/not-tiny?
How could we get this started?