
In a sample of 10,000, the number who have trait T9 is about 20. The chance of all 20 landing in a randomly chosen half of the sample is about 1 in 500,000. So it's quite unlikely that randomization splits trait T9 that unevenly, and it's also quite unlikely that the effects of just 20 T9 individuals on the outcome of interest are so strong that a typical analysis would find a significant difference between the treatment and control arms. The chance of both these things being simultaneously true is negligible. Do I misunderstand you?
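For concreteness, that 1-in-500,000 figure checks out with a quick hypergeometric calculation (a sketch in Python; the sample size and carrier count are taken from the comment above):

    # P(all 20 carriers of T9 land in the same half of a 10,000-person sample)
    from math import comb

    N, half, k = 10_000, 5_000, 20

    # Number of halves that contain all k carriers, over all possible halves,
    # doubled because either half could be the one that gets them all.
    p_one_half = comb(N - k, half - k) / comb(N, half)
    print(2 * p_one_half)  # ~1.9e-6, i.e. about 1 in 500,000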


No, I just wasn't clear.

The issue is that each individual carries traits T1...Tn, so there's a very large number of ways for one group to end up with confounders the other doesn't have.

The role of the power law is to imply that the generative process which distributes these traits isn't "nice", so that one group can easily get a T9 that the other group doesn't have, and so on for all of T1...Tn.

So you have this, let's say adversarial, background generative process which is giving you these confounding traits but never enough of each that you get nice mixtures.

You could see it as the question of whether uniform sampling across many power-law-distributed factors actually delivers a uniform distribution of each factor in both groups. I haven't written a simulation, but I don't see why this wouldn't be a serious problem for randomisation.
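That simulation is easy to sketch. Here is a minimal version under assumptions not stated in the thread: traits are independent, trait Tj has power-law prevalence proportional to 1/j, the split is a random half/half, and "confounded" is operationalised as one arm holding at least 90% of a trait's carriers (the 0.9 threshold and the prevalence curve are arbitrary stand-ins):

    import numpy as np

    rng = np.random.default_rng(0)
    N, n_traits = 10_000, 500

    # Power-law tail: trait Tj is carried with probability ~ 0.5/j,
    # so early traits are common and later ones are rare.
    prevalence = 0.5 / np.arange(1, n_traits + 1)

    # Randomise: a uniformly chosen half of the sample goes to arm A.
    in_arm_a = np.zeros(N, dtype=bool)
    in_arm_a[rng.choice(N, N // 2, replace=False)] = True

    for j, p in enumerate(prevalence, start=1):
        carriers = rng.random(N) < p               # who carries trait Tj
        k = int(carriers.sum())
        if k == 0:
            continue
        share = carriers[in_arm_a].sum() / k       # fraction of carriers in arm A
        if max(share, 1 - share) >= 0.9:           # nearly all in one arm
            print(f"T{j}: {k} carriers, {share:.0%} in arm A")

When anything prints at all, it's a trait with only a handful of carriers; a trait with 100 carriers lands 80/20 or worse with probability on the order of 10^-9, so even thousands of traits don't produce a common lopsided one. Whether a handful of carriers can move the outcome is the effect-size question the reply below raises.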


With a large sample size, it's astronomically unlikely for there to be a confounding trait that is important to the outcome and is also widespread in only one of the experimental arms. If you were to write a simulation showing the issue, I might be able to explain more specifically why I think the simulation doesn't reflect reality.
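The "astronomically unlikely" claim can be made quantitative. For a trait with k carriers, the chance that one arm of a fair split gets at least 80% of them falls off exponentially in k (a sketch; the 80% cut-off is an arbitrary choice for illustration):

    from math import comb

    def p_lopsided(k):
        # P(one arm gets >= 80% of the k carriers) under a fair 50/50 split,
        # using the binomial approximation to the exact hypergeometric.
        cut = -(-4 * k // 5)  # smallest integer >= 0.8 * k
        tail = sum(comb(k, i) for i in range(cut, k + 1)) / 2 ** k
        return min(1.0, 2 * tail)  # doubled: either arm could be the heavy one

    for k in (20, 100, 500):
        print(k, p_lopsided(k))
    # roughly: 20 -> 1e-2, 100 -> 1e-9, 500 -> 1e-43

So even a union bound over thousands of candidate traits leaves the probability negligible once a trait is common enough to plausibly drive the outcome.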



