In this paper, they test different probability models to detect bias in iPod's shuffling algorithm and eventually conclude that:
> Our statistical tests show the long-term occurrences of these events are within expectations under the assumption of a random shuffle."
Regarding sorting by artists or groups, they found that:
> We failed to find any evidence to support the claim of users like Steven Levy of favoritism of certain groups in the shuffle.
In this paper, they test different probability models to detect bias in iPod's shuffling algorithm and eventually conclude that:
> Our statistical tests show the long-term occurrences of these events are within expectations under the assumption of a random shuffle."
Regarding sorting by artists or groups, they found that:
> We failed to find any evidence to support the claim of users like Steven Levy of favoritism of certain groups in the shuffle.