A lot of evaluating a theory comes down to making subjective judgements about how probable things are when there aren't well-established priors. It's like debugging: you end up with symptoms that could have come about in a few different ways, and nothing lines up perfectly. Different team members can each argue for what the most likely culprit is. If something doesn't feel right about one explanation, you lean towards another hypothesis.
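
To make the debugging analogy concrete, here is a rough sketch of how that kind of leaning could be written down as a Bayesian update. The hypothesis names and every number are made up for illustration; in practice the priors and likelihoods are exactly the subjective guesses that people argue over.

```python
def update(priors, likelihoods):
    """Combine prior beliefs with how well each hypothesis explains
    a new observation, then renormalize (Bayes' rule)."""
    unnormalized = {h: priors[h] * likelihoods[h] for h in priors}
    total = sum(unnormalized.values())
    return {h: weight / total for h, weight in unnormalized.items()}


# Hypothetical starting guesses about the culprit (subjective, sum to 1).
priors = {"race_condition": 0.5, "bad_config": 0.3, "memory_leak": 0.2}

# Hypothetical guesses for how likely the new symptom would be
# if each hypothesis were true.
likelihoods = {"race_condition": 0.1, "bad_config": 0.6, "memory_leak": 0.3}

posterior = update(priors, likelihoods)
most_likely = max(posterior, key=posterior.get)

for hypothesis, probability in sorted(posterior.items(), key=lambda kv: -kv[1]):
    print(f"{hypothesis}: {probability:.2f}")
```

With these invented numbers, the symptom fits the initial favorite poorly, so belief shifts towards a different hypothesis. Two teammates who start from different priors or likelihoods would end up with different rankings, which is the disagreement described above.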