Does this dataset have an auxiliary field for each data point to note if it's been human reviewed? We used to train models with only those datapoints that had n=3 concordance in manual reviewers.
It doesn't, but since the original was supposedly human generated and I've now checked them all (twice) it should be n=2. Would love for someone to triple check! If you find errors let me know and I'll be happy to correct them.