well you could validate data integrity based on
- feedback from consumers of the commodity
- some kind of built in monitoring with growth model benchmarking the different suppliers and instead of proof work do a decentralized/federated machine learning competition as a hard to falsify computation which uses private data for training and verification