We started a company called ChubbyBrain (the "CB" is our tribute to that). 0% of our data is from CrunchBase.
In terms of where it comes from:
80% of our data comes via software we've built to parse news, SEC, investor, and corporate websites (we crawl about 12k of them daily).
About 20% of our data comes directly from investors. The biggest contingent is angel data, which we get via a partnership with Silicon Valley Bank and the Angel Capital Association.