My work in statistical learning always comes down to multicollinearity, variable cardinality, and model selection. Understanding the topology of the data space is critical, and the day-to-day concerns of data cleaning had made me lose sight of that. In that respect, this article was a fantastic reminder.