There is no special insight required here. Merely, a simple diagonalization argument (https://news.ycombinator.com/item?id=43257904) which given a partial process to detect if a Turing machine halts, constructs a new machine which does not halt but was not detected by the earlier process.