From what little I've gathered about the subject, I think so. Spiking neural networks would be orders of magnitude more energy efficient than deep neural networks for image and audio processing. The main hurdle lies in designing network structure and weights; the cost function of a classical neural network can be differentiated using basic calculus, but optimizing a spiking neural network is not as easy.
Once you describe the specific modulation and noise properties you could recover the equivalent real valued continuous time neutral network from spiking neutral network. Then you can similarly map the training algorithms from gradient networks to "spike gradients".
The next necessary step is to use (potentially quantum) interference in addition to integration or summation to train the interconnections.
You will likely end up with quite complicated Hessians in both time and value.