Chloe I want to understand what batch normalization is in deep learning. How does it help improve training speed, stability, and model performance? Can someone also explain how it works within a neural network?