Benjamin I want to understand what feature scaling means in machine learning. Why is it important to normalize or standardize data before training models? Can someone also explain common methods like Min-Max scaling and standardization?