Benjamin I want to understand what reinforcement learning means in AI. How does an agent learn through rewards and penalties by interacting with an environment? Can someone also explain simple real-world examples like games or robotics?