Jack I am trying to understand the concept of reinforcement learning in AI. How does an agent learn by interacting with an environment using rewards and penalties? Can someone also give simple real-world examples where reinforcement learning is used?