What is HDFS and how does it function within the Hadoop ecosystem? How does HDFS store and manage large datasets across distributed systems? What are the key components and architecture of HDFS? How does HDFS provide fault tolerance and scalability in big data processing? What are the common use cases and limitations of HDFS in data engineering?