If you’ve been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly larg...

Buy Now From Amazon

Product Review

If you’ve been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly large-scale data processing in the data center. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance.

Rather than run through all possible scenarios, this pragmatic operations guide calls out what works, as demonstrated in critical deployments.

  • Get a high-level overview of HDFS and MapReduce: why they exist and how they work
  • Plan a Hadoop deployment, from hardware and OS selection to network requirements
  • Learn setup and configuration details with a list of critical properties
  • Manage resources by sharing a cluster across multiple groups
  • Get a runbook of the most common cluster maintenance tasks
  • Monitor Hadoop clusters—and learn troubleshooting with the help of real-world war stories
  • Use basic tools and techniques to handle backup and catastrophic failure


Similar Products

Hadoop: The Definitive Guide: Storage and Analysis at Internet ScaleHadoop Application Architectures: Designing Real-World Big Data ApplicationsProgramming Hive: Data Warehouse and Query Language for HadoopKafka: The Definitive Guide: Real-Time Data and Stream Processing at ScaleDesigning Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable SystemsHigh Performance Spark: Best Practices for Scaling and Optimizing Apache SparkLearning Spark: Lightning-Fast Big Data AnalysisHadoop Security: Protecting Your Big Data PlatformPython for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython