Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS

Author:   Sam Alapati
Publisher:   Pearson Education (US)
ISBN:  

9780134597195


Pages:   848
Publication Date:   19 January 2017
Format:   Paperback
Availability:   In Print   Availability explained
This item will be ordered in for you from one of our suppliers. Upon receipt, we will promptly dispatch it out to you. For in store availability, please contact us.

Our Price $62.95 Quantity:  
Add to Cart

Share |

Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS


Add your own review!

Overview

In Expert Hadoop® Administration, leading Hadoop administrator Sam R. Alapati brings together authoritative knowledge for creating, configuring, securing, managing, and optimising production Hadoop clusters in any environment. Drawing on his experience with large-scale Hadoop administration, Alapati integrates action-oriented advice with carefully researched explanations of both problems and solutions. He covers an unmatched range of topics and offers an unparalleled collection of realistic examples. Alapati demystifies complex Hadoop environments, helping readers understand exactly what happens behind the scenes when they administer their cluster. Students will gain unprecedented insight as they walk through building clusters from scratch and configuring high availability, performance, security, encryption, and other key attributes.

Full Product Details

Author:   Sam Alapati
Publisher:   Pearson Education (US)
Imprint:   Addison Wesley
Dimensions:   Width: 17.20cm , Height: 4.00cm , Length: 23.60cm
Weight:   1.326kg
ISBN:  

9780134597195


ISBN 10:   0134597192
Pages:   848
Publication Date:   19 January 2017
Audience:   College/higher education ,  Tertiary & Higher Education
Format:   Paperback
Publisher's Status:   Active
Availability:   In Print   Availability explained
This item will be ordered in for you from one of our suppliers. Upon receipt, we will promptly dispatch it out to you. For in store availability, please contact us.

Table of Contents

Part I: Introduction to Hadoop—Architecture and Hadoop Clusters Chapter 1: Introduction to Hadoop and Its Environment Chapter 2: An Introduction to the Architecture of Hadoop Chapter 3: Creating and Configuring a Simple Hadoop Cluster Chapter 4: Planning for and Creating a Fully Distributed Cluster Part II: Hadoop Application Frameworks Chapter 5: Running Applications in a Cluster—The MapReduce Framework (and Hive and Pig) Chapter 6: Running Applications in a Cluster—The Spark Framework Chapter 7: Running Spark Applications Part III: Managing and Protecting Hadoop Data and High Availability Chapter 8: The Role of the NameNode and How HDFS Works Chapter 9: HDFS Commands, HDFS Permissions and HDFS Storage Chapter 10: Data Protection, File Formats and Accessing HDFS Chapter 11: NameNode Operations, High Availability and Federation Part IV: Moving Data, Allocating Resources, Scheduling Jobs and Security Chapter 12: Moving Data Into and Out of Hadoop Chapter 13: Resource Allocation in a Hadoop Cluster Chapter 14: Working with Oozie to Manage Job Workflows Chapter 15: Securing Hadoop Part V: Monitoring, Optimization and Troubleshooting Chapter 16: Managing Jobs, Using Hue and Performing Routine Tasks Chapter 17: Monitoring, Metrics and Hadoop Logging Chapter 18: Tuning the Cluster Resources, Optimizing MapReduce Jobs and Benchmarking Chapter 19: Configuring and Tuning Apache Spark on YARN Chapter 20: Optimizing Spark Applications Chapter 21: Troubleshooting Hadoop—A Sampler Chapter 22: Installing VirtualBox and Linux and Cloning the Virtual Machines

Reviews

Author Information

Sam R. Alapati has been working with various aspects of the Hadoop environment for the past six years. He is currently the principal Hadoop administrator at Sabre Corporation in Westlake, Texas, and works on a daily basis with multiple large Hadoop 2 clusters. In addition to being the point person for all Hadoop administration at Sabre, Sam manages multiple critical data-science- and data-analysis-related Hadoop job flows and is also an expert Oracle Database Administrator. His vast knowledge of relational databases and SQL contributes to his work with Hadoop related projects. Sam’s recognition in the database and middleware area includes having published 18 well-received books over the past 14 years, mostly on Oracle Database Administration and Oracle Weblogic Server. His experience dealing with numerous configuration, architectural, and performance-related Hadoop issues over the years led him to the realization that many working Hadoop administrators and developers would appreciate having a handy reference such as this book to turn to when creating, managing, securing and optimizing their Hadoop infrastructure.

Tab Content 6

Author Website:  

Customer Reviews

Recent Reviews

No review item found!

Add your own review!

Countries Available

All regions
Latest Reading Guide

wl

Shopping Cart
Your cart is empty
Shopping cart
Mailing List