Beginning Hadoop: Understanding Hadoop Scalability and Performance of Clusters
Autor Gurmukh Singhen Limba Engleză Paperback – 7 apr 2016
There are many challenges in setting up and scaling distributed frameworks like hadoop.
Despite, Hadoop being an Open Source product and with so many good documentations and books, it is difficult for an individual or an enterprise to define various use cases or working models, that too with a clear understanding of its workings and tuning it for optimal performance.
Pro Hadoop Administration by Gurmukh Singh, a Hadoop specialist and an infrastructure architect, takes a deep dive into configuring Hadoop services and its integration with various tools or frameworks. The book covers the processes right from scratch to building a Hadoop cluster at the production level, with best practices and optimal performance.
You will learn:
- Use Cases and set of recipes for the Hadoop production environment.
- From Compiling Hadoop to setting up Cluster with Highly available services.
- It's integration with various tools like Sqoop, Flume, HBase, Hive and many more.
- Performance tuning and Cluster Planning.
- Hadoop security like Kerberos, Encryption and other aspects of security like OS and Network Level.
Preț: 189.60 lei
Nou
Puncte Express: 284
Preț estimativ în valută:
36.29€ • 37.69$ • 30.14£
36.29€ • 37.69$ • 30.14£
Carte nepublicată încă
Doresc să fiu notificat când acest titlu va fi disponibil:
Se trimite...
Preluare comenzi: 021 569.72.76
Specificații
ISBN-13: 9781484213544
ISBN-10: 1484213548
Pagini: 250
Ilustrații: Bibliographie
Dimensiuni: 178 x 254 mm
Ediția:1st ed. 2016
Editura: Apress
Colecția Apress
Locul publicării:Berkeley, CA, United States
ISBN-10: 1484213548
Pagini: 250
Ilustrații: Bibliographie
Dimensiuni: 178 x 254 mm
Ediția:1st ed. 2016
Editura: Apress
Colecția Apress
Locul publicării:Berkeley, CA, United States
Public țintă
Popular/generalCuprins
Chapter 1: Introduction to Distributed Computing and Hadoop.Chapter Goal: Talk about the Distributed computing, challenges and some of the existing platforms in the market.
Sub -Topics
Sub - Topics
Sub - Topics:
4. Failover to Secondary
Chapter 4: Concepts of redundancy and Data AccessChapter Goal: Understand how replication works and setup rack awareness
Sub - Topics:
Chapter 4: Hadoop Administration TasksChapter Goal: Learn about day-to-day activities, which are performed by Hadoo
p Admins like Cluster balancing, disk space issues etc
Sub -Topics
- Introduction to Distributed computing.
- Introduction to Hadoop and its history
- Current Hadoop distributions and its market.
- Problem statement why Hadoop is needed and its use cases
Sub - Topics
- Hadoop Compilation.
- Hadoop Installation and its various modes
- Hadoop Daemons Configuration.
- Basic Hadoop Configuration Parameters.
Sub - Topics:
- Secondary NameNode Setup.
- Namenode Metadata Concepts.
4. Failover to Secondary
Chapter 4: Concepts of redundancy and Data AccessChapter Goal: Understand how replication works and setup rack awareness
Sub - Topics:
- Configure Hadoop Clients
- Multi-A record Clients
Chapter 4: Hadoop Administration TasksChapter Goal: Learn about day-to-day activities, which are performed by Hadoo
p Admins like Cluster balancing, disk space issues etc
- Hadoop Cluster balancing.
- Cluster Membership.
- Adding Disks to Data Nodes
- NameNode Metadata Operations
- Trash Space Configuration
- User Management.
- Space Quota Management.
- Job Schedulers
- Queue setup and management.
- ACL’s for Queues.
- Introduction to Hadoop 2.x.
- Hadoop 2.x features.
- Introduction to YARN and its components.
- Installation and Configuration of YARN.
- Setup Job Queues.
- Namenode HA using Shared Storage.
- Namenode HA using QJM.
- Resource Manager HA.
- Introduction to Data Ingestion.
- Introduction to PIG and its installation.
- Introduction to Hive and its installation.
- Introduction to SQOOP and its installation.
- Introduction to Flume and its installation.
- Examples for Data Ingestion.
- Introduction to HBase.
- HBase Installation.
- HBase with Hive
- Im
- Phoneix with Hbase
- Introduction to Kerberos.
- Installation and Configuring Kerberos.
- Hadoop with Kerberos.
- Securing Hadoop at the OS level.
- Hadoop Cluster Planning.
- Map Reduce Phases.
- Performance tuning.
- Hadoop Benchmarking.
- Introduction to Hadoop Federation.
- Setup Hadoop Federation.
- Introduction to Snapshots and its configuration.
- NFSv3 configuration for Hadoop.
- WebHDFS for REST API calls.
Notă biografică
Gurmukh has over 12 years of experience in Infrastructure design, scalability, performance tuning and distributed Computing. He recently, Co-Founded "Netxillon Technologies", which is into BigData Consultancy services and trainings. Prior to starting his venture, he worked with companies like Yahoo, HP, JP Morgan on various technologies like OpenVMS, Yahoo Web Analytics platform and many network and security appliances. His areas of expertise include Scalability and Performance Engineering, Databases, Optimising Hadoop Infrastructure, Proxy Appliances and Automation. In addition to this he mentors and trains engineers on latest technologies and market trends.
Caracteristici
- Practical Use Cases of the Hadoop production environment.
- Easy to follow with examples and code snippets.
- Hands-on Manual with right mixture of concepts.
- Best practices for Production Clusters.