Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools
Author: Deepak Vohra | Language: English | Paperback – Oct 2016
Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications, each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.
While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.
What You Will Learn:
- Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5
- Run a MapReduce job
- Store data with Apache Hive and Apache HBase
- Index data in HDFS with Apache Solr
- Develop a Kafka messaging system
- Stream logs to HDFS with Apache Flume
- Transfer data from a MySQL database to Hive, HDFS, and HBase with Sqoop
- Create a Hive table over Apache Solr
- Develop a Mahout User Recommender System
Who This Book Is For:
Apache Hadoop developers. Prerequisite knowledge of Linux and some knowledge of Hadoop are required.
Price: 345.11 lei
Old price: 431.39 lei
-20% New
Express points: 518
Estimated price in foreign currency:
66.04€ • 68.53$ • 55.20£
Print-on-demand book
Specifications
ISBN-13: 9781484221983
ISBN-10: 1484221982
Pages: 300
Illustrations: XX, 421 p. 311 illus., 293 illus. in color.
Dimensions: 178 x 254 x 26 mm
Weight: 0.77 kg
Edition: 1st ed.
Publisher: Apress
Series: Apress
Place of publication: Berkeley, CA, United States
Contents
Part I. Fundamentals.- Introduction.- 1. HDFS and MapReduce.- Part II. Storing & Querying.- 2. Apache Hive.- 3. Apache HBase.- Part III. Bulk Transferring & Streaming.- 4. Apache Sqoop.- 5. Apache Flume.- Part IV. Serializing.- 6. Apache Avro.- 7. Apache Parquet.- Part V. Messaging & Indexing.- 8. Apache Kafka.- 9. Apache Solr.- 10. Apache Mahout.
About the Author
Deepak Vohra is a coder, developer, programmer, book author, and technical reviewer.
Back Cover Text
This book is a practical guide to using the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications, each chapter is a practical tutorial on using an Apache Hadoop ecosystem project. While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.
What you'll learn
- How to set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5
- How to run a MapReduce job
- How to store data with Apache Hive and Apache HBase
- How to index data in HDFS with Apache Solr
- How to develop a Kafka messaging system
- How to stream logs to HDFS with Apache Flume
- How to transfer data from a MySQL database to Hive, HDFS, and HBase with Sqoop
- How to create a Hive table over Apache Solr
Features
- In-depth book covering topics that are not covered elsewhere, and how they all work together
- Provides practical examples
- Presents one of the two most popular big data frameworks, Hadoop