Pro Hadoop Data Analytics: Designing and Building Big Data Systems using the Hadoop Ecosystem
Autor Kerry Koitzschen Limba Engleză Paperback – 29 dec 2016
Learn advanced analytical techniques and leverage existing tool kits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems that go beyond the basics of classification, clustering, and recommendation.
Pro Hadoop Data Analytics emphasizes best practices to ensure coherent, efficient development. A complete example system will be developed using standard third-party components that consist of the tool kits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system.
The book also highlights the importance of end-to-end, flexible, configurable, high-performance data pipeline systems with analytical components as well as appropriate visualization results. You'll discover the importance of mix-and-match or hybrid systems, using different analytical components in one application. This hybrid approach will be prominent in the examples.
What You'll Learn
Pro Hadoop Data Analytics emphasizes best practices to ensure coherent, efficient development. A complete example system will be developed using standard third-party components that consist of the tool kits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system.
The book also highlights the importance of end-to-end, flexible, configurable, high-performance data pipeline systems with analytical components as well as appropriate visualization results. You'll discover the importance of mix-and-match or hybrid systems, using different analytical components in one application. This hybrid approach will be prominent in the examples.
What You'll Learn
- Build big data analytic systems with the Hadoop ecosystem
- Use libraries, tool kits, and algorithms to make development easier and more effective
- Apply metrics to measure performance and efficiency of components and systems
- Connect to standard relational databases, noSQL data sources, and more
- Follow case studies with example components to create your own systems
Who This Book Is For
Software engineers, architects, and data scientists with an interest in the design and implementation of big data analytical systems using Hadoop, the Hadoop ecosystem, and other associated technologies.
Preț: 263.66 lei
Preț vechi: 329.56 lei
-20% Nou
Puncte Express: 395
Preț estimativ în valută:
50.46€ • 52.49$ • 42.29£
50.46€ • 52.49$ • 42.29£
Carte tipărită la comandă
Livrare economică 13-27 martie
Preluare comenzi: 021 569.72.76
Specificații
ISBN-13: 9781484219096
ISBN-10: 1484219090
Pagini: 298
Ilustrații: XXI, 298 p. 161 illus., 152 illus. in color.
Dimensiuni: 178 x 254 x 17 mm
Greutate: 0.56 kg
Ediția:1st ed.
Editura: Apress
Colecția Apress
Locul publicării:Berkeley, CA, United States
ISBN-10: 1484219090
Pagini: 298
Ilustrații: XXI, 298 p. 161 illus., 152 illus. in color.
Dimensiuni: 178 x 254 x 17 mm
Greutate: 0.56 kg
Ediția:1st ed.
Editura: Apress
Colecția Apress
Locul publicării:Berkeley, CA, United States
Cuprins
Chapter 1: Overview: Building Data Analytic Systems with Hadoop.- Chapter 2: A Scala and Python Refresher.- Chapter 3: Standard Toolkits for Hadoop and Analytics.- Chapter 4: Relational, noSQL, and Graph Databases.- Chapter 5: Data Pipelines and How to Construct Them.- Chapter 6: Advanced Search Techniques with Hadoop, Lucene, and Solr.- Chapter 7: An Overview of Analytical Techniques and Algorithms.- Chapter 8: Rule Engines, System Control, and System Orchestration.- Chapter 9: Putting it All Together: Designing a Complete Analytical System.- Chapter 10: Data Visualizers: Seeing and Interacting with the Analysis.- Chapter 11: A Case Study in Bioinformatics: Analyzing Microscope Slide Data.- Chapter 12: A Bayesian Analysis Software Component: Identifying Credit Card Fraud.- Chapter 13: Searching for Oil: Geological Data Analysis with Mahout.- Chapter 14: ‘Image as Big Data’ Systems: Some Case Studies.- Chapter 15:A Generic Data Pipeline Analytical System.- Chapter 16: Conclusions and The Future of Big Data Analysis.
Notă biografică
Kerry Koitzsch is a software engineer and interested in the early history of science, particularly chemistry. He frequently publishes papers and attends conferences on scientific and historical topics, including early chemistry and alchemy, and sociology of science. He has presented many lectures, talks, and demonstrations on a variety of subjects for the United States Army, the Society for Utopian Studies, American Association for Artificial Intelligence (AAAI), Association for Studies in Esotericism (ASE), and others. He has also published several papers and written two historical books.
Kerry was educated at Interlochen Arts Academy, MIT, and the San Francisco Conservatory of Music. He served in the United States Army and United States Army Reserve, and is the recipient of the United States Army Achievement Medal. He has been a software engineer specializing in computer vision, machine learning, and database technologies for 30 years, and currently lives and works in Sunnyvale, California.
Textul de pe ultima copertă
Learn advanced analytical techniques and leverage existing toolkits to make your analytic applications more powerful, precise, and efficient. This book provides the right combination of architecture, design, and implementation information to create analytical systems which go beyond the basics of classification, clustering, and recommendation.
In Pro Hadoop Data Analytics best practices are emphasized to ensure coherent, efficient development. A complete example system will be developed using standard third-party components which will consist of the toolkits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system.
The book emphasizes four important topics:
In Pro Hadoop Data Analytics best practices are emphasized to ensure coherent, efficient development. A complete example system will be developed using standard third-party components which will consist of the toolkits, libraries, visualization and reporting code, as well as support glue to provide a working and extensible end-to-end system.
The book emphasizes four important topics:
- The importance of end-to-end, flexible, configurable, high-performance data pipeline systems with analytical components as well as appropriate visualization results. Deep-dive topics will include Spark, H20,Vopal Wabbit (NLP), Stanford NLP, and other appropriate toolkits and plugins. Best practices and structured design principles. This will include strategic topics as well as the how to example portions.
- The importance of mix-and-match or hybrid systems, using different analytical components in one application to accomplish application goals. The hybrid approach will be prominent in the examples.
- Use of existing third-party libraries is key to effective development. Deep dive examples of the functionality of some of these toolkits will be showcased as you develop the example system.
Caracteristici
Provides useful code examples of real-world situations and solutions to common problems Provides an end-to-end example solution which can be expanded upon by the reader Gives extensive case studies and application examples from a variety of domains and problem areas