Large-Scale Graph Processing Using Apache Giraph
Authors: Sherif Sakr, Faisal Moeen Orakzai, Ibrahim Abdelaziz, Zuhair Khayyat | Language: English | Hardback – 12 Jan 2017
The book is organized as follows: Chapter 1 starts by providing a general background of the big data phenomenon and a general introduction to the Apache Giraph system, its abstraction, programming model and design architecture. Next, Chapter 2 focuses on Giraph as a platform and how to use it. Using a sample job, it also explains more advanced topics such as monitoring the Giraph application lifecycle and different methods for monitoring Giraph jobs. Chapter 3 then provides an introduction to Giraph programming, introduces the basic Giraph graph model and explains how to write Giraph programs. In turn, Chapter 4 discusses in detail the implementation of some popular graph algorithms including PageRank, connected components, shortest paths and triangle closing. Chapter 5 focuses on advanced Giraph programming, discussing common Giraph algorithmic optimizations, tunable Giraph configurations that determine the system’s utilization of the underlying resources, and how to write a custom graph input and output format. Lastly, Chapter 6 highlights two systems that have been introduced to tackle the challenge of large-scale graph processing, GraphX and GraphLab, and explains the main commonalities and differences between these systems and Apache Giraph.
This book serves as an essential reference guide for students, researchers and practitioners in the domain of large-scale graph processing. It offers step-by-step guidance, with several code examples and the complete source code available in the related GitHub repository. Students will find a comprehensive introduction to and hands-on practice with tackling large-scale graph processing problems using the Apache Giraph system, while researchers will discover thorough coverage of the emerging and ongoing advancements in big graph processing systems.
All formats and editions | Price | Express |
---|---|---|
Paperback (1) | 351.26 lei, 6-8 weeks | |
Springer International Publishing – 7 Jul 2018 | 351.26 lei, 6-8 weeks | |
Hardback (1) | 357.40 lei, 6-8 weeks | |
Springer International Publishing – 12 Jan 2017 | 357.40 lei, 6-8 weeks | |
Price: 357.40 lei
Old price: 446.75 lei
-20% New
Express points: 536
Estimated price in foreign currency:
68.43€ • 71.26$ • 56.78£
Printed on demand
Economy delivery: 14-28 February
Phone orders: 021 569.72.76
Specifications
ISBN-13: 9783319474304
ISBN-10: 3319474308
Pages: 241
Illustrations: XXV, 197 p. 102 illus., 87 illus. in color.
Dimensions: 155 x 235 x 14 mm
Weight: 0.49 kg
Edition: 1st ed. 2016
Publisher: Springer International Publishing
Series: Springer
Place of publication: Cham, Switzerland
Contents
1. Introduction.- 2. Getting started with Giraph.- 3. Giraph-In-Action: Implementing Popular Graph Algorithms using Giraph.- 4. Giraph Programming Optimizations: Tips and Tricks.- 5. Similar Systems to Giraph.- 6. Conclusions.
Reviews
“This volume is a cookbook on Giraph. … Its virtue is that it will help newcomers to Giraph to get up and running quickly. … Users who need to bring up Giraph quickly and who have no experience with the Hadoop-Giraph ecosystem will find the volume a helpful introduction to these powerful tools.” (Computing Reviews, October, 2017)
Biographical note
Sherif Sakr is currently a professor of computer and information science in the Health Informatics department at King Saud bin Abdulaziz University for Health Sciences. He is also affiliated with the University of New South Wales and DATA61/CSIRO (formerly NICTA). He has held visiting appointments at several academic and research institutes, including Microsoft Research (2011), Alcatel-Lucent Bell Labs (2012), Humboldt University of Berlin (2015), University of Zurich (2016) and TU Dresden (2016). In 2013, Sherif was awarded the Stanford Innovation and Entrepreneurship Certificate.
Faisal Moeen Orakzai is a joint PhD candidate at Université Libre de Bruxelles (ULB), Belgium, and Aalborg University (AAU), Denmark. In addition to doing research, he works as a consultant and helps companies set up their distributed data processing architectures and pipelines. He is a Big Data management and analytics enthusiast and is currently working on a Giraph-based framework for spatio-temporal pattern mining.
Ibrahim Abdelaziz is a Computer Science PhD candidate at King Abdullah University of Science and Technology (KAUST). Prior to joining KAUST, he worked on pattern recognition and information retrieval in several research organizations in Egypt. His current research interests are Data Mining over large-scale graphs, Distributed Systems and Machine Learning.
Zuhair Khayyat is a PhD candidate in the InfoCloud group at King Abdullah University of Science and Technology (KAUST) focusing on Big Data, Analytics and Graphs.
Back cover text
This book takes its reader on a journey through Apache Giraph, a popular distributed graph processing platform designed to bring the power of big data processing to graph data. Designed as a step-by-step self-study guide for everyone interested in large-scale graph processing, it describes the fundamental abstractions of the system, its programming models and various techniques for using the system to process graph data at scale, including the implementation of several popular and advanced graph analytics algorithms.
Features
Describes the fundamental abstractions of Apache Giraph, its programming models and various techniques
Offers step-by-step coverage of the implementation of several popular and advanced graph analytics algorithms, including related optimization details
All source code presented in the book is available for download from an associated GitHub repository
Includes supplementary material: sn.pub/extras