Cantitate/Preț
Produs

Problem-solving in High Performance Computing: A Situational Awareness Approach with Linux

Autor Igor Ljubuncic
en Limba Engleză Paperback – 27 sep 2015
Problem-Solving in High Performance Computing: A Situational Awareness Approach with Linux focuses on understanding giant computing grids as cohesive systems. Unlike other titles on general problem-solving or system administration, this book offers a cohesive approach to complex, layered environments, highlighting the difference between standalone system troubleshooting and complex problem-solving in large, mission critical environments, and addressing the pitfalls of information overload, micro, and macro symptoms, also including methods for managing problems in large computing ecosystems.
The authors offer perspective gained from years of developing Intel-based systems that lead the industry in the number of hosts, software tools, and licenses used in chip design. The book offers unique, real-life examples that emphasize the magnitude and operational complexity of high performance computer systems.


  • Provides insider perspectives on challenges in high performance environments with thousands of servers, millions of cores, distributed data centers, and petabytes of shared data
  • Covers analysis, troubleshooting, and system optimization, from initial diagnostics to deep dives into kernel crash dumps
  • Presents macro principles that appeal to a wide range of users and various real-life, complex problems
  • Includes examples from 24/7 mission-critical environments with specific HPC operational constraints
Citește tot Restrânge

Preț: 38162 lei

Preț vechi: 56611 lei
-33% Nou

Puncte Express: 572

Preț estimativ în valută:
7304 7705$ 6087£

Carte tipărită la comandă

Livrare economică 26 decembrie 24 - 09 ianuarie 25

Preluare comenzi: 021 569.72.76

Specificații

ISBN-13: 9780128010198
ISBN-10: 0128010193
Pagini: 320
Dimensiuni: 191 x 235 x 8 mm
Greutate: 0.66 kg
Editura: ELSEVIER SCIENCE

Cuprins

  1. Identifying Problems
  2. Beginning an Investigation
  3. First Level Debugging and Analysis
  4. System Internals
  5. Systematic Troubleshooting
  6. Analyzing Crashed Applications
  7. Solving Problems
  8. Monitoring and Prevention
  9. Implementing Safe Policies
  10. Fine-tuning System Performance
  11. Summary and Conclusions