User-Defined Tensor Data Analysis: SpringerBriefs in Computer Science
Authors: Bin Dong, Kesheng Wu, Suren Byna. Language: English. Paperback, 30 Sep 2021
This SpringerBrief introduces FasTensor, a parallel data programming model developed for big data applications, and provides a user's guide for installing and using it. FasTensor lets users express many data analysis operations, which may come from neural networks, scientific computing, or queries in traditional database management systems (DBMS). It frees users from the underlying, tedious data management tasks, such as data partitioning, communication, and parallel execution.
This SpringerBrief gives a high-level overview of the state of the art in parallel data programming models and the motivation for the design of FasTensor. It illustrates the FasTensor application programming interface (API) with an abundance of examples and two real use cases from cutting-edge scientific applications. FasTensor can achieve multiple orders of magnitude speedup over Spark and other peer systems in executing big data analysis operations. FasTensor makes programming data analysis operations at large scale on supercomputers as productive and efficient as possible. The book is a complete reference for FasTensor, covering its theoretical foundations, C++ implementation, and usage in applications.
Scientists in domains such as the physical sciences and geosciences who analyze large amounts of data will want to purchase this SpringerBrief. Data engineers who design and develop data analysis software, and data scientists who use Spark or TensorFlow to perform data analyses such as training a deep neural network, will also find this SpringerBrief useful as a reference tool.
Price: 380.38 lei
Old price: 475.48 lei
-20% New
Express points: 571
Estimated price in foreign currency:
72.80€ • 75.03$ • 61.46£
Printed on demand
Economy delivery: 04-18 March
Order line: 021 569.72.76
Specifications
ISBN-13: 9783030707491
ISBN-10: 3030707490
Pages: 101
Illustrations: XII, 101 p. 23 illus.
Dimensions: 155 x 235 mm
Weight: 0.17 kg
Edition: 1st ed. 2021
Publisher: Springer International Publishing
Collection: Springer
Series: SpringerBriefs in Computer Science
Place of publication: Cham, Switzerland
Contents
1. Introduction.- 1.1 Lessons from Big Data Systems.- 1.2 Data Model.- 1.3 Programming Model High-Performance Data Analysis for Science.
2. FasTensor Programming Model.- 2.1 Introduction to Tensor Data Model.- 2.2 FasTensor Programming Model.- 2.2.1 Stencils.- 2.2.2 Chunks.- 2.2.3 Overlap.- 2.2.4 Operator: Transform.- 2.2.5 FasTensor Execution Engine.- 2.2.6 FasTensor Scientific Computing Use Cases.- 2.3 Summary.
3. Illustrated FasTensor User Interface.- 3.1 An Example.- 3.2 The Stencil Class.- 3.2.1 Constructors of the Stencil.- 3.2.2 Parenthesis operator () and ReadPoint.- 3.2.3 SetShape and GetShape.- 3.2.4 SetValue and GetValue.- 3.2.5 ReadNeighbors and WriteNeighbors.- 3.2.6 GetOffsetUpper and GetOffsetLower.- 3.2.7 GetChunkID.- 3.2.8 GetGlobalIndex and GetLocalIndex.- 3.2.9 Exercise of the Stencil class.- 3.3 The Array Class.- 3.3.1 Constructors of Array.- 3.3.2 SetChunkSize, SetChunkSizeByMem, SetChunkSizeByDim, and GetChunkSize.- 3.3.3 SetOverlapSize, SetOverlapSizeByDetection, GetOverlapSize, SetOverlapPadding, and SyncOverlap.- 3.3.4 Transform.- 3.3.5 SetStride and GetStride.- 3.3.6 AppendAttribute, InsertAttribute, GetAttribute and EraseAttribute.- 3.3.7 SetEndpoint and GetEndpoint.- 3.3.8 ControlEndpoint.- 3.3.9 ReadArray and WriteArray.- 3.3.10 SetTag and GetTag.- 3.3.11 GetArraySize and SetArraySize.- 3.3.12 Backup and Restore.- 3.3.13 CreateVisFile.- 3.3.14 ReportCost.- 3.3.15 EP_DIR Endpoint.- 3.3.16 EP_HDF5 and Other Endpoints.- 3.4 Other Functions in FasTensor.- 3.4.1 FT_Init.- 3.4.2 FT_Finalize.- 3.4.3 Data types in FasTensor.
4. FasTensor in Real Scientific Applications.- 4.1 DAS: Distributed Acoustic Sensing.- 4.2 VPIC: Vector Particle-In-Cell.
Appendix.- A.1 Installation Guide of FasTensor.- A.2 How to Develop a New Endpoint Protocol.
Alphabetical Index.- Bibliography.- References.
Biographical note
Dr. Bin Dong is a Research Scientist at Lawrence Berkeley National Laboratory in Berkeley, California, USA. He holds a Ph.D. in computing science and technology. His research interests include big scientific data analysis, parallel computing, parallel I/O, and machine learning. He has co-authored more than 62 technical publications.
Dr. Kesheng Wu is a Senior Scientist at Lawrence Berkeley National Laboratory. He works extensively on data management, data analysis, and scientific computing. He is the developer of a number of widely used algorithms, including FastBit bitmap indexes for querying large scientific datasets, the Thick-Restart Lanczos (TRLan) algorithm for solving eigenvalue problems, and IDEALEM for statistical data reduction and feature extraction. He has co-authored more than 200 technical publications.
Dr. Suren Byna is a Computer Scientist in the Scientific Data Management (SDM) Group at Lawrence Berkeley National Laboratory in Berkeley, California, USA. His research interests are in scalable scientific data management. More specifically, he works on optimizing parallel I/O and on developing systems for managing scientific data. He leads the ExaIO project in the Exascale Computing Project (ECP) that contributes advanced I/O features to HDF5 and develops a new file system called UnifyFS. He also leads efforts that develop object-centric data management systems (Proactive Data Containers - PDC) and experimental and observational data (EOD) management strategies. He has co-authored more than 150 technical publications.
Features
FasTensor can achieve multiple orders of magnitude speedup over Spark and other peer systems in executing big data analysis operations.
FasTensor makes programming data analysis operations at large scale on supercomputers as productive and efficient as possible.
A complete reference for FasTensor, covering its theoretical foundations, C++ implementation, and usage in applications.