Data Profiling: Synthesis Lectures on Data Management
Autor Ziawasch Abedjan, Lukasz Golab, Felix Naumann, Thorsten Papenbrocken Limba Engleză Paperback – 8 noi 2018
This book provides a classification of the various types of profilable metadata, discusses popular data profiling tasks,and surveys state-of-the-art profiling algorithms. While most of the book focuses on tasks and algorithms for relational data profiling, we also briefly discuss systems and techniques for profiling non-relational data such as graphs and text. We conclude with a discussion of data profiling challenges and directions for future work in this area.
Din seria Synthesis Lectures on Data Management
- 20% Preț: 275.28 lei
- 20% Preț: 272.81 lei
- 20% Preț: 298.32 lei
- 20% Preț: 378.71 lei
- 20% Preț: 133.37 lei
- 20% Preț: 162.07 lei
- 20% Preț: 322.81 lei
- 20% Preț: 220.07 lei
- 20% Preț: 220.89 lei
- 20% Preț: 176.27 lei
- 20% Preț: 218.79 lei
- 20% Preț: 162.07 lei
- 20% Preț: 163.36 lei
- 20% Preț: 323.93 lei
- 20% Preț: 349.57 lei
- 20% Preț: 219.43 lei
- 20% Preț: 379.68 lei
- 20% Preț: 221.53 lei
- 20% Preț: 225.59 lei
- 20% Preț: 218.79 lei
- 20% Preț: 379.68 lei
- 20% Preț: 173.68 lei
- 20% Preț: 162.07 lei
- 20% Preț: 220.40 lei
- 20% Preț: 219.93 lei
- 20% Preț: 161.56 lei
- 20% Preț: 221.53 lei
- 20% Preț: 223.17 lei
- 20% Preț: 223.00 lei
- 20% Preț: 218.79 lei
- 20% Preț: 223.00 lei
- 20% Preț: 217.96 lei
- 20% Preț: 377.94 lei
- 20% Preț: 346.82 lei
- 20% Preț: 222.67 lei
- 20% Preț: 222.35 lei
- 20% Preț: 323.46 lei
- 20% Preț: 350.52 lei
- 20% Preț: 245.51 lei
- 20% Preț: 411.12 lei
- 20% Preț: 220.07 lei
- 20% Preț: 218.79 lei
- 20% Preț: 221.07 lei
- 20% Preț: 265.09 lei
- 20% Preț: 220.40 lei
- 20% Preț: 163.36 lei
- 20% Preț: 245.51 lei
Preț: 348.75 lei
Preț vechi: 435.93 lei
-20% Nou
Puncte Express: 523
Preț estimativ în valută:
66.74€ • 69.33$ • 55.44£
66.74€ • 69.33$ • 55.44£
Carte tipărită la comandă
Livrare economică 03-17 februarie 25
Preluare comenzi: 021 569.72.76
Specificații
ISBN-13: 9783031007378
ISBN-10: 3031007379
Pagini: 136
Ilustrații: XV, 136 p.
Dimensiuni: 191 x 235 mm
Greutate: 0.28 kg
Editura: Springer International Publishing
Colecția Springer
Seria Synthesis Lectures on Data Management
Locul publicării:Cham, Switzerland
ISBN-10: 3031007379
Pagini: 136
Ilustrații: XV, 136 p.
Dimensiuni: 191 x 235 mm
Greutate: 0.28 kg
Editura: Springer International Publishing
Colecția Springer
Seria Synthesis Lectures on Data Management
Locul publicării:Cham, Switzerland
Cuprins
Preface.- Acknowledgments.- Discovering Metadata.- Data Profiling Tasks.- Single-Column Analysis.- Dependency Discovery.- Relaxed and Other Dependencies.- Use Cases.- Profiling Non-Relational Data.- Data Profiling Tools.- Data Profiling Challenges.- Conclusions.- Bibliography.- Authors' Biographies .
Notă biografică
Ziawasch Abedjan is Assistant Professor and Head of the ""Big Data Management"" (BigDaMa) Group at the Technische Universitat Berlin. Before Ziawasch was a postdoc at the ""Computer Science and Artificial Intelligence Laboratory"" at MIT working on various data integration topics. Ziawasch received his Ph.D. from the Hasso Plattner Institute in Potsdam, Germany. His research interests include, data mining, data integration, and data profiling.
Lukasz Golab is an Associate Professor at the University of Waterloo and a Canada Research Chair. Prior to joining Waterloo, he was a Senior Member of Research Staff at AT&T Labs in Florham Park, NJ, USA. He holds a B.Sc. in Computer Science (with High Distinction) from the University of Toronto and a Ph.D. in Computer Science (with Alumni Gold Medal) from the University of Waterloo. His publications span several research areas within data management and data analytics, including data stream management, data profiling, data quality, data science for social good, and educational data mining.
Felix Naumann studied mathematics, economy, and computer sciences at the University of Technology in Berlin. After receiving his diploma in 1997 he joined the graduate school ""Distributed Information Systems"" at Humboldt University of Berlin. He completed his Ph.D. thesis on ""Quality-driven Query Answering"" in 2000. In 2001 and 2002 he worked at the IBM Almaden Research Center on topics around data integration. From 2003-2006 he was an assistant professor of information integration at the Humboldt University of Berlin. Since 2006 he has held the chair for information systems at the Hasso Plattner Institute at the University of Potsdam in Germany. He is Editor-in-Chief of the Information Systems journal. His research interests are in the areas of information integration, data quality, data cleansing, text extraction, and-of course-data profiling. He has given numerous invited talks and tutorials on the topic of the book.
Thorsten Papenbrock is a researcher and lecturer at the Hasso Plattner Institute at the University of Potsdam in Germany. He received his M.Sc. in IT-Systems Engineering in 2014 and his Ph.D. in Computer Science in 2017. His thesis on ""Data Profiling-Efficient Discovery of Dependencies"" inspired many sections of this book. In research, his main interests are data profiling, data cleaning, distributed and parallel computing, database systems, and data analytics.