Data Mining and Exploration: From Traditional Statistics to Modern Data Science
Author: Chong Ho Alex Yu | Language: English | Paperback – 4 Oct 2024
First, most students in the social sciences, engineering, and business take at least one introductory statistics class before learning data science. However, these courses usually do not discuss the similarities and differences between traditional statistics and modern data science; as a result, learners are disoriented by this seemingly drastic paradigm shift. In reaction, some traditionalists reject data science altogether, while some beginning data analysts employ data mining tools as a “black box”, without a comprehensive view of the foundational differences between traditional and modern methods (e.g., dichotomous thinking vs. pattern recognition, confirmation vs. exploration, single method vs. triangulation, single sample vs. cross-validation). This book delineates the transition between classical methods and data science (e.g., from p value to Log Worth, from resampling to ensemble methods, from content analysis to text mining).
Second, this book aims to widen the learner's horizon by covering a plethora of software tools. When a technician has a hammer, every problem seems to be a nail. By the same token, many textbooks focus on a single software package, and consequently the learner tends to fit the problem to the tool rather than the other way around. To rectify the situation, a competent analyst should be equipped with a tool set rather than a single tool. For example, when the analyst works with crucial data in a highly regulated industry, such as pharmaceuticals or banking, commercial software modules (e.g., SAS) are indispensable. For mid-size and small companies, open-source packages such as Python come in handy. If the research goal is to create an executive summary quickly, the logical choice is rapid model comparison. If the analyst would like to explore the data by asking what-if questions, then dynamic graphing in JMP Pro is a better option.
This book uses concrete examples to explain the pros and cons of various software applications.
All formats and editions | Price | Express |
---|---|---|
Paperback (1) | 303.56 lei 6-8 weeks | |
CRC Press – 4 Oct 2024 | 303.56 lei 6-8 weeks | |
Hardback (1) | 754.55 lei 6-8 weeks | |
CRC Press – 27 Oct 2022 | 754.55 lei 6-8 weeks | |
Price: 303.56 lei
Old price: 439.18 lei
-31% New
Express points: 455
Estimated price in foreign currency:
58.10€ • 61.29$ • 48.42£
Printed on demand
Economy delivery: 02-16 January 25
Order line: 021 569.72.76
Specifications
ISBN-13: 9780367721510
ISBN-10: 0367721511
Pages: 290
Illustrations: 212
Dimensions: 156 x 234 mm
Weight: 0.54 kg
Edition: 1
Publisher: CRC Press
Series: CRC Press
Place of publication: Boca Raton, United States
Target audience
Academic
Biographical note
Chong Ho Alex Yu has a Ph.D. in Educational Psychology with a focus on Measurement, Statistics, and Methodological Studies, and a Ph.D. in Philosophy with specialization in History and Philosophy of Science (Arizona State University). He joined Azusa Pacific University in 2012 and has served in various positions, including Director of Data Analytics, Professor of Behavioural and Applied Science, Adjunct Faculty of Mathematics, Quantitative Research Consultant, and Committee chair of the Big Data Discovery Summit.
Contents
1. Re-examination of Traditional Statistics
2. Why Data Science?
3. Cutting Edge Data Analytical Tools
4. Exploratory Data Analysis and Data Visualization: Pattern Seeking
5. Generalized Regression: Penalty against Complexity
6. Classification and Model Screening
7. Ensemble Methods: The Wisdom of the Crowd
8. Dimension Reduction: Breaking the Curse of Dimensionality
9. Clustering: Divide and Conquer
10. Neural Networks: Machines Mimic Human Intelligence
11. Text Mining: Structure the Unstructured
Description
This book covers both the conceptual and procedural aspects of cutting-edge data mining methods, as well as the path for transitioning from classical statistics to modern data science. Readers will learn how various open-source and commercial software modules can be properly applied to different situations.