Statistical and Machine-Learning Data Mining:: Techniques for Better Predictive Modeling and Analysis of Big Data, Third Edition
Autor Bruce Ratneren Limba Engleză Paperback – 30 iun 2020
What is new in the Third Edition:
- The current chapters have been completely rewritten.
- The core content has been extended with strategies and methods for problems drawn from the top predictive analytics conference and statistical modeling workshops.
- Adds thirteen new chapters including coverage of data science and its rise, market share estimation, share of wallet modeling without survey data, latent market segmentation, statistical regression modeling that deals with incomplete data, decile analysis assessment in terms of the predictive power of the data, and a user-friendly version of text mining, not requiring an advanced background in natural language processing (NLP).
- Includes SAS subroutines which can be easily converted to other languages.
As in the previous edition, this book offers detailed background, discussion, and illustration of specific methods for solving the most commonly experienced problems in predictive modeling and analysis of big data. The author addresses each methodology and assigns its application to a specific type of problem. To better ground readers, the book provides an in-depth discussion of the basic methodologies of predictive modeling and analysis. While this type of overview has been attempted before, this approach offers a truly nitty-gritty, step-by-step method that both tyros and experts in the field can enjoy playing with.
Toate formatele și edițiile | Preț | Express |
---|---|---|
Paperback (1) | 263.79 lei 6-8 săpt. | |
CRC Press – 30 iun 2020 | 263.79 lei 6-8 săpt. | |
Hardback (1) | 631.38 lei 6-8 săpt. | |
CRC Press – iun 2017 | 631.38 lei 6-8 săpt. |
Preț: 263.79 lei
Preț vechi: 381.73 lei
-31% Nou
Puncte Express: 396
Preț estimativ în valută:
50.49€ • 53.26$ • 42.07£
50.49€ • 53.26$ • 42.07£
Carte tipărită la comandă
Livrare economică 02-16 ianuarie 25
Preluare comenzi: 021 569.72.76
Specificații
ISBN-13: 9780367573607
ISBN-10: 0367573601
Pagini: 690
Dimensiuni: 178 x 254 mm
Greutate: 0.45 kg
Ediția:3
Editura: CRC Press
Colecția Chapman and Hall/CRC
ISBN-10: 0367573601
Pagini: 690
Dimensiuni: 178 x 254 mm
Greutate: 0.45 kg
Ediția:3
Editura: CRC Press
Colecția Chapman and Hall/CRC
Cuprins
Preface to Third Edition
Preface of Second Edition
Acknowledgments
Author
1. Introduction
2. Science Dealing with Data: Statistics and Data Science
3. Two Basic Data Mining Methods for Variable Assessment
4. CHAID-Based Data Mining for Paired-Variable Assessment
5. The Importance of Straight Data Simplicity and Desirability for Good Model-Building Practice
6. Symmetrizing Ranked Data: A Statistical Data Mining Method for Improving the Predictive Power of Data
7. Principal Component Analysis: A Statistical Data Mining Method for Many-Variable Assessment
8. Market Share Estimation: Data Mining for an Exceptional Case
9. The Correlation Coefficient: Its Values Range between Plus and Minus 1, or Do They?
10. Logistic Regression: The Workhorse of Response Modeling
11. Predicting Share of Wallet without Survey Data
12. Ordinary Regression: The Workhorse of Profit Modeling
13. Variable Selection Methods in Regression: Ignorable Problem, Notable Solution
14. CHAID for Interpreting a Logistic Regression Model
15. The Importance of the Regression Coefficient
16. The Average Correlation: A Statistical Data Mining Measure for Assessment of Competing Predictive Models and the Importance of the Predictor Variables
17. CHAID for Specifying a Model with Interaction Variables
18. Market Segmentation Classification Modeling with Logistic Regression
19. Market Segmentation Based on Time-Series Data Using Latent Class Analysis
20. Market Segmentation: An Easy Way to Understand the Segments
21. The Statistical Regression Model: An Easy Way to Understand the Model
22. CHAID as a Method for Filling in Missing Values
23. Model Building with Big Complete and Incomplete Data
24. Art, Science, Numbers, and Poetry
25. Identifying Your Best Customers: Descriptive, Predictive, and Look-Alike Profiling
26. Assessment of Marketing Models
27. Decile Analysis: Perspective and Performance
28. Net T-C Lift Model: Assessing the Net Effects of Test and Control Campaigns
29. Bootstrapping in Marketing: A New Approach for Validating Models
30. Validating the Logistic Regression Model: Try Bootstrapping
31. Visualization of Marketing Models: Data Mining to Uncover Innards of a Model
32. The Predictive Contribution Coefficient: A Measure of Predictive Importance
33. Regression Modeling Involves Art, Science, and Poetry, Too
34. Opening the Dataset: A Twelve-Step Program for Dataholics
35. Genetic and Statistic Regression Models: A Comparison
36. Data Reuse: A Powerful Data Mining Effect of the GenIQ Model
37. A Data Mining Method for Moderating Outliers Instead of Discarding Them
38. Overfitting: Old Problem, New Solution
39. The Importance of Straight Data: Revisited
40. The GenIQ Model: Its Definition and an Application
41. Finding the Best Variables for Marketing Models
42. Interpretation of Coefficient-Free Models
43. Text Mining: Primer, Illustration, and TXTDM Software
44. Some of My Favorite Statistical Subroutines
Index
Preface of Second Edition
Acknowledgments
Author
1. Introduction
2. Science Dealing with Data: Statistics and Data Science
3. Two Basic Data Mining Methods for Variable Assessment
4. CHAID-Based Data Mining for Paired-Variable Assessment
5. The Importance of Straight Data Simplicity and Desirability for Good Model-Building Practice
6. Symmetrizing Ranked Data: A Statistical Data Mining Method for Improving the Predictive Power of Data
7. Principal Component Analysis: A Statistical Data Mining Method for Many-Variable Assessment
8. Market Share Estimation: Data Mining for an Exceptional Case
9. The Correlation Coefficient: Its Values Range between Plus and Minus 1, or Do They?
10. Logistic Regression: The Workhorse of Response Modeling
11. Predicting Share of Wallet without Survey Data
12. Ordinary Regression: The Workhorse of Profit Modeling
13. Variable Selection Methods in Regression: Ignorable Problem, Notable Solution
14. CHAID for Interpreting a Logistic Regression Model
15. The Importance of the Regression Coefficient
16. The Average Correlation: A Statistical Data Mining Measure for Assessment of Competing Predictive Models and the Importance of the Predictor Variables
17. CHAID for Specifying a Model with Interaction Variables
18. Market Segmentation Classification Modeling with Logistic Regression
19. Market Segmentation Based on Time-Series Data Using Latent Class Analysis
20. Market Segmentation: An Easy Way to Understand the Segments
21. The Statistical Regression Model: An Easy Way to Understand the Model
22. CHAID as a Method for Filling in Missing Values
23. Model Building with Big Complete and Incomplete Data
24. Art, Science, Numbers, and Poetry
25. Identifying Your Best Customers: Descriptive, Predictive, and Look-Alike Profiling
26. Assessment of Marketing Models
27. Decile Analysis: Perspective and Performance
28. Net T-C Lift Model: Assessing the Net Effects of Test and Control Campaigns
29. Bootstrapping in Marketing: A New Approach for Validating Models
30. Validating the Logistic Regression Model: Try Bootstrapping
31. Visualization of Marketing Models: Data Mining to Uncover Innards of a Model
32. The Predictive Contribution Coefficient: A Measure of Predictive Importance
33. Regression Modeling Involves Art, Science, and Poetry, Too
34. Opening the Dataset: A Twelve-Step Program for Dataholics
35. Genetic and Statistic Regression Models: A Comparison
36. Data Reuse: A Powerful Data Mining Effect of the GenIQ Model
37. A Data Mining Method for Moderating Outliers Instead of Discarding Them
38. Overfitting: Old Problem, New Solution
39. The Importance of Straight Data: Revisited
40. The GenIQ Model: Its Definition and an Application
41. Finding the Best Variables for Marketing Models
42. Interpretation of Coefficient-Free Models
43. Text Mining: Primer, Illustration, and TXTDM Software
44. Some of My Favorite Statistical Subroutines
Index
Recenzii
"I bought your book as it seemed to have the right mixture of statistical theory, practice, and common sense – finally! You can find the first often; the second occasionally; but the third, esp. in combination with the first two – never. I cannot thank you enough, Bruce! You are brilliant at assimilating, stating the underlying principles of analyses." ~Sandra Hendren, Sr. Lecturer, Harvard"Bruce Ratner’s recent 3rd edition of "Statistical and Machine-Learning Data Mining" is the best I’ve seen in my long career. It provides insightful methods for data mining, and innovative techniques for predictive analytics. The book is a valuable resource for experienced and newbie data scientists. Bruce’s book is my new data science bible. It is written in a clear style, and is an enjoyable read as it includes historical notes, which flow with the material." ~Jack Theurer, President, G. Theurer Assoc. Inc.
"Your book has been very helpful when I was reviewing the manual for the Automatic Linear Modeling (ALM) in SPSS. It offers many insightful perspectives to use for future ALM features and improvements. This book is an excellent contribution to the literature of statistics, data mining, and machine learning. Thank you, Bruce." ~Patrick Yan, PhD, Professor, Arizona State Univ.
"I heard one of my instructors in Coursera mention Bruce Ratner’s new book "Statistical and Machine-Learning Data Mining" during an online chat when he became tired of answering questions." ~Mike Richardson, Head of Hardware, Smartfrog, Inc.
"Your book has been very helpful when I was reviewing the manual for the Automatic Linear Modeling (ALM) in SPSS. It offers many insightful perspectives to use for future ALM features and improvements. This book is an excellent contribution to the literature of statistics, data mining, and machine learning. Thank you, Bruce." ~Patrick Yan, PhD, Professor, Arizona State Univ.
"I heard one of my instructors in Coursera mention Bruce Ratner’s new book "Statistical and Machine-Learning Data Mining" during an online chat when he became tired of answering questions." ~Mike Richardson, Head of Hardware, Smartfrog, Inc.
Descriere
The third edition of a bestseller, Statistical and Machine-Learning Data Mining: Techniques for Better Predictive Modeling and Analysis of Big Data is still the only book, to date, to distinguish between statistical data mining and machine-learning data mining.