Data Mining Algorithms in C++: Data Patterns and Algorithms for Modern Applications
Autor Timothy Mastersen Limba Engleză Paperback – 19 dec 2017
Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanation of operation, essential equations, references to more rigorous theory, and commented C++ source code.
Many of these techniques are recent developments, still not in widespread use. Others are standard algorithms given a fresh look. In every case, the focus is on practical applicability, with all code written in such a way that it can easily be included into any program. The Windows-based DATAMINE program lets you experiment with the techniques before incorporating them into your own work.
What You'll Learn
- Use Monte-Carlo permutation tests to provide statistically sound assessments of relationships present in your data
- Discover how combinatorially symmetric cross validation reveals whether your model has true power or has just learned noise by overfitting the data
- Work with feature weighting as regularized energy-based learning to rank variables according to their predictive power when there is too little data for traditional methods
- See how the eigenstructure of a dataset enables clustering of variables into groups that exist only within meaningful subspaces of the data
- Plot regions of the variable space where there is disagreement between marginal and actual densities, or where contribution to mutual information is high
Who This Book Is For
Anyone interested in discovering and exploiting relationships among variables. Although all code examples are written in C++, the algorithms are described in sufficient detail that they can easily be programmed in any language.
Preț: 469.16 lei
Preț vechi: 586.45 lei
-20% Nou
Puncte Express: 704
Preț estimativ în valută:
89.79€ • 93.27$ • 74.58£
89.79€ • 93.27$ • 74.58£
Carte tipărită la comandă
Livrare economică 03-17 februarie 25
Preluare comenzi: 021 569.72.76
Specificații
ISBN-13: 9781484233146
ISBN-10: 148423314X
Pagini: 228
Ilustrații: XIV, 286 p.
Dimensiuni: 178 x 254 x 20 mm
Greutate: 0.53 kg
Ediția:1st ed.
Editura: Apress
Colecția Apress
Locul publicării:Berkeley, CA, United States
ISBN-10: 148423314X
Pagini: 228
Ilustrații: XIV, 286 p.
Dimensiuni: 178 x 254 x 20 mm
Greutate: 0.53 kg
Ediția:1st ed.
Editura: Apress
Colecția Apress
Locul publicării:Berkeley, CA, United States
Cuprins
1. Information and Entropy.- 2. Screening for Relationships.- 3. Displaying Relationship Anomalies.- 4. Fun With Eigenvectors.- 5. Using the DATAMINE Program.
Notă biografică
Timothy Masters has a PhD in statistics and is an experienced programmer. His dissertation was in image analysis. His career moved in the direction of signal processing, and for the last 25 years he's been involved in the development of automated trading systems in various financial markets.
Textul de pe ultima copertă
Find the various relationships among variables that can be present in big data as well as other data sets. This book also covers information entropy, permutation tests, combinatorics, predictor selections, and eigenvalues to give you a well-rounded view of data mining and algorithms in C++.
Furthermore, Data Mining Algorithms in C++ includes classic techniques that are widely available in standard statistical packages, such as maximum likelihood factor analysis and varimax rotation. After reading and using this book, you'll come away with many code samples and routines that can be repurposed into your own data mining tools and algorithms toolbox. This will allow you to integrate these techniques in your various data and analysis projects.
You will:
- Discover useful data mining techniques and algorithms using the C++ programming language
- Carry out permutation tests
- Work with the various relationships and screening types for these relationships
- Master predictor selections
- Use the DATAMINE program
Caracteristici
An expert-driven data mining and algorithms in C++ book Data mining is an important topic in big data Algorithms are also a critical topic of growing importance