Cantitate/Preț
Produs

Random Forests with R: Use R!

Autor Robin Genuer, Jean-Michel Poggi
en Limba Engleză Paperback – 11 sep 2020
This book offers an application-oriented guide to random forests: a statistical learning method extensively used in many fields of application, thanks to its excellent predictive performance, but also to its flexibility, which places few restrictions on the nature of the data used. Indeed, random forests can be adapted to both supervised classification problems and regression problems. In addition, they allow us to consider qualitative and quantitative explanatory variables together, without pre-processing. Moreover, they can be used to process standard data for which the number of observations is higher than the number of variables, while also performing very well in the high dimensional case, where the number of variables is quite large in comparison to the number of observations. Consequently, they are now among the preferred methods in the toolbox of statisticians and data scientists. The book is primarily intended for students in academic fields such as statistical education, but also for practitioners in statistics and machine learning. A scientific undergraduate degree is quite sufficient to take full advantage of the concepts, methods, and tools discussed. In terms of computer science skills, little background knowledge is required, though an introduction to the R language is recommended.
Random forests are part of the family of tree-based methods; accordingly, after an introductory chapter, Chapter 2 presents CART trees. The next three chapters are devoted to random forests. They focus on their presentation (Chapter 3), on the variable importance tool (Chapter 4), and on the variable selection problem (Chapter 5), respectively. After discussing the concepts and methods, we illustrate their implementation on a running example. Then, various complements are provided before examining additional examples. Throughout the book, each result is given together with the code (in R) that can be used to reproduce it. Thus, the book offers readersessential information and concepts, together with examples and the software tools needed to analyse data using random forests. 
Citește tot Restrânge

Din seria Use R!

Preț: 41055 lei

Nou

Puncte Express: 616

Preț estimativ în valută:
7857 8106$ 6650£

Carte tipărită la comandă

Livrare economică 04-18 martie

Preluare comenzi: 021 569.72.76

Specificații

ISBN-13: 9783030564841
ISBN-10: 3030564843
Pagini: 98
Ilustrații: X, 98 p. 49 illus., 5 illus. in color. With online files/update.
Dimensiuni: 155 x 235 x 6 mm
Greutate: 0.16 kg
Ediția:1st ed. 2020
Editura: Springer International Publishing
Colecția Springer
Seria Use R!

Locul publicării:Cham, Switzerland

Cuprins

Introduction.- CART trees.- Random forests.- Variable importance.- Variable selection.- References.

Recenzii

“The level of accuracy can be hard to beat. … The worked examples at the end of each chapter are perhaps the most useful feature of this text.” (John H. Maindonald, International Statistical Review, June 2, 2021)

Notă biografică

Robin Genuer is an Assistant Professor of Statistics at the University of Bordeaux and a member of the Inserm U1219 and Inria Bordeaux Sud-Ouest research centres.
Jean-Michel Poggi is a Professor of Statistics at the University of Paris and member of the LMO, the Orsay Mathematics Laboratory (University of Paris Saclay).
They have both produced various research works on random forests and have given numerous lectures and talks on the subject. They have also taught postgraduate and doctoral courses for a variety of audiences. Lastly, they have developed the R package VSURF



Textul de pe ultima copertă

This book offers an application-oriented guide to random forests: a statistical learning method extensively used in many fields of application, thanks to its excellent predictive performance, but also to its flexibility, which places few restrictions on the nature of the data used. Indeed, random forests can be adapted to both supervised classification problems and regression problems. In addition, they allow us to consider qualitative and quantitative explanatory variables together, without pre-processing. Moreover, they can be used to process standard data for which the number of observations is higher than the number of variables, while also performing very well in the high dimensional case, where the number of variables is quite large in comparison to the number of observations. Consequently, they are now among the preferred methods in the toolbox of statisticians and data scientists. The book is primarily intended for students in academic fields such as statistical education, but also for practitioners in statistics and machine learning. A scientific undergraduate degree is quite sufficient to take full advantage of the concepts, methods, and tools discussed. In terms of computer science skills, little background knowledge is required, though an introduction to the R language is recommended.
Random forests are part of the family of tree-based methods; accordingly, after an introductory chapter, Chapter 2 presents CART trees. The next three chapters are devoted to random forests. They focus on their presentation (Chapter 3), on the variable importance tool (Chapter 4), and on the variable selection problem (Chapter 5), respectively. After discussing the concepts and methods, we illustrate their implementation on a running example. Then, various complements are provided before examining additional examples. Throughout the book, each result is given together with the code (in R) that can be used to reproduce it. Thus, the book offers readers essential information and concepts, together with examples and the software tools needed to analyse data using random forests. 

Caracteristici

Offers an application-oriented guide to CART trees and random forests Covers a range of practical issues, and provides real-life examples and R codes Particularly valuable for statisticians wishing to use random forests in applied work, or to analyse datasets, but also for scientists from other fields, as it is accessible for non-statisticians