Introduction to Data Science: Data Analysis and Prediction Algorithms with R: Chapman & Hall/CRC Data Science Series
Autor Rafael A. Irizarryen Limba Engleză Hardback – 8 noi 2019
This book is a textbook for a first course in data science. No previous knowledge of R is necessary, although some experience with programming may be helpful. The book is divided into six parts: R, data visualization, statistics with R, data wrangling, machine learning, and productivity tools. Each part has several chapters meant to be presented as one lecture.
The author uses motivating case studies that realistically mimic a data scientist’s experience. He starts by asking specific questions and answers these through data analysis so concepts are learned as a means to answering the questions. Examples of the case studies included are: US murder rates by state, self-reported student heights, trends in world health and economics, the impact of vaccines on infectious disease rates, the financial crisis of 2007-2008, election forecasting, building a baseball team, image processing of hand-written digits, and movie recommendation systems.
The statistical concepts used to answer the case study questions are only briefly introduced, so complementing with a probability and statistics textbook is highly recommended for in-depth understanding of these concepts. If you read and understand the chapters and complete the exercises, you will be prepared to learn the more advanced concepts and skills needed to become an expert.
A complete solutions manual is available to registered instructors who require the text for a course.
Din seria Chapman & Hall/CRC Data Science Series
- 9% Preț: 594.74 lei
- Preț: 387.02 lei
- 20% Preț: 392.81 lei
- 20% Preț: 283.90 lei
- 20% Preț: 289.32 lei
- Preț: 326.56 lei
- 20% Preț: 285.16 lei
- 8% Preț: 421.32 lei
- Preț: 389.66 lei
- 8% Preț: 379.33 lei
- 20% Preț: 520.89 lei
- 20% Preț: 305.38 lei
- 8% Preț: 416.75 lei
- 20% Preț: 548.30 lei
- 20% Preț: 246.02 lei
- Preț: 364.40 lei
- 8% Preț: 428.70 lei
- 20% Preț: 430.05 lei
- 20% Preț: 388.75 lei
- 8% Preț: 548.14 lei
- 8% Preț: 455.98 lei
- 20% Preț: 332.19 lei
- 20% Preț: 324.37 lei
- 9% Preț: 771.25 lei
- 20% Preț: 368.92 lei
- 8% Preț: 418.23 lei
- 20% Preț: 390.16 lei
- 20% Preț: 432.71 lei
- 20% Preț: 370.91 lei
- 20% Preț: 368.88 lei
- 20% Preț: 456.44 lei
- 20% Preț: 390.27 lei
- 8% Preț: 411.42 lei
- 30% Preț: 444.87 lei
- 8% Preț: 401.38 lei
- 20% Preț: 365.10 lei
- 9% Preț: 572.78 lei
- 31% Preț: 651.51 lei
- 20% Preț: 989.00 lei
Preț: 506.99 lei
Preț vechi: 663.89 lei
-23% Nou
Puncte Express: 760
Preț estimativ în valută:
97.03€ • 100.91$ • 81.20£
97.03€ • 100.91$ • 81.20£
Cartea nu se mai tipărește
Doresc să fiu notificat când acest titlu va fi disponibil:
Se trimite...
Preluare comenzi: 021 569.72.76
Specificații
ISBN-13: 9780367357986
ISBN-10: 0367357984
Pagini: 743
Dimensiuni: 178 x 254 x 39 mm
Greutate: 1.72 kg
Ediția:1
Editura: CRC Press
Colecția Chapman and Hall/CRC
Seria Chapman & Hall/CRC Data Science Series
Locul publicării:Boca Raton, United States
ISBN-10: 0367357984
Pagini: 743
Dimensiuni: 178 x 254 x 39 mm
Greutate: 1.72 kg
Ediția:1
Editura: CRC Press
Colecția Chapman and Hall/CRC
Seria Chapman & Hall/CRC Data Science Series
Locul publicării:Boca Raton, United States
Cuprins
I R. 1 Installing R and RStudio. 2. Getting Started with R and RStudio. 3. R Basics. 4. Programming basics. 5. The tidyverse. 6. Importing data. II Data Visualization. 7. Introduction to data visualization. 8. ggplot2. 9. Visualizing data distributions. 10. Data visualization in practice. 11. Data visualization principles. 12. Robust summaries. III Statistics with R. 13. Introduction to Statistics with R. 14. Probability. 15. Random variables. 16. Statistical Inference. 17. Statistical models. 18. Regression. 19. Linear Models. 20. Association is not causation. IV Data Wrangling. 21. Introduction to Data Wrangling. 22. Reshaping data. 23. Joining tables. 24. Web Scraping. 25. String Processing. 26. Parsing Dates and Times. 27. Text mining. V Machine Learning. 28. Introduction to Machine Learning. 29. Smoothing. 30. Cross validation. 31. The caret package. 32. Examples of algorithms. 33. Machine learning in practice. 34. Large datasets. 35. Clustering. VI Productivity tools. 36. Introduction to productivity tools. 37. Accessing the terminal and installing Git. 38. Organizing with Unix. 39. Git and GitHub. 40. Reproducible projects with RStudio and R markdown.
Recenzii
"I think the book would be perfect for schools looking to make a transition to a model where introduction to data science takes the place of introduction to statistics and maybe introductory computer science." ~Arend Kuyper, Northwestern University
"A great introduction to data science and modern R programing, with tons of examples of application of the R abilities throughout the whole volume. The book suggests multiple links to the internet websites related to the topics under consideration that makes it an incredibly useful source of contemporary data science and programing, helping to students and researchers in their projects." ~Technometrics
"Introduction to Data Science will teach you to juggle with your data and get maximum results from it using R. I highly recommended this book for students and everybody taking the first steps in data science using R." ~ Maria Ivanchuk, ISCB News
"A great introduction to data science and modern R programing, with tons of examples of application of the R abilities throughout the whole volume. The book suggests multiple links to the internet websites related to the topics under consideration that makes it an incredibly useful source of contemporary data science and programing, helping to students and researchers in their projects." ~Technometrics
"Introduction to Data Science will teach you to juggle with your data and get maximum results from it using R. I highly recommended this book for students and everybody taking the first steps in data science using R." ~ Maria Ivanchuk, ISCB News
Notă biografică
Rafael A. Irizarry is professor of data sciences at the Dana-Farber Cancer Institute, professor of biostatistics at Harvard, and a fellow of the American Statistical Association. Dr. Irizarry is an applied statistician and during the last 20 years has worked in diverse areas, including genomics, sound engineering, and public health. He disseminates solutions to data analysis challenges as open source software, tools that are widely downloaded and used. Prof. Irizarry has also developed and taught several data science courses at Harvard as well as popular online courses.
Descriere
The book begins by going over the basics of R and the tidyverse. You learn R throughout the book, but in the first part we go over the building blocks needed to keep learning during the rest of the book.