Book description
R is the most powerful tool you can use for statistical analysis. This definitive guide smooths R’s steep learning curve with practical solutions and real-world applications for commercial environments.In R in Action, Third Edition you will learn how to:
- Set up and install R and RStudio
- Clean, manage, and analyze data with R
- Use the ggplot2 package for graphs and visualizations
- Solve data management problems using R functions
- Fit and interpret regression models
- Test hypotheses and estimate confidence
- Simplify complex multivariate data with principal components and exploratory factor analysis
- Make predictions using time series forecasting
- Create dynamic reports and stunning visualizations
- Techniques for debugging programs and creating packages
R in Action, Third Edition makes learning R quick and easy. That’s why thousands of data scientists have chosen this guide to help them master the powerful language. Far from being a dry academic tome, every example you’ll encounter in this book is relevant to scientific and business developers, and helps you solve common data challenges. R expert Rob Kabacoff takes you on a crash course in statistics, from dealing with messy and incomplete data to creating stunning visualizations. This revised and expanded third edition contains fresh coverage of the new tidyverse approach to data analysis and R’s state-of-the-art graphing capabilities with the ggplot2 package.
About the Technology
Used daily by data scientists, researchers, and quants of all types, R is the gold standard for statistical data analysis. This free and open source language includes packages for everything from advanced data visualization to deep learning. Instantly comfortable for mathematically minded users, R easily handles practical problems without forcing you to think like a software engineer.
About the Book
R in Action, Third Edition teaches you how to do statistical analysis and data visualization using R and its popular tidyverse packages. In it, you’ll investigate real-world data challenges, including forecasting, data mining, and dynamic report writing. This revised third edition adds new coverage for graphing with ggplot2, along with examples for machine learning topics like clustering, classification, and time series analysis.
What's Inside
- Clean, manage, and analyze data
- Use the ggplot2 package for graphs and visualizations
- Techniques for debugging programs and creating packages
- A complete learning resource for R and tidyverse
About the Reader
Requires basic math and statistics. No prior experience with R needed.
About the Author
Dr. Robert I Kabacoff is a professor of quantitative analytics at Wesleyan University and a seasoned data scientist with more than 20 years of experience.
Quotes
Kabacoff has outdone himself by significantly improving on the already excellent previous edition.
- Alain Lompo, ISO-Gruppe
R in Action has been my go-to reference on R for years. The third edition contains timely updates on the tidyverse and other new tools. I would recommend this book without hesitation.
- Daniel Kenney-Jung MD, Department of Pediatrics, Duke University
Outstandingly well-written. The best book on R programming that I have ever read.
- Kelvin Meeks, International Technology Ventures
Takes the reader through a series of essential methods from basic to complex. The only R book you will ever need.
- Martin Perry, Microsoft
Table of contents
- R in Action
- Copyright
- Praise for the previous edition of R in Action
- brief contents
- contents
- Front matter
- Part 1. Getting started
- 1 Introduction to R
-
2 Creating a dataset
- 2.1 Understanding datasets
- 2.2 Data structures
-
2.3 Data input
- 2.3.1 Entering data from the keyboard
- 2.3.2 Importing data from a delimited text file
- 2.3.3 Importing data from Excel
- 2.3.4 Importing data from JSON
- 2.3.5 Importing data from the web
- 2.3.6 Importing data from SPSS
- 2.3.7 Importing data from SAS
- 2.3.8 Importing data from Stata
- 2.3.9 Accessing database management systems
- 2.3.10 Importing data via Stat/Transfer
- 2.4 Annotating datasets
- 2.5 Useful functions for working with data objects
- Summary
-
3 Basic data management
- 3.1 A working example
- 3.2 Creating new variables
- 3.3 Recoding variables
- 3.4 Renaming variables
- 3.5 Missing values
- 3.6 Date values
- 3.7 Type conversions
- 3.8 Sorting data
- 3.9 Merging datasets
- 3.10 Subsetting datasets
- 3.11 Using dplyr to manipulate data frames
- 3.12 Using SQL statements to manipulate data frames
- Summary
- 4 Getting started with graphs
- 5 Advanced data management
- Part 2. Basic methods
- 6 Basic graphs
- 7 Basic statistics
- Part 3. Intermediate methods
- 8 Regression
- 9 Analysis of variance
- 10 Power analysis
- 11 Intermediate graphs
- 12 Resampling statistics and bootstrapping
- Part 4. Advanced methods
- 13 Generalized linear models
- 14 Principal components and factor analysis
- 15 Time series
- 16 Cluster analysis
- 17 Classification
-
18 Advanced methods for missing data
- 18.1 Steps in dealing with missing data
- 18.2 Identifying missing values
- 18.3 Exploring missing-values patterns
- 18.4 Understanding the sources and impact of missing data
- 18.5 Rational approaches for dealing with incomplete data
- 18.6 Deleting missing data
- 18.7 Single imputation
- 18.8 Multiple imputation
- 18.9 Other approaches to missing data
- Summary
- Part 5. Expanding your skills
- 19 Advanced graphs
- 20 Advanced programming
- 21 Creating dynamic reports
-
22 Creating a package
- 22.1 The edatools package
-
22.2 Creating a package
- 22.2.1 Installing development tools
- 22.2.2 Creating a package project
- 22.2.3 Writing the package functions
- 22.2.4 Adding function documentation
- 22.2.5 Adding a general help file (optional)
- 22.2.6 Adding sample data to the package (optional)
- 22.2.7 Adding a vignette (optional)
- 22.2.8 Editing the DESCRIPTION file
- 22.2.9 Building and installing the package
- 22.3 Sharing your package
- 22.4 Going further
- Summary
- Afterword. Into the rabbit hole
- Appendix A. Graphical user interfaces
- Appendix B. Customizing the startup environment
- Appendix C. Exporting data from R
- Appendix D. Matrix algebra in R
- Appendix E. Packages used in this book
- Appendix F. Working with large datasets
- Appendix G. Updating an R installation
- References
- index
Product information
- Title: R in Action, Third Edition
- Author(s):
- Release date: May 2022
- Publisher(s): Manning Publications
- ISBN: 9781617296055
You might also like
book
R in Action, Second Edition
R in Action, Second Edition presents both the R language and the examples that make it …
video
R Programming
15+ Hours of Video Instruction R Programming LiveLessons, 2nd Edition, is a tour through the most …
book
Hands-On Programming with R
Learn how to program by diving into the R language, and then use your newfound skills …
book
Programming Skills for Data Science: Start Writing Code to Wrangle, Analyze, and Visualize Data with R, First Edition
The Foundational Hands-On Skills You Need to Dive into Data Science “Freeman and Ross have created …