R Programming for Data Science
R Programming for Data Science
About the Book
Data science has taken the world by storm. Every field of study and area of business has been affected as people increasingly realize the value of the incredible quantities of data being generated. But to extract value from those data, one needs to be trained in the proper data science skills. The R programming language has become the de facto programming language for data science. Its flexibility, power, sophistication, and expressiveness have made it an invaluable tool for data scientists around the world.
This book is about the fundamentals of R programming. You will get started with the basics of the language, learn how to manipulate datasets, how to write functions, and how to debug and optimize code. With the fundamentals provided in this book, you will have a solid foundation on which to build your data science toolbox.
If you are interested in a printed copy of this book, you can purchase one at Lulu.
Packages
The Book
This package contains just the book in PDF, EPUB, or MOBI formats.
English
PDF
EPUB
MOBI
WEB
The Book + Datasets + R Code Files
This package contains the book and R code files corresponding to each of the chapters in the book. The package also contains the datasets used in all of the chapters so that the code can be fully executed.
Includes:
Datasets
R Code Files
English
PDF
EPUB
MOBI
WEB
The Book + Lecture Videos (HD) + Datasets + R Code Files
This package includes the book, high definition lecture video files (720p), datasets and R code files for all chapters. The collection also contains live demonstrations of how to use various aspects of R that could not be included in the book. The videos are licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International license.
Includes:
Datasets
R Code Files
Lecture Videos (HD)
English
PDF
EPUB
MOBI
WEB
Table of Contents
- 1. Stay in Touch!
- 2. Preface
-
3. History and Overview of R
- 3.1 What is R?
- 3.2 What is S?
- 3.3 The S Philosophy
- 3.4 Back to R
- 3.5 Basic Features of R
- 3.6 Free Software
- 3.7 Design of the R System
- 3.8 Limitations of R
- 3.9 R Resources
-
4. Getting Started with R
- 4.1 Installation
- 4.2 Getting started with the R interface
-
5. R Nuts and Bolts
- 5.1 Entering Input
- 5.2 Evaluation
- 5.3 R Objects
- 5.4 Numbers
- 5.5 Attributes
- 5.6 Creating Vectors
- 5.7 Mixing Objects
- 5.8 Explicit Coercion
- 5.9 Matrices
- 5.10 Lists
- 5.11 Factors
- 5.12 Missing Values
- 5.13 Data Frames
- 5.14 Names
- 5.15 Summary
-
6. Getting Data In and Out of R
- 6.1 Reading and Writing Data
-
6.2 Reading Data Files with
read.table()
- 6.3 Reading in Larger Datasets with read.table
- 6.4 Calculating Memory Requirements for R Objects
-
7. Using the
readr
Package -
8. Using Textual and Binary Formats for Storing Data
-
8.1 Using
dput()
anddump()
- 8.2 Binary Formats
-
8.1 Using
-
9. Interfaces to the Outside World
- 9.1 File Connections
- 9.2 Reading Lines of a Text File
- 9.3 Reading From a URL Connection
-
10. Subsetting R Objects
- 10.1 Subsetting a Vector
- 10.2 Subsetting a Matrix
- 10.3 Subsetting Lists
- 10.4 Subsetting Nested Elements of a List
- 10.5 Extracting Multiple Elements of a List
- 10.6 Partial Matching
- 10.7 Removing NA Values
-
11. Vectorized Operations
- 11.1 Vectorized Matrix Operations
-
12. Dates and Times
- 12.1 Dates in R
- 12.2 Times in R
- 12.3 Operations on Dates and Times
- 12.4 Summary
-
13. Managing Data Frames with the
dplyr
package- 13.1 Data Frames
-
13.2 The
dplyr
Package -
13.3
dplyr
Grammar -
13.4 Installing the
dplyr
package -
13.5
select()
-
13.6
filter()
-
13.7
arrange()
-
13.8
rename()
-
13.9
mutate()
-
13.10
group_by()
-
13.11
%>%
- 13.12 Summary
-
14. Control Structures
-
14.1
if
-else
-
14.2
for
Loops -
14.3 Nested
for
loops -
14.4
while
Loops -
14.5
repeat
Loops -
14.6
next
,break
- 14.7 Summary
-
14.1
-
15. Functions
- 15.1 Functions in R
- 15.2 Your First Function
- 15.3 Argument Matching
- 15.4 Lazy Evaluation
-
15.5 The
...
Argument -
15.6 Arguments Coming After the
...
Argument - 15.7 Summary
-
16. Scoping Rules of R
- 16.1 A Diversion on Binding Values to Symbol
- 16.2 Scoping Rules
- 16.3 Lexical Scoping: Why Does It Matter?
- 16.4 Lexical vs. Dynamic Scoping
- 16.5 Application: Optimization
- 16.6 Plotting the Likelihood
- 16.7 Summary
- 17. Coding Standards for R
-
18. Loop Functions
- 18.1 Looping on the Command Line
-
18.2
lapply()
-
18.3
sapply()
-
18.4
split()
- 18.5 Splitting a Data Frame
- 18.6 tapply
-
18.7
apply()
- 18.8 Col/Row Sums and Means
- 18.9 Other Ways to Apply
-
18.10
mapply()
- 18.11 Vectorizing a Function
- 18.12 Summary
-
19. Regular Expressions
- 19.1 Before You Begin
- 19.2 Primary R Functions
-
19.3
grep()
-
19.4
grepl()
-
19.5
regexpr()
-
19.6
sub()
andgsub()
-
19.7
regexec()
-
19.8 The
stringr
Package - 19.9 Summary
-
20. Debugging
- 20.1 Something’s Wrong!
- 20.2 Figuring Out What’s Wrong
- 20.3 Debugging Tools in R
-
20.4 Using
traceback()
-
20.5 Using
debug()
-
20.6 Using
recover()
- 20.7 Summary
-
21. Profiling R Code
-
21.1 Using
system.time()
- 21.2 Timing Longer Expressions
- 21.3 The R Profiler
-
21.4 Using
summaryRprof()
- 21.5 Summary
-
21.1 Using
-
22. Simulation
- 22.1 Generating Random Numbers
- 22.2 Setting the random number seed
- 22.3 Simulating a Linear Model
- 22.4 Random Sampling
- 22.5 Summary
-
23. Data Analysis Case Study: Changes in Fine Particle Air Pollution in the U.S.
- 23.1 Synopsis
- 23.2 Loading and Processing the Raw Data
- 23.3 Results
-
24. Parallel Computation
- 24.1 Hidden Parallelism
- 24.2 Embarrassing Parallelism
- 24.3 The Parallel Package
- 24.4 Example: Bootstrapping a Statistic
- 24.5 Building a Socket Cluster
- 24.6 Summary
- 25. Why I Indent My Code 8 Spaces
- 26. About the Author
Other books by this author
Authors have earned$10,089,287writing, publishing and selling on Leanpub, earning 80% royalties while saving up to 25 million pounds of CO2 and up to 46,000 trees.
Learn more about writing on Leanpub
The Leanpub 45-day 100% Happiness Guarantee
Within 45 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms
Free Updates. DRM Free.
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers), EPUB (for phones and tablets) and MOBI (for Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Learn more about Leanpub's ebook formats and where to read them
Top Books
C++20
Rainer GrimmC++20 is the next big C++ standard after C++11. As C++11 did it, C++20 changes the way we program modern C++. This change is, in particular, due to the big four of C++20: ranges, coroutines, concepts, and modules.
The book is almost daily updated. These incremental updates ease my interaction with the proofreaders.
A Guide to Artificial Intelligence in Healthcare
Dr. Bertalan MeskoCan we stay human in the age of A.I.? To go even further, can we grow in humanity, can we shape a more humane, more equitable and sustainable healthcare? This e-book aims to prepare healthcare and medical professionals for the era of human-machine collaboration. Read our guide to understanding, anticipating and controlling artificial intelligence.
C++ Best Practices
Jason TurnerLevel up your C++, get the tools working for you, eliminate common problems, and move on to more exciting things!
Atomic Kotlin
Bruce Eckel and Svetlana IsakovaFor both beginning and experienced programmers! From the author of the multi-award-winning Thinking in C++ and Thinking in Java together with a member of the Kotlin language team comes a book that breaks the concepts into small, easy-to-digest "atoms," along with exercises supported by hints and solutions directly inside IntelliJ IDEA!
Sockets and Pipes
Type ClassesSockets and Pipes is not an introduction to Haskell; it is an introduction to writing software in Haskell. Using a handful of everyday Haskell libraries, this book walks through reading the HTTP specification and implementing it to create a web server.
Introducing EventStorming
Alberto BrandoliniThe deepest tutorial and explanation about EventStorming, straight from the inventor.
node-opcua by example
Etienne RossignonGet the best out of node-opcua through a set of documented examples by the author himself that will allow you to create stunning OPCUA Servers or Clients.
Ansible for DevOps
Jeff GeerlingAnsible is a simple, but powerful, server and configuration management tool. Learn to use Ansible effectively, whether you manage one server—or thousands.
Functional Design and Architecture
Alexander GraninSoftware Design in Functional Programming, Design Patterns and Practices, Methodologies and Application Architectures. How to build real software in Haskell with less efforts and low risks. The first complete source of knowledge.
Composing Software
Eric ElliottAll software design is composition: the act of breaking complex problems down into smaller problems and composing those solutions. Most developers have a limited understanding of compositional techniques. It's time for that to change.
Top Bundles
- #1
Software Architecture for Developers: Volumes 1 & 2 - Technical leadership and communication
2 Books
"Software Architecture for Developers" is a practical and pragmatic guide to modern, lightweight software architecture, specifically aimed at developers. You'll learn:The essence of software architecture.Why the software architecture role should include coding, coaching and collaboration.The things that you really need to think about before... - #2
All the Books of The Medical Futurist
6 Books
We put together the most popular books from The Medical Futurist to provide a clear picture about the major trends shaping the future of medicine and healthcare. Digital health technologies, artificial intelligence, the future of 20 medical specialties, big pharma, data privacy, digital health investments and how technology giants such as Amazon... - #3
PowerShell
3 Books
Buy every PowerShell book from Adam Bertram at a 20% discount! - #4
Cisco CCNA 200-301 Complet
4 Books
Ce lot comprend les quatre volumes du guide préparation à l'examen de certification Cisco CCNA 200-301. - #5
Linux Administration Complet
4 Books
Ce lot comprend les quatre volumes du Guide Linux Administration :Linux Administration, Volume 1, Administration fondamentale : Guide pratique de préparation aux examens de certification LPIC 1, Linux Essentials, RHCSA et LFCS. Administration fondamentale. Introduction à Linux. Le Shell. Traitement du texte. Arborescence de fichiers. Sécurité... - #6
Modern C++ by Nicolai Josuttis
2 Books
- #7
Django for Beginners/APIs/Professionals
3 Books
- #8
Learn Git, Bash, and Terraform the Hard Way
3 Books
Learn Git, Bash and Terraform using the Hard Way method.These technologies are essential tools in the DevOps armoury. These books walk you through their features and subtleties in a simple, gradual way that reinforces learning rather than baffling you with theory. - #9
Software Architecture and Beautiful APIs
2 Books
There is no better way to learn how to design good APIs than to look at many existing examples, complementing the Software Architecture theory on API design. - #10
Digital Future of Healthcare and Pharma
3 Books
These three popular e-books from The Medical Futurist describes how digital health technologies will shape the future of health, healthcare, medicine and pharma with exciting infographics in a digestible format.