Introduction to Data Science
Introduction to Data Science
Data Analysis and Prediction Algorithms with R
About the Book
The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression and machine learning. It also helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, algorithm building with caret, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation with knitr and R markdown. The book is divided into six parts: R, Data Visualization, Data Wrangling, Probability, Inference and Regression with R, Machine Learning, and Productivity Tools. Each part has several chapters meant to be presented as one lecture. The book includes dozens of exercises distributed across most chapters.
Translations
Table of Contents
Part I R
1 Getting Started with R and RStudio
2 R Basics
3 Programming basics
4 The tidyverse
5 Importing data
Part II Data Visualization
6 Introduction to data visualization
7 ggplot2
8 Visualizing data distributions
9 Data visualization in practice
10 Data visualization principles
11 Robust summaries
Part III Statistics with R
12 Introduction to Statistics with R
13 Probability
14 Random variables
15 Statistical Inference
16 Statistical models
17 Regression
18 Linear Models
19 Association is not causation
Part IV Data Wrangling
20 Introduction to Data Wrangling
21 Reshaping data
22 Joining tables
23 Web Scraping
24 String Processing
25 Parsing Dates and Times
26 Text mining
Part V Machine Learning
27 Introduction to Machine Learning
28 Smoothing
29 Cross validation
30 The caret package
31 Examples of algorithms
32 Machine learning in practice
33 Large datasets
34 Clustering
Part VI Productivity tools
35 Introduction to productivity tools
36 Organizing with Unix
37 Git and GitHub
38 Reproducible projects with RStudio and R markdown
The Leanpub 60 Day 100% Happiness Guarantee
Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.
You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!
So, there's no reason not to click the Add to Cart button, is there?
See full terms...
Earn $8 on a $10 Purchase, and $16 on a $20 Purchase
We pay 80% royalties on purchases of $7.99 or more, and 80% royalties minus a 50 cent flat fee on purchases between $0.99 and $7.98. You earn $8 on a $10 sale, and $16 on a $20 sale. So, if we sell 5000 non-refunded copies of your book for $20, you'll earn $80,000.
(Yes, some authors have already earned much more than that on Leanpub.)
In fact, authors have earnedover $13 millionwriting, publishing and selling on Leanpub.
Learn more about writing on Leanpub
Free Updates. DRM Free.
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Learn more about Leanpub's ebook formats and where to read them