Introduction to Data Science
Free!
Minimum price
$49.99
Suggested price

Introduction to Data Science

Data Analysis and Prediction Algorithms with R

About the Book

The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression and machine learning. It also helps you develop skills such as R programming, data wrangling with dplyr, data visualization with ggplot2, algorithm building with caret, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation with knitr and R markdown. The book is divided into six parts: R, Data Visualization, Data Wrangling, Probability, Inference and Regression with R, Machine Learning, and Productivity Tools. Each part has several chapters meant to be presented as one lecture. The book includes dozens of exercises distributed across most chapters. 

Translations

About the Author

Rafael A Irizarry
Rafael A Irizarry

Rafael Irizarry is a Professor of Biostatistics and Computational Biology at the Dana Farber Cancer Institute and Biostatistics at the Harvard T.H. Chan School of Public Health . For the past 17 years, Dr. Irizarry’s research has focused on the analysis of genomics data. 

Table of Contents

Part I R

1 Getting Started with R and RStudio 

2 R Basics 

3 Programming basics 

4 The tidyverse

5 Importing data 

Part II Data Visualization 

6 Introduction to data visualization

7 ggplot2 

8 Visualizing data distributions 

9 Data visualization in practice 

10 Data visualization principles 

11 Robust summaries 

Part III Statistics with R 

12 Introduction to Statistics with R 

13 Probability 

14 Random variables 

15 Statistical Inference 

16 Statistical models 

17 Regression

18 Linear Models

19 Association is not causation

Part IV Data Wrangling

20 Introduction to Data Wrangling

21 Reshaping data

22 Joining tables

23 Web Scraping

24 String Processing

25 Parsing Dates and Times

26 Text mining

Part V Machine Learning

27 Introduction to Machine Learning

28 Smoothing

29 Cross validation

30 The caret package

31 Examples of algorithms

32 Machine learning in practice

33 Large datasets

34 Clustering

Part VI Productivity tools 

35 Introduction to productivity tools

36 Organizing with Unix

37 Git and GitHub

38 Reproducible projects with RStudio and R markdown

The Leanpub 60 Day 100% Happiness Guarantee

Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.

Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.

You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!

So, there's no reason not to click the Add to Cart button, is there?

See full terms...

80% Royalties. Earn $16 on a $20 book.

We pay 80% royalties. That's not a typo: you earn $16 on a $20 sale. If we sell 5000 non-refunded copies of your book or course for $20, you'll earn $80,000.

(Yes, some authors have already earned much more than that on Leanpub.)

In fact, authors have earnedover $13 millionwriting, publishing and selling on Leanpub.

Learn more about writing on Leanpub

Free Updates. DRM Free.

If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).

Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.

Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.

Learn more about Leanpub's ebook formats and where to read them

Write and Publish on Leanpub

You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!

Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.

Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.

Learn more about writing on Leanpub