Regression Models for Data Science in R
Regression Models for Data Science in R
A companion book for the Coursera Regression Models class
About the Book
The ideal reader for this book will be quantitatively literate and has a basic understanding of statistical concepts and R programming. The student should have a basic understanding of statistical inference such as contained in https://leanpub.com/LittleInferenceBook/. The book gives a rigorous treatment of the elementary concepts of regression models from a practical perspective. After reading the book and watching the associated videos, students will be able to perform multivariable regression models and understand their interpretations.
Packages
The Book
This is just the boook.
PDF
EPUB
MOBI
WEB
English
The Book+Videos+Code
This is the book, plus the videos, plus the video solutions. All of the videos are available on YouTube as well. The book plus lecture note github repos are included as well.
Includes:
Video lectures
These are the video lectures associated with the book. They are also available on YouTube and Coursera.
Lecture notes and code
This is the github repo zipped up as one entity. You can get this off of github if you'd like. It also includes the book repo.
PDF
EPUB
MOBI
WEB
English
The Book+Code+Lecture Videos+Solution Videos
This is the book, the github repos (lecture notes and book) plus the video lectures plus the video HW solutions. All are available elsewhere for free (github and YouTube).
Includes:
Video lectures
These are the video lectures associated with the book. They are also available on YouTube and Coursera.
Lecture notes and code
This is the github repo zipped up as one entity. You can get this off of github if you'd like. It also includes the book repo.
Video HW solutions.
This is the video homework solutions. These are also all available on YouTube.
PDF
EPUB
MOBI
WEB
English
Table of Contents
-
Preface
- About this book
- About the cover
-
Introduction
- Before beginning
- Regression models
- Motivating examples
- Summary notes: questions for this book
- Exploratory analysis of Galton’s Data
- The math (not required)
- Comparing children’s heights and their parent’s heights
- Regression through the origin
- Exercises
-
Notation
- Some basic definitions
- Notation for data
- The empirical mean
- The empirical standard deviation and variance
- Normalization
- The empirical covariance
- Some facts about correlation
- Exercises
-
Ordinary least squares
- General least squares for linear equations
- Revisiting Galton’s data
- Showing the OLS result
- Exercises
-
Regression to the mean
- A historically famous idea, regression to the mean
- Regression to the mean
- Exercises
-
Statistical linear regression models
- Basic regression model with additive Gaussian errors.
- Interpreting regression coefficients, the intercept
- Interpreting regression coefficients, the slope
- Using regression for prediction
- Example
- Exercises
-
Residuals
- Residual variation
- Properties of the residuals
- Example
- Estimating residual variation
- Summarizing variation
- R squared
- Exercises
-
Regression inference
- Reminder of the model
- Review
- Results for the regression parameters
- Example diamond data set
- Getting a confidence interval
- Prediction of outcomes
- Summary notes
- Exercises
-
Multivariable regression analysis
- The linear model
- Estimation
- Example with two variables, simple linear regression
- The general case
- Simulation demonstrations
- Interpretation of the coefficients
- Fitted values, residuals and residual variation
- Summary notes on linear models
- Exercises
-
Multivariable examples and tricks
- Data set for discussion
- Simulation study
- Back to this data set
- What if we include a completely unnecessary variable?
- Dummy variables are smart
- More than two levels
- Insect Sprays
-
Further analysis of the
swiss
dataset - Exercises
-
Adjustment
- Experiment 1
- Experiment 2
- Experiment 3
- Experiment 4
- Experiment 5
- Some final thoughts
- Exercises
-
Residuals, variation, diagnostics
- Residuals
- Influential, high leverage and outlying points
- Residuals, Leverage and Influence measures
- Simulation examples
- Example described by Stefanski
- Back to the Swiss data
- Exercises
-
Multiple variables and model selection
- Multivariable regression
- The Rumsfeldian triplet
- General rules
- R squared goes up as you put regressors in the model
- Simulation demonstrating variance inflation
- Summary of variance inflation
- Swiss data revisited
- Impact of over- and under-fitting on residual variance estimation
- Covariate model selection
- How to do nested model testing in R
- Exercises
-
Generalized Linear Models
- Example, linear models
- Example, logistic regression
- Example, Poisson regression
- How estimates are obtained
- Odds and ends
- Exercises
-
Binary GLMs
- Example Baltimore Ravens win/loss
- Odds
- Modeling the odds
- Interpreting Logistic Regression
- Visualizing fitting logistic regression curves
- Ravens logistic regression
- Some summarizing comments
- Exercises
-
Count data
- Poisson distribution
- Poisson distribution
- Linear regression
- Poisson regression
- Mean-variance relationship
- Rates
- Exercises
-
Bonus material
- How to fit functions using linear models
- Notes
- Harmonics using linear models
- Thanks!
- Notes
The Leanpub 60-day 100% Happiness Guarantee
Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms
Do Well. Do Good.
Authors have earned$11,577,045writing, publishing and selling on Leanpub, earning 80% royalties while saving up to 25 million pounds of CO2 and up to 46,000 trees.
Learn more about writing on Leanpub
Free Updates. DRM Free.
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers), EPUB (for phones and tablets) and MOBI (for Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Learn more about Leanpub's ebook formats and where to read them
Top Books
Recipes for Decoupling
Matthias NobackSignalR on .NET 6 - the Complete Guide
Fiodar SazanavetsLearn everything there is to learn about SignalR and how to integrate it with the latest .NET 6 and C# 10 features. Learn how to connect any type of client to SignalR, including plain WebSocket client. Learn how to build interactive applications that can communicate with each other in real time without making excessive calls.
The BDD Books - Discovery (Japanese Edition)
Gáspár Nagy, Seb Rose, and Yuya Kazamaウクライナ難民を支援 - 2022年5月末まで延長!
この本の売り上げの50%は、 https://unicef.hu/veszhelyzet-ukrajnaban と https://int.depaulcharity.org/fundraising-for-depaul-ukraine/ に寄付されます。
本書籍は、振る舞い駆動開発(Behavior Driven Development, BDD)や受け入れテスト駆動開発(Acceptance Test-Driven Development, ATDD)の発見フェーズを最大限に活用する方法を提供します。
The easiest way to learn design patterns
Fiodar SazanavetsLearn design patterns in the easiest way possible. You will no longer have to brute-force your way through each one of them while trying to figure out how it works. The book provides a unique methodology that will make your understanding of design patterns stick. It can also be used as a reference book where you can find design patterns in seconds.
Agile Testing Condensed Japanese Edition
Yuya Kazama, Janet Gregory, and Lisa CrispinJanet GregoryとLisa Crispinによる2019年9月発行の書籍『Agile Testing Condensed』の日本語翻訳版です。アジャイルにおいてどのような考えでテストを行うべきなのか簡潔に書かれています!
OpenIntro Statistics
David Diez, Christopher Barr, Mine Cetinkaya-Rundel, and OpenIntroA complete foundation for Statistics, also serving as a foundation for Data Science.
Leanpub revenue supports OpenIntro (US-based nonprofit) so we can provide free desk copies to teachers interested in using OpenIntro Statistics in the classroom and expand the project to support free textbooks in other subjects.
More resources: openintro.org.
Tech Giants in Healthcare
Dr. Bertalan MeskoThis comprehensive guide, Tech Giants in Healthcare, clarifies how and why big tech companies step into healthcare, and breaks it down from one market player to the other in what direction they are going, what tools they are using and what horizons they have in front of them.
Functional event-driven architecture: Powered by Scala 3
Gabriel VolpeExplore the event-driven architecture (EDA) in a purely functional way, mainly powered by Fs2 streams in Scala 3!
Leverage your functional programming skills by designing and writing stateless microservices that scale, powered by stateful message brokers.
CCIE Service Provider Version 4 Written and Lab Exam Comprehensive Guide
Nicholas RussoThe service provider landscape has changed rapidly over the past several years. Networking vendors are continuing to propose new standards, techniques, and procedures for overcoming new challenges while concurrently reducing costs and delivering new services. Cisco has recently updated the CCIE Service Provider track to reflect these changes; this book represents the author's personal journey in achieving that certification.
Ansible for DevOps
Jeff GeerlingAnsible is a simple, but powerful, server and configuration management tool. Learn to use Ansible effectively, whether you manage one server—or thousands.
Top Bundles
- #1
All the Books of The Medical Futurist
6 Books
We put together the most popular books from The Medical Futurist to provide a clear picture about the major trends shaping the future of medicine and healthcare. Digital health technologies, artificial intelligence, the future of 20 medical specialties, big pharma, data privacy, digital health investments and how technology giants such as Amazon... - #2
Practical FP in Scala + Functional event-driven architecture
2 Books
Practical FP in Scala (A hands-on approach) & Functional event-driven architecture, aka FEDA, (Powered by Scala 3), together as a bundle! The content of PFP in Scala is a requirement to understand FEDA so why not take advantage of this bundle!? - #3
Software Architecture for Developers: Volumes 1 & 2 - Technical leadership and communication
2 Books
"Software Architecture for Developers" is a practical and pragmatic guide to modern, lightweight software architecture, specifically aimed at developers. You'll learn:The essence of software architecture.Why the software architecture role should include coding, coaching and collaboration.The things that you really need to think about before... - #4
CCIE Service Provider Ultimate Study Bundle
2 Books
Piotr Jablonski, Lukasz Bromirski, and Nick Russo have joined forces to deliver the only CCIE Service Provider training resource you'll ever need. This bundle contains a detailed and challenging collection of workbook labs, plus an extensively detailed technical reference guide. All of us have earned the CCIE Service Provider certification... - #6
Pattern-Oriented Memory Forensics and Malware Detection
2 Books
This training bundle for security engineers and researchers, malware and memory forensics analysts includes two accelerated training courses for Windows memory dump analysis using WinDbg. It is also useful for technical support and escalation engineers who analyze memory dumps from complex software environments and need to check for possible... - #8
Modern C++ Collection
3 Books
Get All about Modern C++C++ Standard Library, including C++20Concurrency with Modern C++, including C++20C++20Each book has about 200 complete code examples. Updates are included. When I update one of the books, you immediately get the updated bundle. You can expect significant updates to each new C++ standard (C++23, C++26, .. ) and also...