Leanpub Header

Skip to main content

Filters

Category: "R"

Books

  1. This book brings the fundamentals of R programming to you, using the same material developed as part of the industry-leading Johns Hopkins Data Science Specialization. The skills taught in this book will lay the foundation for you to begin your journey learning data science. Printed copies of this book are available through Lulu.

  2. This book teaches you to use R to effectively visualize and explore complex datasets. Exploratory data analysis is a key part of the data science process because it allows you to sharpen your question and refine your modeling strategies. This book is based on the industry-leading Johns Hopkins Data Science Specialization.

  3. Applied Data Science for Credit Risk
    A Practical Guide in R and Python
    Andrija Djurovic

    This book provides a practical guide to critical data science methods, focusing on their application in credit risk management. Using examples in R and Python, it presents step-by-step processes for applying various analytical techniques while highlighting the importance of aligning methods with the specific characteristics of the data. Designed for practitioners and those with foundational data science and banking knowledge, the book bridges theory and practice with real-world examples.

  4. This book teaches the fundamental concepts and tools behind reporting modern data analyses in a reproducible manner. As data analyses become increasingly complex, the need for clear and reproducible report writing is greater than ever. The material for this book was developed as part of the industry-leading Johns Hopkins Data Science Specialization. Printed versions are available through Lulu (see link below).

  5. Data Analysis for the Life Sciences
    Rafael A Irizarry and Michael I Love

    Data analysis is now part of practically every research project in the life sciences. In this book we use data and computer code to teach the necessary statistical concepts and programming skills to become a data analyst. Instead of showing theory first and then applying it to toy examples, we start with actual applications and describe the theory as it becomes necessary to solve specific challenges. The book includes links to computer code that readers can use to follow along as they program.

  6. Methods in Biostatistics with R
    A Rigorous and Practical Treatment of Biostatistics Foundations using R
    Brian Caffo, John Muschelli, and Ciprian Crainiceanu

    The book provides a modern look at introductory Biostatistical concepts and the associated computational tools using the latest developments in computation and visualization in the R language environment. The book includes practical data analysis based on datasets that can be downloaded here: https://github.com/muschellij2/biostatmethods.

  7. Mastering Software Development in R
    Roger D. Peng, Sean Kross, and Brooke Anderson

    This book covers R software development for building data science tools. This book provides rigorous training in the R language and covers modern software development practices for building tools that are highly reusable, modular, and suitable for use in a team-based environment or a community of developers. (Printed copies coming soon!)

  8. Tidyverse Skills for Data Science in R
    Roger D. Peng, Carrie Wright, Stephanie Hicks, and Shannon Ellis

    Develop insights from data with tidy tools. Import, wrangle, visualize, and model data with the Tidyverse R packages.

  9. Credit Risk Modeling Working Notes
    A Collection of Presentations, Experiments, and Technical Papers
    Andrija Djurovic

    The Working Notes complement Applied Data Science for Credit Risk and Probability of Default Rating Modeling with R, offering practice-oriented insights. Based on the author’s GitHub repository, they address real-world challenges and are regularly updated to reflect ongoing developments.

  10. Regression Models for Data Science in R
    A companion book for the Coursera Regression Models class
    Brian Caffo

    This book gives a brief, but rigorous, treatment of regression models intended for practicing Data Scientists.

  11. A rigorous treatment of linear models for self learning data scientists. This book is only available in pdf form.

  12. Developing Data Products in R
    Brian Caffo and Sean Kross

    This book introduces the topic of Developing Data Products in R. A data product is the ideal output of a Data Science experiment. This book is based on the Coursera Class "Developing Data Products" as part of the Data Science Specialization. Particular emphasis is paid to developing Shiny apps and interactive graphics.

  13. The Ultimate Cheat Sheet for Longitudinal Data Analysis in R
    Learn key concepts, commands and analyses for longitudinal data analysis
    Alexandru Cernat

    Longitudinal data are powerful but complex, requiring new concepts, data structures, and models that can feel overwhelming to learn. This cheat sheet brings together the key ideas, R commands, and modelling approaches into a single workflow, helping you understand how everything fits together and providing the building blocks for mastering longitudinal data analysis.

  14. Predicting Persistence: Population Models for Conservation
    With examples and exercises in R
    Jacob Koella

    How long will a population persist, and why do some populations go extinct while others survive? 'Predicting Persistence' will help you to get a grasp of the fundamental aspects of the theory underlying these questions, and it will show you how to simulate them with the programming language R.

  15. Data Science with R
    Data Analysis and Prediction Algorithms with R - Third Edition
    Daniel Paredes

    Discover how to become a Data Scientist at your own pace with updated content and real-world examples. From data analysis to prediction algorithms with machine learning. This Third Edition includes new chapters on Generative AI, Ethics, and modern Machine Learning workflows.