Kick off your book project in 3 hours! Live workshop on Zoom. You’ll leave with a real book project, progress on your first chapter, and a clear plan to keep going. Saturday, May 16, 2026. Learn more…

Leanpub Header

Skip to main content

Filters

Category: "Data Science"

Books

  1. OpenIntro Statistics
    Includes 1st, 2nd, 3rd, and 4th Editions
    OpenIntro, Christopher Barr, Mine Cetinkaya-Rundel, and David Diez

    A complete foundation for Statistics, also serving as a foundation for Data Science. Leanpub revenue supports OpenIntro (US-based nonprofit) so we can provide free desk copies to teachers interested in using OpenIntro Statistics in the classroom and expand the project to support free textbooks in other subjects. More resources: openintro.org.

  2. The Orange Book of Machine Learning - Green edition
    The essentials of making predictions using supervised regression and classification for tabular data.
    Carl McBride Ellis

    The essentials of making predictions using supervised regression and classification for tabular data. Tech stack: python, pandas, scikit-learn, CatBoost, LightGBM, XGBoost, TabPFN, TabICL

  3. Everything you really need to know in Machine Learning in a hundred pages.

  4. Longitudinal Data Analysis Using R is a practical, end-to-end guide to longitudinal data analysis that helps you move from raw data to credible substantive conclusions. It tackles the real pain points researchers face in longitudinal analysis, such as data preparation, exploring change over time, model choice, missing data, and interpretation, using clear explanations, real-world data, and fully reproducible R code.

  5. Introduction to Modern Statistics
    OpenIntro, Mine Cetinkaya-Rundel, and Johanna Hardin

    The book is also available in paperback for $25. Paperback royalties go to OpenIntro (US-based nonprofit), and the optional Leanpub PDF contributions go to authors to fund their time on this book.

  6. The Agentic AI book
    From Language Models to Multi-Agent Systems
    Dr. Ryan Rad

    It's never been easier to build an AI agent—and never been harder to make one that actually works. This book takes you from language model foundations to production-ready multi-agent systems, with the depth to understand what you're building and why it fails.

  7. This book brings the fundamentals of R programming to you, using the same material developed as part of the industry-leading Johns Hopkins Data Science Specialization. The skills taught in this book will lay the foundation for you to begin your journey learning data science. Printed copies of this book are available through Lulu.

  8. Interpretable Machine Learning (Third Edition)
    A Guide for Making Black Box Models Explainable
    Christoph Molnar

    This book teaches you how to make machine learning models more interpretable.

  9. The Hundred-Page Language Models Book
    hands-on with PyTorch
    Andriy Burkov

    Master language models through mathematics, illustrations, and code―and build your own from scratch!

  10. My Adventures with Large Language Models
    Build foundational LLMs from Transformers to DeepSeek, from scratch, in PyTorch.
    Prathamesh S.

    Build GPT-2, Llama 3, and DeepSeek from scratch in PyTorch. Every chapter has runnable end-to-end code and loads real pretrained weights. Goes well past where most LLM tutorials stop.

  11. Introduction to Data Science
    Data Analysis and Prediction Algorithms with R
    Rafael A Irizarry

    The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts from probability, statistical inference, linear regression and machine learning and R programming skills. Throughout the book we demonstrate how these can help you tackle real-world data analysis challenges.

  12. D3 Start to Finish (2nd Edition)
    Learn how to make a custom data visualisation using D3.js.
    Peter Cook

    D3 Start to Finish shows you how to build a custom, interactive and beautiful data visualisation using the JavaScript library D3.js (versions 6 & 7). The book covers D3.js concepts such as selections, joins, requests, scale functions, events & transitions. You'll put these concepts into practice by building a custom, interactive data visualisation.

  13. The Art of Data Science
    A Guide for Anyone Who Works with Data
    Roger D. Peng and Elizabeth Matsui

    This book describes the process of analyzing data. The authors have extensive experience both managing data analysts and conducting their own data analyses, and this book is a distillation of their experience in a format that is applicable to both practitioners and managers in data science. Printed copies are available through Lulu.

  14. Interpreting Machine Learning Models With SHAP
    A Guide With Python Examples And Theory On Shapley Values
    Christoph Molnar

    Master machine learning interpretability with this comprehensive guide to SHAP – your tool to communicating model insights and building trust in all your machine learning applications.

  15. Deep Learning with PyTorch Step-by-Step
    A Beginner's Guide
    Daniel Voigt Godoy

    Revised for PyTorch 2.x! In 2019, I published a PyTorch tutorial on Towards Data Science and I was amazed by the reaction from the readers! Their feedback motivated me to write this book to help beginners start their journey into Deep Learning and PyTorch. I hope you enjoy reading this book as much as I enjoy writing it.