A complete foundation for Statistics, also serving as a foundation for Data Science. Leanpub revenue supports OpenIntro (US-based nonprofit) so we can provide free desk copies to teachers interested in using OpenIntro Statistics in the classroom and expand the project to support free textbooks in other subjects. More resources: openintro.org.
Bridge AI and science with this hands-on guide. Whether you're a researcher learning ML or an engineer entering scientific applications, build real systems across chemistry, biology, physics & climate. Master Transformers, Diffusion Models & GNNs for scientific discovery. 500+ pages, 50+ Colab notebooks. Design molecules, predict proteins, accelerate climate models—all hands-on, zero setup required.
The book is also available in paperback for $25. Paperback royalties go to OpenIntro (US-based nonprofit), and the optional Leanpub PDF contributions go to authors to fund their time on this book.
Master language models through mathematics, illustrations, and code―and build your own from scratch!
Everything you really need to know in Machine Learning in a hundred pages.
Introduction to Statistics for the Life and Biomedical Sciences is the 4th official OpenIntro book and has been written to be used in conjunction with a set of self-paced learning labs. These labs guide students through learning how to apply statistical ideas and concepts discussed in the text with the R computing language.
The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts from probability, statistical inference, linear regression and machine learning and R programming skills. Throughout the book we demonstrate how these can help you tackle real-world data analysis challenges.
The essentials of making predictions using supervised regression and classification for tabular data. Tech stack: python, pandas, scikit-learn, CatBoost, LightGBM, XGBoost
This book teaches you how to make machine learning models more interpretable.
This book brings the fundamentals of R programming to you, using the same material developed as part of the industry-leading Johns Hopkins Data Science Specialization. The skills taught in this book will lay the foundation for you to begin your journey learning data science. Printed copies of this book are available through Lulu.
Data analysis is now part of practically every research project in the life sciences. In this book we use data and computer code to teach the necessary statistical concepts and programming skills to become a data analyst. Instead of showing theory first and then applying it to toy examples, we start with actual applications and describe the theory as it becomes necessary to solve specific challenges. The book includes links to computer code that readers can use to follow along as they program.
A book about how to be a scientist the modern, open-source way.
The book covers all the key skills needed for preparing, exploring, and analysing longitudinal data. To facilitate understanding and help readers learn these skills, it interweaves statistical modelling with computer code and visualizations. It does this using real-world data, code, and outputs that readers can replicate.
This book teaches you how to quantify the uncertainty of machine learning models with conformal prediction in Python.
D3 Start to Finish shows you how to build a custom, interactive and beautiful data visualisation using the JavaScript library D3.js (versions 6 & 7). The book covers D3.js concepts such as selections, joins, requests, scale functions, events & transitions. You'll put these concepts into practice by building a custom, interactive data visualisation.