A complete foundation for Statistics, also serving as a foundation for Data Science. Leanpub revenue supports OpenIntro (US-based nonprofit) so we can provide free desk copies to teachers interested in using OpenIntro Statistics in the classroom and expand the project to support free textbooks in other subjects. More resources: openintro.org.
Master language models through mathematics, illustrations, and code―and build your own from scratch!
Everything you really need to know in Machine Learning in a hundred pages.
It's never been easier to build an AI agent—and never been harder to make one that actually works. This book takes you from language model foundations to production-ready multi-agent systems, with the depth to understand what you're building and why it fails.
The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts from probability, statistical inference, linear regression and machine learning and R programming skills. Throughout the book we demonstrate how these can help you tackle real-world data analysis challenges.
AI engines are booming, and the more we work with agentic systems, the more we see that we need something to make them work at the enterprise level. We're quite active in exploring ideas around context graphs, decision traces, and supporting explainability—giving agents the ability to make more aware and company-aligned decisions.But this makes sense not only for enterprises, but for users and individuals building personal agents as well. Unfortunately, we have zero-to-none inclination on how to actually build a context graph.I'll try to explain how to build something like a context graph—but go beyond it. I deeply believe that to make this work, we need specific agentic memory and a set of cognitive processes that truly help agents use this memory and learn from experience and data.That's why this is the Book: Beyond Context Graphs—with a focus on real-life enterprise tasks and how to make agents make better decisions and, let's say, hallucinate less.
Mastering Modern Time Series Forecasting is your all-in-one guide to building real-world forecasting systems that work — from classical stats to deep learning and beyond. Whether you're modeling retail demand or energy loads, this book gives you the tools, intuition, and code to go from zero to production. You'll cover ARIMA, ML, deep nets, transformers, and even the rise of FTSMs (Foundational Time Series Models). Written by a practitioner who’s built forecasting solutions for multibillion-dollar businesses, this is the hands-on, honest guide every data scientist, analyst, or forecaster needs.
Most people think they are bad at math. In reality, they were never taught arithmetic properly.This book is a modern English edition of Arithmetic by Alexander P. Kiselev—the text that formed the backbone of mathematical education in Russia and USSR for over a century and helped produce generations of exceptionally strong mathematicians, scientists, and engineers.Unlike modern textbooks that prioritise shortcuts, visuals, and lowered expectations, Kiselev builds arithmetic logically, systematically, and rigorously. Every method is explained. Every operation has meaning. Exercises are carefully sequenced to develop real understanding—not rote pattern-following.This book does not promise “easy math”. It promises something better: clarity, confidence, and competence.Whether you are a student, a parent, a tutor, or an adult rebuilding fundamentals, this book will change how you understand arithmetic—and why so much later mathematics suddenly becomes easier.
The book is also available in paperback for $25. Paperback royalties go to OpenIntro (US-based nonprofit), and the optional Leanpub PDF contributions go to authors to fund their time on this book.
This book brings the fundamentals of R programming to you, using the same material developed as part of the industry-leading Johns Hopkins Data Science Specialization. The skills taught in this book will lay the foundation for you to begin your journey learning data science. Printed copies of this book are available through Lulu.
Data analysis is now part of practically every research project in the life sciences. In this book we use data and computer code to teach the necessary statistical concepts and programming skills to become a data analyst. Instead of showing theory first and then applying it to toy examples, we start with actual applications and describe the theory as it becomes necessary to solve specific challenges. The book includes links to computer code that readers can use to follow along as they program.
The essentials of making predictions using supervised regression and classification for tabular data. Tech stack: python, pandas, scikit-learn, CatBoost, LightGBM, XGBoost
Bridge AI and science with this hands-on guide. Whether you're a researcher learning ML or an engineer entering scientific applications, build real systems across chemistry, biology, physics & climate. Master Transformers, Diffusion Models & GNNs for scientific discovery. 500+ pages, 50+ Colab notebooks. Design molecules, predict proteins, accelerate climate models—all hands-on, zero setup required.
This book provides a practical guide to critical data science methods, focusing on their application in credit risk management. Using examples in R and Python, it presents step-by-step processes for applying various analytical techniques while highlighting the importance of aligning methods with the specific characteristics of the data. Designed for practitioners and those with foundational data science and banking knowledge, the book bridges theory and practice with real-world examples.
D3 Start to Finish shows you how to build a custom, interactive and beautiful data visualisation using the JavaScript library D3.js (versions 6 & 7). The book covers D3.js concepts such as selections, joins, requests, scale functions, events & transitions. You'll put these concepts into practice by building a custom, interactive data visualisation.