Mastering Modern Time Series Forecasting is your all-in-one guide to building real-world forecasting systems that work — from classical stats to deep learning and beyond. Whether you're modeling retail demand or energy loads, this book gives you the tools, intuition, and code to go from zero to production. You'll cover ARIMA, ML, deep nets, transformers, and even the rise of FTSMs (Foundational Time Series Models). Written by a practitioner who’s built forecasting solutions for multibillion-dollar businesses, this is the hands-on, honest guide every data scientist, analyst, or forecaster needs.
A complete foundation for Statistics, also serving as a foundation for Data Science. Leanpub revenue supports OpenIntro (US-based nonprofit) so we can provide free desk copies to teachers interested in using OpenIntro Statistics in the classroom and expand the project to support free textbooks in other subjects. More resources: openintro.org.
Everything you really need to know in Machine Learning in a hundred pages.
The demand for skilled data science practitioners in industry, academia, and government is rapidly growing. This book introduces concepts from probability, statistical inference, linear regression and machine learning and R programming skills. Throughout the book we demonstrate how these can help you tackle real-world data analysis challenges.
Master language models through mathematics, illustrations, and code―and build your own from scratch!
Are you a data scientist or analyst struggling to take your Jupyter Notebook prototypes to the next level? Have you encountered challenges with code organization, reproducibility, or collaboration as your data science projects grow in complexity? This book is the solution you’ve been seeking. This comprehensive guide bridges the gap between data analysis and software engineering, providing you with the essential tools and best practices to transform your data science projects into scalable, maintainable, and collaborative solutions.
This book brings the fundamentals of R programming to you, using the same material developed as part of the industry-leading Johns Hopkins Data Science Specialization. The skills taught in this book will lay the foundation for you to begin your journey learning data science. Printed copies of this book are available through Lulu.
The book is also available in paperback for $25. Paperback royalties go to OpenIntro (US-based nonprofit), and the optional Leanpub PDF contributions go to authors to fund their time on this book.
This book describes the process of analyzing data. The authors have extensive experience both managing data analysts and conducting their own data analyses, and this book is a distillation of their experience in a format that is applicable to both practitioners and managers in data science. Printed copies are available through Lulu.
The book covers all the key skills needed for preparing, exploring, and analysing longitudinal data. To facilitate understanding and help readers learn these skills, it interweaves statistical modelling with computer code and visualizations. It does this using real-world data, code, and outputs that readers can replicate.
Revised for PyTorch 2.x! In 2019, I published a PyTorch tutorial on Towards Data Science and I was amazed by the reaction from the readers! Their feedback motivated me to write this book to help beginners start their journey into Deep Learning and PyTorch. I hope you enjoy reading this book as much as I enjoy writing it.
This book teaches you how to make machine learning models more interpretable.
Master machine learning interpretability with this comprehensive guide to SHAP – your tool to communicating model insights and building trust in all your machine learning applications.
This book teaches you to use R to effectively visualize and explore complex datasets. Exploratory data analysis is a key part of the data science process because it allows you to sharpen your question and refine your modeling strategies. This book is based on the industry-leading Johns Hopkins Data Science Specialization.
Introduction to Statistics for the Life and Biomedical Sciences is the 4th official OpenIntro book and has been written to be used in conjunction with a set of self-paced learning labs. These labs guide students through learning how to apply statistical ideas and concepts discussed in the text with the R computing language.