Statistical foundations of machine learning: the book
$12.99
Minimum price
$14.99
Suggested price

Statistical foundations of machine learning: the book

About the Book

The book (whose abridged handbook version is freely available here) is dedicated to all researchers interested in machine learning who are not content with only running lines of (deep learning) code but who are eager to learn about this discipline’s assumptions, limitations, and perspectives.

The book aims to introduce students (at Master or PhD level) with the most important theoretical and applied notions to understand how, when and why machine learning algorithms work. The book contains 16 chapters, several appendices and is accompanied by a GitHub R package.

After an introductory chapter, Chapter 2 introduces the problem of extracting information from observations from an epistemological perspective. Chapter 3 summarises the relevant background material in probability. Chapter 4 introduces graphical modelling, a flexible and interpretable way of representing large variate problems in probabilistic terms. Chapter 5 introduces the classical parametric approach to parametric estimation and hypothesis testing. Chapter 6 introduces the Bayesian parametric approach to parametric estimation, hypothesis testing and decision theory. Chapter 7 presents some nonparametric alternatives to the parametric techniques discussed in Chapter 5 and 6. Chapter 8 introduces supervised learning as the statistical problem of assessing and selecting a hypothesis function on the basis of input/output observations. Chapter 9 reviews the steps which lead from raw observations to a final model. This is a methodological chapter that introduces some algorithmic procedures underlying most of the machine learning techniques. Chapter 10 presents conventional linear approaches to regression and classification. Chapter 11 introduces the most common machine learning techniques which deal with nonlinear regression and classification tasks. Chapter 12 presents the model averaging approach, a recent and powerful way for obtaining improved generalisation accuracy by combining several learning machines. Chapter 13 deals with the problem of dimensionality reduction and in particular with feature selection strategies. Chapter 14 introduces stochastic processes and deals with the use of machine learning techniques for time series forecasting. Chapter 15 presents the problem of estimation in dynamic

settings (notably Kalman filter and HMM). Chapter 16 presents the relation between machine learning and causal inference by highlighting the risks of interpreting observational models in a causal manner. Although the book focuses on supervised learning, some related notions of unsupervised learning and density estimation are presented in Appendix A.

Several scripts and dashboards are used in the main text to illustrate statistical and machine learning notions. All the scripts have been implemented in R, Shiny and are available in a companion R GitHub package.

  • Share this book

  • Categories

    • Data Science
    • Machine Learning
    • R
    • Artificial Intelligence
    • Sciences
  • Feedback

    Contact the Author(s)

About the Author

Gianluca Bontempi
Gianluca Bontempi

Gianluca Bontempi is Full Professor in the Computer Science Department at the Université Libre de Bruxelles (ULB), Brussels, Belgium, co-head of the ULB Machine Learning Group (mlg.ulb.ac.be). He has been Director of (IB)2, the ULB/VUB Interuniversity Institute of Bioinformatics in Brussels in 2013-17.

His main research interests are big data mining, machine learning, bioinformatics, causal inference, predictive modeling and their application to complex tasks in engineering (time series forecasting, fraud detection) and life science (network inference, gene signature extraction). He was Marie Curie fellow researcher, he was awarded in two international data analysis competitions and he took part to many research projects in collaboration with universities and private companies all over Europe.

He is author of more than 250 scientific publications and his H-number is 64. He is Belgian (French Community) national contact point of the CLAIRE network, co-leader of the CLAIRE COVID19 Task Force, associate editor of the International Journal of Forecasting and IEEE Senior Member. He is also co-author of several open-source software packages for bioinformatics, data mining and prediction.

His blog is at https://datascience741.wordpress.com/

The Leanpub 60 Day 100% Happiness Guarantee

Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.

Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.

You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!

So, there's no reason not to click the Add to Cart button, is there?

See full terms...

80% Royalties. Earn $16 on a $20 book.

We pay 80% royalties. That's not a typo: you earn $16 on a $20 sale. If we sell 5000 non-refunded copies of your book or course for $20, you'll earn $80,000.

(Yes, some authors have already earned much more than that on Leanpub.)

In fact, authors have earnedover $13 millionwriting, publishing and selling on Leanpub.

Learn more about writing on Leanpub

Free Updates. DRM Free.

If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).

Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.

Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.

Learn more about Leanpub's ebook formats and where to read them

Write and Publish on Leanpub

You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!

Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.

Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.

Learn more about writing on Leanpub