MOST COMMON MISTAKES IN MACHINE LEARNING AND HOW TO AVOID THEM
With Examples in Python
About the Book
This book is a compilation of the most common mistakes made when building machine learning models. I gathered the list from mistakes I typically find when grading assignments, supervising graduate students, reading blog posts, and looking at the code that accompanies published papers, and, of course, from my own experience making those mistakes.
The book includes examples in Python. Some of the mistakes it covers are:
- Not understanding the data
- Including irrelevant variables
- Data injection
- Assuming all users behave the same
- Wasting unlabeled data
- and much more!
Table of Contents
Introduction
Terminology
1 Not understanding the data
2 Reporting train performance
3 Not setting a seed value
4 Including irrelevant features
5 Ignoring differences in scales
6 Using the test set for fine-tuning
7 Only reporting accuracy
8 Not comparing against a baseline
9 Not accounting for variance
10 Injecting data into the test set
11 Not shuffling the training data
12 Not saving the results
13 Not parallelizing
14 Encoding categories as integers
15 Forgetting that data changes over time
16 Ignoring inter-user variance
17 Wasting unlabeled data
Appendix: Set Up Your Environment