Wrangling F1 Data With R (The Book + Code and Data Samples)
Wrangling F1 Data With R
A Data Junkie's Guide
About the Book
As a driver of technological and engineering innovation, Formula One motorsport is unsurpassed in its relentless pursuit of improvement on a weekly basis. But whilst sports such as cricket and baseball provide a wealth of geekery for the stats fans, F1 seems to lag behind.
If you're attracted by F1's passion to push engineering and technology to the limit, this book will help you grab a range of Formula One datasets by the scruff of the neck and wrangle a wide variety of insights from them.
Using the latest in open source data analysis and visualisation techniques, you'll learn how to extract the stories that often go unnoticed from whatever Formula One data you can lay your hands on. And maybe, just maybe, you'll be able to use the skills you learn along the way outside of the F1 context...
Let your passion for Formula One drive your data skills to new heights... #f1datajunkie
The current release of the book is an early draft. Many of the chapters are in a raw and incomplete form; others aren't published yet. The preview will change regularly, containing anywhere between 30% and 80% of the latest paid-for version. If you pay for the book, you get any and all updates to it for free. The book is priced as it is so that affiliate links work.
The Book + Code and Data Samples
A copy of the book plus some example code files and data files referenced from within the book.
- A Note on the Data Sources
- The Lean and Live Nature of This Book
- What are we trying to do with the data?
- Choosing the tools
- The Data Sources
- Additional Data Sources
- Getting the Data into RStudio
- Example F1 Stats Sites
- How to Use This Book
- The Rest of This Book…
An Introduction to RStudio and R dataframes
- Getting Started with RStudio
- Getting Started with R
Getting the data from the ergast Motor Racing Database API
- Accessing Data from the ergast API
Getting the data from the ergast Motor Racing Database Download
- Accessing the ergast Data via a SQLite Database
- The Virtual Machine Approach
- Getting Started with the ergast Database
- Asking Questions of the ergast Data
Data Scraped from the Formula One Website (Pre-2015)
- Format of the Original scraperwiki.sqlite Database
- Format of the f1com_results_archive.sqlite Database
- Problems with the Formula One Data
- How to use the Formula1.com Data alongside the ergast data
Reviewing the Practice Sessions
- The Weekend Starts Here
- Practice Session Data from the Official Formula One Website up to 2014
- Sector Times (Prior to 2015)
Practice Session Utilisation
- Session Utilisation Charts
- Finding Purple and Green Times
- Stint Detection
- Revisiting the Session Utilisation Chart - Annotations
- Session Summary Annotations
- Session Utilisation Lap Delta Charts
- Useful Functions Derived From This Chapter
A Quick Look at Qualifying
- Qualifying Progression Charts
- Improving the Qualifying Session Progression Tables
- Qualifying Session Rank Position Summary Chart - Towards the Slopegraph
- Rank-Real Plots
- Ultimate Laps
A Further Look at Qualifying
- Clustering Qualifying Laptime by Session
- Purple and Green Laptimes in Qualifying
- How do Session Cut-off Times Evolve Over the Course of Qualifying?
Lapcharts and the Race Slope Graph
- Creating a Lap Chart
- Lap Trivia
- Lap Position Status Charts
- The Race Summary Chart
- Position Change Counts
- The Race Slope Graph
- Further Riffs on the Lapchart Idea
Race History Charts
- The Simple Laptime Chart
- Accumulated Laptimes
- Gap to Leader Charts
- The Lapalyzer Session Gap
- Eventually: The Race History Chart
From Battlemaps to Track Position Maps
- Identifying Track Position From Accumulated Laptimes
- Calculating DIFF and GAP times
- Battles for a particular position
- Generating Track Position Maps
Pit Stop Analysis
- Pit Stop Data
- Pit Stops Over Time
- From Pitstops to Stints
- The Effect of Age on Performance
- Statistical Models of Career Trajectories
- Modeling the Performance of F1 Drivers In General
- The Age-Productivity Gradient
- Spotting Runs
- Generating Streak Reports
- Streak Maps
- Team Streaks
- Time to N’th Win
- Looking for Streaks Elsewhere
Keeping an Eye on Competitiveness - Tracking Churn
- Calculating Adjusted Churn - Event Level
- Calculating Adjusted Churn - Across Seasons
- Taking it Further
Laps Completed and Laps Led
- Calculating Laps Completed and Laps Led Percentages
- Comparing laps led counts over seasons
- Comparing Laps Led Counts for Specified Circuits Across Several Years
- Laps Led From Race Position Start
- Detecting Position Change Groupings
- Detecting Undercuts
Comparing Intra-Team Driver Performances
- Intra-Team League Tables
- Race Performance
Points Performance Charts
- Grid Points Productivity
- Maximising Team Points Hauls
- Intra-Team Support
- Points Performance Charts - One-Way
- Points Performance Charts - Two-Way
End of Season Showdown
- Modeling the Points Effects of the Final Championship Race
- Visualising the Outcome
Charting the Championship Race
- Getting the Championship Data
- Charting a Championship Points Race
- Charting the Championship Race Standings
- Appendix - Converting the ergast Database to SQLite