Wrangling F1 Data With R (The Book + Code and Data Samples)
Wrangling F1 Data With R
A Data Junkie's Guide
About the Book
As a driver of technological and engineering innovation, Formula One motorsport is unsurpassed in its relentless pursuit of improvement on a weekly basis. But whilst sports such as cricket and baseball provide a wealth of geekery for the stats fans, F1 seems to lag behind.
If you're attracted by F1's passion to push engineering and technology to the limit, this book will help you grab a range of Formula One datasets by the scruff of the neck and wrangle a wide variety of insights from them.
Using the latest in open source data analysis and visualisation techniques, you'll learn how to extract the stories that often go unnoticed from whatever Formula One data you can lay your hands on. And maybe, just maybe, you'll be able to use the skills you learn along the way outside of the F1 context...
Let your passion for Formula One drive your data skills to new heights... #f1datajunkie
The current release of the book is an early draft. Many of the chapters are in a raw and incomplete form; others aren't published yet. The preview will change regularly, containing anywhere between 30% and 80% of the the latest version of the paid for version. If you pay for the book, you get any and all updates to it for free. The book is priced as it is so that affiliate links work.
The Book + Code and Data Samples
A copy of the book plus some example code files and data files referenced from within the book.
- A Note on the Data Sources
- The Lean and Live Nature of This Book
- What are we trying to do with the data?
- Choosing the tools
- The Data Sources
- Additional Data Sources
- Getting the Data into RStudio
- Example F1 Stats Sites
- How to Use This Book
- The Rest of This Book…
An Introduction to RStudio and R dataframes
- Getting Started with RStudio
- Getting Started with R
Getting the data from the ergast Motor Racing Database API
- Accessing Data from the ergast API
Getting the data from the ergast Motor Racing Database Download
- Accessing the ergast Data via a SQLite Database
- The Virtual Machine Approach
- Getting Started with the ergast Database
- Asking Questions of the ergast Data
Data Scraped from the Formula One Website (Pre-2015)
- Format of the Original scraperwiki.sqlite Database
- Format of the f1com_results_archive.sqlite Database
- Problems with the Formula One Data
- How to use the Formula1.com Data alongside the ergast data
Reviewing the Practice Sessions
- The Weekend Starts Here
- Practice Session Data from the Official Formula One Website Prior up to 2014
- Sector Times (Prior to 2015)
Practice Session Utilisation
- Session Utilisation Charts
- Finding Purple and Green Times
- Stint Detection
- Revisiting the Session Utilisation Chart - Annotations
- Session Summary Annotations
- Session Utilisation Lap Delta Charts
- Useful Functions Derived From This Chapter
A Quick Look at Qualifying
- Qualifying Progression Charts
- Improving the Qualifying Session Progression Tables
- Qualifying Session Rank Position Summary Chart - Towards the Slopegraph
- Rank-Real Plots
- Ultimate Laps
A Further Look at Qualifying
- Clustering Qualifying Laptime by Session
- Purple and Green Laptimes in Qualifying
- How do Session Cut-off Times Evolve Over the Course of Qualifying?
Lapcharts and the Race Slope Graph
- Creating a Lap Chart
- Lap Trivia
- Lap Position Status Charts
- The Race Summary Chart
- Position Change Counts
- The Race Slope Graph
- Further Riffs on the Lapchart Idea
Race History Charts
- The Simple Laptime Chart
- Accumulated Laptimes
- Gap to Leader Charts
- The Lapalyzer Session Gap
- Eventually: The Race History Chart
From Battlemaps to Track Position Maps
- Identifying Track Position From Accumulated Laptimes
- Calculating DIFF and GAP times
- Battles for a particular position
- Generating Track Position Maps
Pit Stop Analysis
- Pit Stop Data
- Pit Stops Over Time
- From Pitstops to Stints
- The Effect of Age on Performance
- Statistical Models of Career Trajectories
- Modeling the Perfromance of F1 Drivers In General
- The Age-Productivity Gradient
- Spotting Runs
- Generating Streak Reports
- Streak Maps
- Team Streaks
- Time to N’th Win
- Looking for Streaks Elsewhere
Keeping an Eye on Competitiveness - Tracking Churn
- Calculating Adjusted Churn - Event Level
- Calculating Adjusted Churn - Across Seasons
- Taking it Further
Laps Completed and Laps Led
- Calculating Laps Completed and Laps Led Percentages
- Comparing laps led counts over seasons
- Comparing Laps Led Counts for Specified Circuits Across Several Years
- Laps Led From Race Position Start
- Detecting Position Change Groupings
- Detecting Undercuts
Comparing Intra-Team Driver Performances
- Intra-Team League Tables
- Race Performance
Points Performance Charts
- Grid Points Productivity
- Maximising Team Points Hauls
- Intra-Team Support
- Points Performance Charts - One-Way
- Points Performance Charts - Two-Way
End of Season Showdown
- Modeling the Points Effects of the Final Championship Race
- Visualising the Outcome
Charting the Championship Race
- Getting the Championship Data
- Charting a Championship Points Race
- Charting the Championship Race Standings
- Appendix - Converting the ergast Database to SQLite
The Leanpub 45-day 100% Happiness Guarantee
Within 45 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms
Free Updates. DRM Free.
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers), EPUB (for phones and tablets) and MOBI (for Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
El Manual del ManagerKeyvan Akbary, Félix López, and Álvaro Salazar
¿Has deseado alguna vez el haber tenido una buena introducción al rol del Engineering Manager? En este libro aprenderás lo necesario para ejercer el rol de una manera efectiva: Expectativas y Responsabilidades del Rol, 1-1s, Ayudar a Crecer, Objetivos, Planes de Carrera, Cultura, Feedback, Contratación, Cultura de Producto y mucho más.
Functional Design and ArchitectureAlexander Granin
Software Design in Functional Programming, Design Patterns and Practices, Methodologies and Application Architectures. How to build real software in Haskell with less efforts and low risks. The first complete source of knowledge.
Ansible for KubernetesJeff Geerling
Ansible is a powerful infrastructure automation tool. Kubernetes is a powerful application deployment platform. Learn how to use these tools to automate massively-scalable, highly-available infrastructure.
CCIE Service Provider Version 4 Written and Lab Exam Comprehensive GuideNicholas Russo
The service provider landscape has changed rapidly over the past several years. Networking vendors are continuing to propose new standards, techniques, and procedures for overcoming new challenges while concurrently reducing costs and delivering new services. Cisco has recently updated the CCIE Service Provider track to reflect these changes; this book represents the author's personal journey in achieving that certification.
CCIE SP v4.1 - WorkbookŁukasz Bromirski, Piotr Jablonski, and Nicholas Russo
Are you striving to prepare to and pass CCIE SP lab exam? Take the opportunity and get this workbook! With the attached initial cfg files you will prepare yourself for the CCIE SP exam as well as learn SP technologies applicable to all kinds of today modern networks! This workbook covers blueprint topics and provides challenging examples.
Practical FP in Scala: A hands-on approachGabriel Volpe
A practical book aimed for those familiar with functional programming in Scala who are yet not confident about architecting an application from scratch.
Together, we will develop a purely functional application using the best libraries in the Cats ecosystem, while learning about design patterns and best practices.
Ansible for DevOpsJeff Geerling
Ansible is a simple, but powerful, server and configuration management tool. Learn to use Ansible effectively, whether you manage one server—or thousands.
C++ Best PracticesJason Turner
Level up your C++, get the tools working for you, eliminate common problems, and move on to more exciting things!
Tame your Work FlowSteve Tendon and Daniel Doiron
Do you need a high performance enterprise governance approach improving management, execution and delivery while dealing with multiple projects/products, events, stakeholders and teams? Giving you better bottom line results, faster time to market, less work, better predictability, happier employees, and delighted clients? Then learn about TameFlow!
R Programming for Data ScienceRoger D. Peng
This book brings the fundamentals of R programming to you, using the same material developed as part of the industry-leading Johns Hopkins Data Science Specialization. The skills taught in this book will lay the foundation for you to begin your journey learning data science. Printed copies of this book are available through Lulu.
11 BooksThe Quality Software Bundle is for managers, would-be managers, and any of us who find themselves being managed and confused. This comprehensive bundle covers the entire span of software development approaches, from hacking through waterfall, cascade, prototyping, Iterative enhancement, reusable code, off-the-shelf, to Agile teams. The bundle...
The Node.js Bundle
3 BooksThis bundle combines three bestselling Leanpub Node.js books into a package that gives you everything you need to get started with developing Node.js applications at an unbeatable price.
The Tester's Library
8 BooksThe Tester's Library consists of eight five-star books that every software tester should read and re-read. As bound books, this collection would cost over $200. Even as e-books, their price would exceed $80, but in this bundle, their cost is only $49.99. Here are the books, and why they should be in your library: Perfect Software and Other...
11 BooksIn this bundle, you will find 10 different agile books. They are about different aspects of being agile. - finding a job - doing coding dojo's - Retrospectives - Personal kanban - a non-typical coaching book and even a book that gives you an insight in the lives of some agile people.
WTFlop 6M + HU - Beta Bundle
Growing Agile: Coach's Guide Series
4 BooksThis bundle provides a collection of training and workshop plans for a variety of agile topics. The series is aimed at agile coaches, trainers and ScrumMasters who often find themselves needing to help teams understand agile concepts. Each book in the series provides the plans, slides, handouts and activity instructions to run a number of...
Marionette.js A to Z
Complete Scala Bundle
3 BooksScala is a general-purpose programming language and it's getting extremely popular these days. Some say that learning Scala could be a challenging task. My experience, however, suggests that this is actually a myth that has very little to do with reality. With the right approach, learning Scala can be easy, fun and rewarding.The first book from...
Build A Better Backbone App
3 BooksThe best way to learn new development skills is through experience, but that takes time you don't have.Get the best of both worlds with this bundle: you'll learn how to produce modern web applications by learning from experienced developers like Derick Bailey and David Sulc. BackboneJS is one of the favorite tools on the web today, but it...
People Skills—Soft but Difficult
7 BooksPerhaps you've been told that "lack of people skills" has been holding you back. No wonder: you may have had hundreds of hours of technical training, but little or no "people skills" guidance.You've heard it said that people skills are "soft," whereas technical skills are "hard." For you, though, technical skills are "easy," but people skills...