The Leanpub 60 Day 100% Happiness Guarantee
Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms...
Kick off your book project in 2 hours, get started with GhostAI in 2 hours, or do both! Free live workshops, on Zoom. You’ll leave with a real book project and a clear plan to keep going. Saturday, June 27, 2026.
Quick Start Guide
A hands-on guide to building, scheduling, and deploying
data pipelines with Apache Airflow 2.x from scratch to
AWS MWAA production deployment.
Minimum price
$9.99
$11.99
About the Book
Data pipelines don't manage themselves.
As workflows grow more complex, teams need a reliable way
to schedule tasks, handle failures automatically, and
monitor everything from a single place. Apache Airflow is
how the best engineering teams do exactly that — and this
book teaches you how to use it, from your very first DAG
all the way to a production deployment on AWS.
This is a practical, hands-on guide. Every chapter builds
on the last, every concept comes with real code, and by
the end you will have worked through a complete data
engineering workflow that mirrors what teams use in
production today.
WHAT YOU WILL LEARN
──────────────────────
Part 1 — Foundations
You will start by understanding what Airflow is, why it
exists, and how its core components fit together. Then
you will install it locally and write your first working
DAG — a real pipeline that runs on your own machine.
Part 2 — Building Pipelines
You will learn the tools data engineers use every day.
How to write Python tasks and pass data between them
using XComs. How to schedule pipelines and handle
historical backfills. How to store credentials securely
using Variables, Connections, and Secrets backends. How
to monitor DAG runs, read logs, and set up alerts.
Part 3 — Real-World Use Case
You will build a complete ETL pipeline — extracting data
from an API, transforming it with Pandas, creating the
target table, and loading it into PostgreSQL. You will
add production reliability with retry logic, SLA
monitoring, Slack alerts, and failure callbacks. Then
you will extend the pipeline to orchestrate AWS services
including S3, Lambda, and Redshift, and wire it into a
CI/CD workflow using Git and GitHub Actions.
Part 4 — Deployment and Scaling
You will move your pipelines to the cloud using AWS MWAA
— Amazon's fully managed Airflow service. You will set
up a complete MWAA environment from scratch, deploy your
DAGs via S3, and learn how to debug issues using
CloudWatch logs. You will also compare MWAA against
Google Cloud Composer and Astronomer so you can make an
informed choice for your own infrastructure.
Part 5 — Beyond the Basics
You will go deeper with advanced Airflow features —
building custom operators and plugins, generating tasks
dynamically at runtime using dynamic task mapping, and
optimising pipeline performance through scheduler tuning,
XCom management, and efficient operator design.
WHO THIS BOOK IS FOR
──────────────────────
This book is most useful if you are:
- A data engineer looking to adopt Airflow as your
orchestration layer
- A software developer moving into data infrastructure
- A data analyst or scientist whose scripts need to run
on a reliable schedule
- A DevOps or cloud engineer deploying Airflow in
production
You do not need prior Airflow experience. You will get
the most out of this book if you are comfortable with
Python, the command line, and basic SQL.
WHAT IS INCLUDED
──────────────────────
- 20 focused chapters across 5 parts
- Complete Airflow CLI command reference (Appendix A)
- Ready to use DAG templates — Simple ETL, Dynamic
Task Mapping, DAG Factory (Appendix B)
- Official documentation and community resource links
(Appendix C)
- 125 pages of practical, code-first content
TECHNICAL DETAILS
──────────────────────
- Written for Apache Airflow 2.x
- All code tested against Airflow 2.6+
- Python 3.8+
- AWS MWAA, S3, Lambda, Redshift examples included
About the Author
Rohit Jain is a software engineer and technical author
specialising in Software Engineering and workflow Orchestration.
Click the buttons to get the free sample in PDF or EPUB, or read the sample online here
Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms...
We pay 80% royalties on purchases of $7.99 or more, and 80% royalties minus a 50 cent flat fee on purchases between $0.99 and $7.98. You earn $8 on a $10 sale, and $16 on a $20 sale. So, if we sell 5000 non-refunded copies of your book for $20, you'll earn $80,000.
(Yes, some authors have already earned much more than that on Leanpub.)
In fact, authors have earned over $15 million writing, publishing and selling on Leanpub.
Learn more about writing on Leanpub
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Learn more about Leanpub's ebook formats and where to read them
You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!
Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.
Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.