Kick off your book project in 3 hours! Live workshop on Zoom. You’ll leave with a real book project, progress on your first chapter, and a clear plan to keep going. Saturday, June 6, 2026. Learn more…

Leanpub Header

Skip to main content

Learning Pandas 2, Second Edition

Master Data Wrangling, NLP, Geospatial Analysis, and Production ML Pipelines using pandas 2.3

This book is 100% completeLast updated on 2026-05-09

This book is specially written for ML engineers who know what a groupby is but want to know why it's slow and how to fix it; data scientists who understand sentiment analysis but want to see how it connects cleanly to a Pandas pipeline; and data engineers who ship Pandas code to production and need to know which patterns will break on Pandas 3.0 and which are safe.

Minimum price

$29.99

$29.99

You pay

Author earns

$
PDF
EPUB
1 Previous Editionwith 34 Readers
New edition of Learning Pandas 2.0
About

About

About the Book

This book has been updated with Pandas 2.3, and it's exactly what ML engineers, data scientists and data engineers have been waiting for. It's a hands-on desk guide that's full of solutions, and it's the most up-to-date, production-ready book to the most widely used data manipulation library in the Python ecosystem.

This book covers all the big changes in Pandas 2.3, like Copy-on-Write semantics, PyArrow-backed types that save over 50% memory, the new default StringDtype, and the deprecated frequency aliases that are messing up time series pipelines everywhere. All the chapters are based on one growing application using a real Customer Churn dataset, so every technique is put into a context where you can trace it and use it in production.Once you've got the hang of pandas, you will be exploring deep into feature engineering with feature_engine and scikit-learn's set_output API, dealing with class imbalance with SMOTE and ADASYN, and doing distributed computing with Dask, as well as JIT-compiled custom functions with Numba and JAX. On top of that, you'll be able to handle full NLP pipelines from TF-IDF to LDA topic modelling, and geospatial analysis with GeoPandas.

It doesn't matter if you're building ML pipelines, scaling data infrastructure, or connecting pandas to TensorFlow, PyTorch, or JAX, this book will give you the practical depth and modern patterns to do it correctly on pandas 2.3 today, and stay forward-compatible with pandas 3.0 tomorrow.

Key Features
  • Build memory-efficient pipelines using PyArrow backends and targeted dtype choices.
  • Write Copy-on-Write-safe assignment patterns that work on pandas 2.3 and 3.0.
  • Engineer rich ML features using ratios, bins, group statistics, and interaction terms.
  • Handle class imbalance with SMOTE, ADASYN, and quantified pandas-based profiling.
  • Scale datasets beyond RAM using Dask lazy evaluation and distributed cluster computing.
  • Accelerate custom scoring functions with Numba JIT and JAX-compiled batch operations.
  • Extract sentiment, topics, and clusters from raw text using TF-IDF and LDA pipelines.
  • Perform spatial joins, buffer analysis, and geocoding with GeoPandas and geopy.
  • Preserve named DataFrames throughout sklearn Pipelines using the set_output API.
  • Migrate confidently from legacy pandas patterns to pandas 2.3 production standards.

Table of Content
  • Getting Started with Pandas 2.3
  • Data Read, Storage, and File Formats
  • Indexing and Selecting Data
  • Data Manipulation and Transformation
  • Time Series and DateTime Operations
  • Performance Optimization and Scaling
  • Machine Learning with Pandas 2.3
  • Text Mining and NLP
  • Geospatial Data Analysis

Author

About the Author

GitforGits | Asian Publishing House

We are the engineer’s publisher, the coder’s mentor, and the content alchemist—meticulously turning dense tech into practical gold. With a growing library of 100+ titles, we don’t just develop technical books, rather we build roadmaps for professionals across Python, MySQL, DevOps, Rust, AI, Kotlin, Arduino, Golang and everything around the massive IT ecosystem. Every chapter, every script, every project is a tool in the hands of developers who want to get things done.

Where others summarize, we construct step-by-step learning blueprints, cutting through clutter, banning the fluff, and ensuring every paragraph delivers hands-on value. Our audience isn’t learning from scratch—they’re leveling up with purpose, and we stand by them with code-first content, consistent project workflows, and a zero-redundancy approach.

The Leanpub 60 Day 100% Happiness Guarantee

Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.

Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.

You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!

So, there's no reason not to click the Add to Cart button, is there?

See full terms...

Earn $8 on a $10 Purchase, and $16 on a $20 Purchase

We pay 80% royalties on purchases of $7.99 or more, and 80% royalties minus a 50 cent flat fee on purchases between $0.99 and $7.98. You earn $8 on a $10 sale, and $16 on a $20 sale. So, if we sell 5000 non-refunded copies of your book for $20, you'll earn $80,000.

(Yes, some authors have already earned much more than that on Leanpub.)

In fact, authors have earned over $15 million writing, publishing and selling on Leanpub.

Learn more about writing on Leanpub

Free Updates. DRM Free.

If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).

Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.

Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.

Learn more about Leanpub's ebook formats and where to read them

Write and Publish on Leanpub

You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!

Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.

Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.

Learn more about writing on Leanpub