The Bastards Book of Regular Expressions
The Bastards Book of Regular Expressions
Finding Patterns in Everyday Text
About the Book
This is a spinoff of a chapter from the Bastards Book of Ruby. Regular expressions are an essential and useful skill even outside of programming. They can serve not only as a handy tool for anyone whose work involves writing or data, but also act as a gateway into more interesting and complex kinds of programming. While you're waiting for me to finish this experiment in self-publishing, you can get a good start by reading the massive chapter on regular expressions in the BBoR
- Regular Expressions are for Everyone
- Release notes & changelog
- Getting Started
- Finding a proper text editor
- Why a dedicated text editor?
- Windows text editors
- Mac Text Editors
- Sublime Text
- Online regex testing sites
- A better Find-and-Replace
- How to find and replace
- The limitations of Find-and-Replace
- There’s more than find-and-replace
- Your first regex
- Hello, word boundaries
- Word boundaries
- Escape with backslash
- Regex Fundamentals
- Removing emptiness
- The newline character
- Viewing invisible characters
- Match one-or-more with the plus sign
- The plus operator
- Match zero-or-more with the star sign
- The star sign
- Specific and limited repetition
- Curly braces
- Curly braces, maximum and no-limit matching
- Cleaning messily-spaced data
- Anchors: A way to trim emptiness
- The caret as starting anchor
- The dollar sign as the ending anchor
- Escaping special characters
- Matching any letter, any number
- The numeric character class
- Word characters
- Bracketed character classes
- Matching ranges of characters with brackets and hyphens
- All the characters with dot
- Negative character sets
- Negative character sets
- Capture, Reuse
- Parentheses for precedence
- Parentheses for captured groups
- Correcting dates with capturing groups
- Using parentheses without capturing
- Optionality and alternation
- Alternation with the pipe character
- Optionality with the question mark
- Laziness and greediness
- Positive lookahead
- Negative lookahead
- Positive lookbehind
- Negative lookbehind
- The importance of zero-width (TODO)
- Regexes in Real Life
- Why learn Excel?
- The limits of Excel (todo)
- Mixed commas and other delimiters
- Dealing with text charts (todo)
- Completely unstructured text (todo)
- Moving in and out and into Excel
- From Data to HTML (TODO)
- Simple HTML tricks
- Example Domain
- Tabular data to HTML tables
- Mocking full web pages from data
- The Exercises
- Data Cleaning with the Stars
- Normalized alphabetical titles
- Make your own delimiters
- Finding needles in haystacks (TODO)
- Shakespeare’s longest word
- Changing phone format (TODO)
- Telephone game
- Ordering names and dates (TODO)
- Year, months, days
- Preparing for a spreadsheet
- Dating, Associated Press Style (TODO)
- The AP Date format
- Real-world considerations
- The limits of regex
- Sorting a police blotter
- Sloppy copy-and-paste
- Start loose and simple
- Converting XML to tab-delimited data
- The payments XML
- The pattern
- Add more delimitation
- Cleaning up Microsoft Word HTML (TODO)
- Switching visualizations (TODO)
- A visualization in Excel
- From Excel to Google Static Chart
- From Google Static Charts to Google Interactive Charts
- Cleaning up OCR Text (TODO)
- Cheat Sheet
- Moving forward
- Additional references and resources
The Leanpub 45-day 100% Happiness Guarantee
Within 45 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms
Free Updates. DRM Free.
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers), EPUB (for phones and tablets) and MOBI (for Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Ansible for KubernetesJeff Geerling
Ansible is a powerful infrastructure automation tool. Kubernetes is a powerful application deployment platform. Learn how to use these tools to automate massively-scalable, highly-available infrastructure.
Practical FP in Scala: A hands-on approachGabriel Volpe
A practical book aimed for those familiar with functional programming in Scala who are yet not confident about architecting an application from scratch.
Together, we will develop a purely functional application using the best libraries in the Cats ecosystem, while learning about design patterns and best practices.
Functional Design and ArchitectureAlexander Granin
Software Design in Functional Programming, Design Patterns and Practices, Methodologies and Application Architectures. How to build real software in Haskell with less efforts and low risks. The first complete source of knowledge.
Production HaskellMatt Parsons
Are you excited about Haskell, but don't know where to begin? Are you thrilled by the technical advantages, but worried about the unknown pitfalls? This book has you covered.
Tame your Work FlowSteve Tendon and Daniel Doiron
Do you need a high performance enterprise governance approach improving management, execution and delivery while dealing with multiple projects/products, events, stakeholders and teams? Giving you better bottom line results, faster time to market, less work, better predictability, happier employees, and delighted clients? Then learn about TameFlow!
Ansible for DevOpsJeff Geerling
Ansible is a simple, but powerful, server and configuration management tool. Learn to use Ansible effectively, whether you manage one server—or thousands.
Machine Learning EngineeringAndriy Burkov
"If you intend to use machine learning to solve business problems at scale, I'm delighted you got your hands on this book."
—Cassie Kozyrkov, Chief Decision Scientist at Google
"Foundational work about the reality of building machine learning models in production."
—Karolis Urbonas, Head of Machine Learning and Science at Amazon
C++ Best PracticesJason Turner
Level up your C++, get the tools working for you, eliminate common problems, and move on to more exciting things!
Composing SoftwareEric Elliott
All software design is composition: the act of breaking complex problems down into smaller problems and composing those solutions. Most developers have a limited understanding of compositional techniques. It's time for that to change.
El Manual del ManagerKeyvan Akbary, Félix López, and Álvaro Salazar
¿Has deseado alguna vez el haber tenido una buena introducción al rol del Engineering Manager? En este libro aprenderás lo necesario para ejercer el rol de una manera efectiva: Expectativas y Responsabilidades del Rol, 1-1s, Ayudar a Crecer, Objetivos, Planes de Carrera, Cultura, Feedback, Contratación, Cultura de Producto y mucho más.
The Tester's Library
8 BooksThe Tester's Library consists of eight five-star books that every software tester should read and re-read. As bound books, this collection would cost over $200. Even as e-books, their price would exceed $80, but in this bundle, their cost is only $49.99. Here are the books, and why they should be in your library: Perfect Software and Other...
11 BooksIn this bundle, you will find 10 different agile books. They are about different aspects of being agile. - finding a job - doing coding dojo's - Retrospectives - Personal kanban - a non-typical coaching book and even a book that gives you an insight in the lives of some agile people.
WTFlop 6M + HU - Beta Bundle
Marionette.js A to Z
Build A Better Backbone App
3 BooksThe best way to learn new development skills is through experience, but that takes time you don't have.Get the best of both worlds with this bundle: you'll learn how to produce modern web applications by learning from experienced developers like Derick Bailey and David Sulc. BackboneJS is one of the favorite tools on the web today, but it...
General Systems Thinker Bundle
5 BooksThe General Systems Thinker Bundle is just that: a bundle of five books to advance the reader one giant step toward improved thinking, based on General Systems principles. Four of the books are the complete General Systems Series. The fifth is fictional piece which shows some general systems thinkers in action. It's a mystery in which a group of...
Experiential Learning Bundle
4 BooksThis bundle provides all four volumes of the popular Experiential Learning Series at a savings of $20 over the price if purchased separately.
2 BooksAfter getting up and running with Ansible in Jeff Geerling's Ansible for DevOps, strengthen your skills managing tens to thousands of instances and services in Amazon's AWS cloud with Yan Kurniawan's Ansible for AWS.
Learn ECMAScript 6 inside and out
2 BooksFor any technology, it helps to get multiple points of view on the functionality to get the best possible understanding. For ECMAScript 6/2015, no two resources are recommended more frequently thanExploring ES6by Dr. Axel Rauschmayer andUnderstanding ECMAScript 6by Nicholas C. Zakas. These two points of view, investigating the specification and...
Software architecture, for systems old and new
2 BooksThis bundle includes books about hands-on software architecture.