Idiosyncrasies of the HTML parser
Minimum price
Suggested price

Idiosyncrasies of the HTML parser

About the Book

The HTML parser is a piece of software that processes HTML markup and produces an in-memory tree representation (known as the DOM).

The HTML parser has many strange behaviors. This book will highlight the ins and outs of the HTML parser, and contains almost-impossible quizzes.

HTML is not only used by basically all of the web, but it is also part of many modern applications. The HTML parser is part of the foundation of the web platform.

About the Author

Simon Pieters
Simon Pieters

Simon works on web standards and testing the web platform to foster interoperability between web browsers. He is one of the editors of the WHATWG HTML Living Standard, and has helped the research and design of the HTML parser specification.

Table of Contents

  • Preface
    • Intended audience
    • Definition
    • Scope
    • Practical application
    • About the author
    • Acknowledgements
    • Contribute
  • Chapter 1. Introduction
    • The DOM, parsing, and serialization
    • History of HTML parsers
    • The HTML parser is specified
  • Chapter 2. The HTML syntax
    • The doctype
    • Elements
    • Documents
    • Start tags
    • End tags
    • Attributes
    • Optional tags
    • Character references
    • CDATA sections
    • Comments
  • Chapter 3. The HTML parser
    • Overview of the HTML parser
    • Error handling
    • Detecting character encoding
    • Preprocessing the input stream
    • Tokenizer
    • Tree construction
    • Tags that are no longer supported
  • Chapter 4. Scripting complications
    • Revised overview of the HTML parser
    • document.write()
    • Other parser APIs
    • DOM manipulation
  • Chapter 5. Serializing
  • Chapter 6. Security implications
    • Introduction
    • Case studies
    • Best practice
  • Appendix A. Implementations
  • Appendix B. Conformance checkers
    • DTD-based validators
  • Appendix C. Microsyntaxes
    • Numbers
    • Image map coordinates
    • Responsive images
    • Colors
    • Meta refresh

Causes Supported

Amazon Watch

Supporting Indigenous Peoples. Protecting the Amazon.

Amazon Watch is a nonprofit organization founded in 1996 to protect the rainforest and advance the rights of indigenous peoples in the Amazon Basin. We partner with indigenous and environmental organizations in campaigns for human rights, corporate accountability and the preservation of the Amazon's ecological systems.

We envision a world that honors and values cultural and biological diversity and the critical contribution of tropical rainforests to our planet's life support system. We believe that indigenous self-determination is paramount, and see that indigenous knowledge, cultures and traditional practices contribute greatly to sustainable and equitable stewardship of the Earth. We strive for a world in which governments, corporations and civil society respect the collective rights of indigenous peoples to free, prior and informed consent over any activity affecting their territories and resources. We commit, in the spirit of partnership and mutual respect, to support our indigenous allies in their efforts to protect life, land, and culture in accordance with their aspirations and needs.

The Leanpub 60 Day 100% Happiness Guarantee

Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.

Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.

You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!

So, there's no reason not to click the Add to Cart button, is there?

See full terms...

80% Royalties. Earn $16 on a $20 book.

We pay 80% royalties. That's not a typo: you earn $16 on a $20 sale. If we sell 5000 non-refunded copies of your book or course for $20, you'll earn $80,000.

(Yes, some authors have already earned much more than that on Leanpub.)

In fact, authors have earnedover $13 millionwriting, publishing and selling on Leanpub.

Learn more about writing on Leanpub

Free Updates. DRM Free.

If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).

Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.

Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.

Learn more about Leanpub's ebook formats and where to read them

Write and Publish on Leanpub

You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!

Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.

Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.

Learn more about writing on Leanpub