Email the Author

You can use this page to email Gábor László Hajba about Website Scraping with Python.


About the Book

New version by Apress

In 2018 I teamed up with Apress and we released an updated version of this book. You can find it on Amazon: https://amzn.to/2Dkl4gI

This book is the follow-up to my previous one, "XML processing and website scraping in Java". There I looked at ways and tools to process XML and HTML in Java, ran some performance comparisons, and introduced some new programming concepts to make things even better.

In this book I take a closer look at website scraping with two widely used Python tools: BeautifulSoup and Scrapy.

I recreate the sample application from the Java book, this time in Python, use the two tools for parsing, and show examples of how to export CSV files in Python.

As a bonus, I will compare the runtime of the two tools, tune them where possible, and give a quick introduction to plotting the runtimes as charts.

Until it is finished, you can buy the book at a discounted price. The final book will cost around $35.

I will write about the following topics in this book:

  • BeautifulSoup
  • Scrapy
  • Performance comparison
  • Plotting in Python
  • Functional programming with Python
  • Parallel code execution with Python
  • Sample application to gather Amazon data
  • Other real-life projects (source code will be added to the package soon)
  • Update for Scrapy's release and Python 3 (coming soon)

About the Author

Gábor László Hajba

@GHajba


Gábor László Hajba is a versatile Senior Software Developer at ProLion GmbH in Wiener Neustadt, Austria, specializing in Java and Python. With a deep commitment to crafting innovative solutions, Gábor not only excels in technical problem-solving but also takes pride in mentoring his colleagues, helping them grow in their professional journeys.

A published author, Gábor's book "Website Scraping with Python - Using BeautifulSoup and Scrapy", released by Apress in 2018, began as a LeanPub project in 2014, reflecting his passion for sharing knowledge and empowering developers across the globe.

In addition to his technical expertise, Gábor has embarked on a transformative coaching journey, focusing on burnout prevention and personal growth. His work as a mental trainer is dedicated to helping individuals unlock their potential, making meaningful changes in both personal and professional realms. Through his coaching practice, Gábor offers practical strategies for resilience and empowerment.

Beyond his professional endeavors, Gábor is a devoted husband and proud father of a spirited daughter and son. He also nurtures a keen interest in music, aspiring to master the bass guitar, a testament to his relentless pursuit of creativity and balance in life.

Gábor’s journey is a blend of technical mastery, coaching wisdom, and personal fulfillment, embodying his dedication to growth, both in his career and in the lives he touches.
