Email the Author

You can use this page to email Gábor László Hajba about Website Scraping with Python.

About the Book

New version by Apress

In 2018 I teamed-up with Apress and we released an updated version of this book. You can find it on Amazon: https://amzn.to/2Dkl4gI

This book is the follow-up of my previous one: "XML processing and website scraping in Java". There I looked at ways and tools to process XML and HTML in Java, did some performace comparisons and introduced some new programming concepts to make things even better.

In this book I take a closer look at website scraping with the two tools used nowadays: BeautifulSoup and Scrapy.

I create the sample application from the Java book -- now in Python, use the two tools for parsing, show examples how to export CSV files in Python.

As a bonus I will compare the two tools for their runtime, try to tweak where possible and I will give a quick introduction on plotting the runtimes as charts.

Until it is finished, you can buy the book for a discounted price. The final book will be around $35.

I will write about the following topics in this book:

BeautifulSoup
Scrapy
Performance comparison
Plotting in Python
Functional programming with Python
Parallel code execution with Python
Sample application to gather Amazon data
Other real-life projects (source code coming soon into the package)
Update for Scrapy's release and Python 3 (coming soon)

About the Author

Gábor László Hajba

@GHajba

Gábor László Hajba is a versatile Senior Software Developer at ProLion GmbH in Wiener Neustadt, Austria, specializing in Java and Python. With a deep commitment to crafting innovative solutions, Gábor not only excels in technical problem-solving but also takes pride in mentoring his colleagues, helping them grow in their professional journeys.

A published author, Gábor's book "Website Scraping with Python - Using BeautifulSoup and Scrapy", released by Apress in 2018, began as a LeanPub project in 2014, reflecting his passion for sharing knowledge and empowering developers across the globe.

In addition to his technical expertise, Gábor has embarked on a transformative coaching journey, focusing on burnout prevention and personal growth. His work as a mental trainer is dedicated to helping individuals unlock their potential, making meaningful changes in both personal and professional realms. Through his coaching practice, Gábor offers practical strategies for resilience and empowerment.

Beyond his professional endeavors, Gábor is a devoted husband and proud father of a spirited daughter and son. He also nurtures a keen interest in music, aspiring to master the bass guitar, a testament to his relentless pursuit of creativity and balance in life.

Gábor’s journey is a blend of technical mastery, coaching wisdom, and personal fulfillment, embodying his dedication to growth, both in his career and in the lives he touches.