About the Book
This book is the follow-up of my previous one: "XML processing and website scraping in Java". There I looked at ways and tools to process XML and HTML in Java, did some performace comparisons and introduced some new programming concepts to make things even better.
In this book I take a closer look at website scraping with the two tools used nowadays: BeautifulSoup and Scrapy.
I create the sample application from the Java book -- now in Python, use the two tools for parsing, show examples how to export CSV files in Python.
As a bonus I will compare the two tools for their runtime, try to tweak where possible and I will give a quick introduction on plotting the runtimes as charts.
Until it is finished, you can buy the book for a discounted price. The final book will be around $35.
I will write about the following topics in this book:
- Performance comparison
- Plotting in Python
- Functional programming with Python
- Parallel code execution with Python
- Sample application to gather Amazon data
- Other real-life projects (source code coming soon into the package)
- Update for Scrapy's release and Python 3 (coming soon)
About the Author
Gabor Laszlo Hajba is IT Consultant with a core competence of Java and Python. As the CEO of the JaPy Szoftver Kft in Sopron, Hungary he is responsible for designing and developing customer needs in the enterprise software world. Beside this he holds workshops about Java 8 and Java Enterprise Edition.