Email the Author
You can use this page to email Kevin Sahin about Java Web Scraping Handbook.
About the Book
Web scraping or crawling is the art of fetching data from a third party website by downloading and parsing the HTML code to extract the data you want. It can be hard. From bad HTML code to heavy Javascript use and anti-bot techniques, it is often tricky. Lots of companies use it to obtain knowledge concerning competitor prices, news aggregation, mass email collect…
This book will teach you how to extract data from any website, how to deal with AJAX / Javascript heavy websites, break captchas, deploy your scrapers in the cloud and many other advanced techniques.
About the Author
Hi there, I'm Kevin Sahin, the author of the Java Web Scraping Handbook. I have a blog about web scraping where I write about Web scraping and software development. I am also the Co-founder of ScrapingBee the easiest web scraping API.
Previously I spent more than four years building large scale web scrapers in the fintech industry, we're talking about millions of web pages scraped each day. I got my BS in computer science at Paul Sabatier University, in Toulouse, France. I wish I had a book like this when I started my job, to answer all the questions I had. Unfortunately, there wasn't a lot of good resources about web scraping back then. But now there is :)
You can find me on twitter !