Scraping for Journalists
Scraping for Journalists
$15.10
Minimum
$20.01
Suggested
Scraping for Journalists

Last updated on 2016-01-21

About the Book

Scraping - getting a computer to capture information from online sources - is one of the most powerful techniques for data-savvy journalists who want to get to the story first, or find exclusives that no one else has spotted. Faster than FOI and more detailed than advanced search techniques, scraping also allows you to grab data that organisations would rather you didn’t have - and put it into a form that allows you to get answers.

Scraping for Journalists introduces you to a range of scraping techniques - from very simple scraping techniques which are no more complicated than a spreadsheet formula, to more complex challenges such as scraping databases or hundreds of documents. At every stage you'll see results - but you'll also be building towards more ambitious and powerful tools.

You’ll be scraping within 5 minutes of reading the first chapter - but more importantly you'll be learning key principles and techniques for dealing with scraping problems.

Unlike general books about programming languages, everything in this book has a direct application for journalism, and each principle of programming is related to their application in scraping for newsgathering. And unlike standalone guides and blog posts that cover particular tools or techniques, this book aims to give you skills that you can apply in new situations and with new tools.

Bundles that include this book

Data Journalism Heist
Scraping for Journalists
2 Books
$24.09
Regular Price
$19.99
Bundle Price
Finding Stories in Spreadsheets
Scraping for Journalists
2 Books
$28.09
Regular Price
$24.99
Bundle Price
Finding Stories in Spreadsheets
Data Journalism Heist
Scraping for Journalists
3 Books
$37.08
Regular Price
$28.99
Bundle Price
Finding Stories in Spreadsheets
Data Journalism Heist
Scraping for Journalists
3 Books
$37.08
Regular Price
$25.00
Bundle Price

About the Author

Paul Bradshaw
Paul Bradshaw

Paul Bradshaw runs the MA in Online Journalism at Birmingham City University, where he is an associate professor. He publishes the Online Journalism Blog, and is the founder of investigative journalism website HelpMeInvestigate. He has written for the Guardian and Telegraph’s data blogs, journalism.co.uk, Press Gazette, InPublishing, Nieman Reports and the Poynter Institute in the US. Formerly Visiting Professor at City University’s School of Journalism in London, He is the co-author of the Online Journalism Handbook with former Financial Times web editor Liisa Rohumaa, and of Magazine Editing (3rd Edition) with John Morrish. Other books which Bradshaw has contributed to include Investigative Journalism (second edition), Web Journalism: A New Form of Citizenship; and Citizen Journalism: Global Perspectives.

His books on Leanpub include Scraping for JournalistsFinding Stories in Spreadsheets, the Data Journalism Heist and 8000 Holes: How the 2012 Olympic Torch Relay Lost its Way.

Bradshaw has been listed in Journalism.co.uk’s list of the leading innovators in journalism and media and Poynter’s most influential people in social media. In 2010, he was shortlisted for Multimedia Publisher of the Year.

In addition to teaching and writing, Paul acts as a consultant and trainer to a number of organisations on social media and data journalism. You can find him on Twitter @paulbradshaw

The Leanpub Unconditional, No Risk, 100% Happiness Guarantee

Within 45 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms

Write and Publish on Leanpub

Authors and publishers use Leanpub to publish amazing in-progress and completed ebooks, just like this one. You can use Leanpub to write, publish and sell your book as well! Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks. Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. It really is that easy.

Learn more about writing on Leanpub