Spreadsheets and Scraping
About the Bundle
This bundle brings together two books on core data journalism techniques: using spreadsheets to find stories, and 'scraping' information from a range of sources into one easy-to-analyse spreadsheet.
Finding Stories in Spreadsheets
Recipes for interviewing data - and getting answers
One spreadsheet can tell many stories. You just have to know the right questions to ask.
Based on a decade of training journalists and working with news organisations on data-driven stories, Finding Stories In Spreadsheets outlines the techniques for asking the right questions of data using tools like Excel and Google Sheets.
These aren't just questions about numbers: you'll find out how spreadsheet techniques can help you find the 'needle in the haystack' in text data. You'll learn how to clean up and modify your data so that you can ask it different questions, or get it ready for maps or charts, how to create new data from raw materials, and how to combine datasets to look for connections and trends.
With regular examples from journalism and real life data to work with, Finding Stories In Spreadsheets is *full* of those questions, in the language that spreadsheets understand.
Scraping for Journalists (2nd edition)
How to grab information from hundreds of sources, put it in data you can interrogate - and still hit deadlines
Scraping - getting a computer to capture information from online sources - is one of the most powerful techniques for data-savvy journalists who want to get to the story first, or find exclusives that no one else has spotted. Faster than FOI and more detailed than advanced search techniques, scraping also allows you to grab data that organisations would rather you didn’t have - and put it into a form that allows you to get answers.
Scraping for Journalists introduces you to a range of scraping techniques - from very simple ones that are no more complicated than a spreadsheet formula, to more complex challenges such as scraping databases or hundreds of documents. At every stage you'll see results - but you'll also be building towards more ambitious and powerful tools.
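To give a flavour of what "capturing information and putting it into a form you can interrogate" can look like, here is a minimal sketch in Python. It is not taken from the book: the sample HTML, the council names and figures, and the helper names (TableScraper, scrape_table, rows_to_csv) are all invented for illustration, and a real scraper would fetch the page from a live source rather than read a hard-coded string.

```python
import csv
import io
from html.parser import HTMLParser


class TableScraper(HTMLParser):
    """Collects the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows
        self._row = None    # row currently being read, if any
        self._cell = None   # text fragments of the current cell, if any

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None and self._row is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


# Hypothetical page content standing in for a real web page.
SAMPLE_HTML = """
<table>
  <tr><th>Council</th><th>Spend</th></tr>
  <tr><td>Northtown</td><td>1200</td></tr>
  <tr><td>Southvale</td><td>950</td></tr>
</table>
"""


def scrape_table(html):
    """Return the table in `html` as a list of rows (lists of strings)."""
    parser = TableScraper()
    parser.feed(html)
    parser.close()
    return parser.rows


def rows_to_csv(rows):
    """Turn the rows into CSV text that a spreadsheet can open."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


rows = scrape_table(SAMPLE_HTML)
print(rows_to_csv(rows))
```

The sketch uses only Python's standard library so that it runs as-is; the point is the shape of the task (find the repeating structure, pull out the cells, save them as rows), not any particular tool.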
You’ll be scraping within 5 minutes of reading the first chapter - but more importantly you'll be learning key principles and techniques for dealing with scraping problems.
Unlike general books about programming languages, everything in this book has a direct application for journalism, and each principle of programming is related to its application in scraping for newsgathering. And unlike standalone guides and blog posts that cover particular tools or techniques, this book aims to give you skills that you can apply in new situations and with new tools.