A Data Engineer's Manual
A Data Engineer's Manual
About the Book
A great deal of hype recently has been directed toward the data scientists who use powerful algorithms and visualization tools to develop new ways of analyzing business data and find new insights. This is challenging, creative work, but by itself a new model or report only provides a one-time benefit. There is an increasingly important new role that has received much less attention than it deserves: that of the data engineer who can take a new model or algorithm and automate it, making it repeatable and accessible to non-expert users such as managers and customers. These unsung heroes create analytical systems, also called "data products", that are critical for organizations to reap ongoing benefits from their data assets.
In A Data Engineer's Manual, we dive into a hierarchy of fundamental knowledge you'll need to understand and work on data products. We will explore "data in the wild", that is, what forms it takes and how it is communicated over the Internet; learn about the roles played by different types of databases---relational, dimensional, and NoSQL; and examine how new data technologies change analytics workflows and deliver value to the business.
About the Contributors
Table of Contents
-
Preface: Productizing Data
- Analytics and Business Value
- A Hierarchy of Data Engineering Knowledge
- References & Recommended Reading
-
Chapter 1: Atoms, Bytes, and Databases
- Atoms
- Bytes
- The Trouble With Files
- Databases
- Data Engineering
- References & Recommended Reading
-
Chapter 2: Data in the Wild
- The Context of Data Sharing
- Formats for Data Interchange
- Other Technologies for Data Serialization
- References & Recommended Reading
-
Chapter 3: A Multitude of Databases
- Defining the Database
- Data Models
- Databases in Applications
- Choices in Logical Database Design
- References & Recommended Reading
-
Chapter 4: Analytics in the Database
- Querying a Database
- The Query Optimizer
- Complex Analyses Made Simple
- Crunching Big Data with MapReduce in the Database
- Logic in the Database
- Summary
- References & Recommended Reading
-
Chapter 5: Opening Your Data to the World
- The language of the Internet
- Communicating in Data
- REST to the Rescue
- Designing an API
- Summary
- References & Recommended Reading
-
Chapter 6: Data for Humans
- Value from data
- Analytics for humans or for machines?
- Augmenting the brain
- Iterating toward enlightenment
- References and Recommended Reading
-
Chapter 7: A Workflow for Analytics Development
- The “Black Box” View
- A Pipeline to Analytics
- Agility in Analytics
- References and Recommended Reading
- About the Author
The Leanpub 60 Day 100% Happiness Guarantee
Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.
You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!
So, there's no reason not to click the Add to Cart button, is there?
See full terms...
Earn $8 on a $10 Purchase, and $16 on a $20 Purchase
We pay 80% royalties on purchases of $7.99 or more, and 80% royalties minus a 50 cent flat fee on purchases between $0.99 and $7.98. You earn $8 on a $10 sale, and $16 on a $20 sale. So, if we sell 5000 non-refunded copies of your book for $20, you'll earn $80,000.
(Yes, some authors have already earned much more than that on Leanpub.)
In fact, authors have earnedover $13 millionwriting, publishing and selling on Leanpub.
Learn more about writing on Leanpub
Free Updates. DRM Free.
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Learn more about Leanpub's ebook formats and where to read them