About the Book
A great deal of hype recently has been directed toward the data scientists who use powerful algorithms and visualization tools to develop new ways of analyzing business data and find new insights. This is challenging, creative work, but by itself a new model or report only provides a one-time benefit. There is an increasingly important new role that has received much less attention than it deserves: that of the data engineer who can take a new model or algorithm and automate it, making it repeatable and accessible to non-expert users such as managers and customers. These unsung heroes create analytical systems, also called "data products", that are critical for organizations to reap ongoing benefits from their data assets.
In A Data Engineer's Manual, we dive into a hierarchy of fundamental knowledge you'll need to understand and work on data products. We will explore "data in the wild", that is, what forms it takes and how it is communicated over the Internet; learn about the roles played by different types of databases---relational, dimensional, and NoSQL; and examine how new data technologies change analytics workflows and deliver value to the business.
About the Author
Joseph W. Clark, Ph.D. has researched and taught information systems and data analytics topics since 2006, most recently at the University of Maine. He was one of the first generation of Web developers in the 1990s, and has been fascinated with databases and data modeling since he first learned how relational databases could power dynamic websites, around 1999. His academic interests lately have been at the intersection of data analytics and entrepreneurship, and new types of workflows such as Agile, Lean, and Design Thinking. His most ambitious project yet is raising four children with his beautiful wife, Xiaofang.