Data Science Project
Free!
Minimum price
$10.00
Suggested price

Data Science Project

An Inductive Learning Approach

About the Book

"Data Science Project: An Inductive Learning Approach" is a comprehensive and methodologically grounded resource designed for students, professionals, and educators seeking a rigorous approach to data science. Rooted in years of academic teaching and practical experience in research and development, this book equips readers with the theoretical and practical tools necessary to conduct end-to-end data science projects with rigor and confidence.

Key Features of the Book

  • Comprehensive Project Methodology: Learn a structured approach to data science, emphasizing predictive methods and inductive learning while integrating software engineering principles tailored to the unique challenges of data-driven solutions.
  • Foundational Theories and Concepts: Explore the history and definition of data science, with detailed discussions on structured data, tidy data principles, and database normalization — key foundations for effective data handling.
  • Advanced Data Handling and Preprocessing: Delve into the mathematics of data operations, ensuring split-invariance and mitigating data leakage, alongside robust preprocessing techniques that align with machine learning requirements.
  • Experimental Planning and Validation: Gain insight into the critical role of experimental design in validating data-driven solutions, with a detailed framework for assessing predictive models using relevant performance metrics.
  • Agile Methodologies Adapted for Data Science: Discover how to extend Scrum and other agile practices to fit the iterative and exploratory nature of data science projects.

About the Author

The book reflects the author’s extensive experience teaching graduate-level courses, coordinating data science programs, and leading R&D projects for applications in natural language processing, image processing, and spatio-temporal data analysis. This multifaceted background informs a balanced perspective, blending academic rigor with industry relevance.

Why This Book?

While the literature is rich in theoretical works on machine learning and practical guides on data science tools, this book fills a critical gap by providing:

  • A focus on the *semantics* of data science tasks, enabling tool-agnostic learning.
  • A clear explanation of *why* machine learning works, ensuring a deeper understanding of its principles and limitations.
  • Practical guidance for ensuring data integrity and validating solutions in stochastic, real-world environments.

Whether you are looking to teach a course on data science projects or seeking a reference to improve your professional practice, this book provides a formal yet accessible framework for success. It is an indispensable resource for anyone aiming to approach data science with rigor, confidence, and a critical mindset.

Contributions from Dr. Johnny Marques, professor and an expert in critical software development, bring an industry-tested perspective to the software aspects, making this an essential guide for aspiring data scientists, researchers and seasoned professionals alike.

About the Author

Filipe A. N. Verri
Filipe A. N. Verri

Filipe Verri is a Christian, married to Nayara Verri, and father to Tito and Antonio. In his professional career, he graduated at the top of his class with a Bachelor's degree in Computer Science from the Institute of Mathematics and Computer Sciences (ICMC), University of São Paulo (USP), in 2014. He received his Ph.D. from the Computational Mathematics and Computer Science program at ICMC/USP with a FAPESP scholarship in 2018. During his Ph.D., he completed a research internship at the School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, USA. His thesis received an Honorable Mention at the 8th USP Highlight Award in the major area of Earth and Exact Sciences. He was an Assistant Professor in the Computer Science Division (IEC) at the Aeronautics Institute of Technology (ITA) in São José dos Campos, São Paulo, from 2018 to 2024. Currently, he is an Research & Development Consultant and an Affiliate Professor at ITA and Federal University of São Paulo (UNIFESP). His main research area is Data Science, with a special focus on Artificial Intelligence, Machine Learning, Complex Networks, and Complex Systems. He also coordinated several strategic Research and Development projects for the Brazilian Air Force (FAB) on topics such as autonomous navigation, future air traffic control, computer vision, monitoring, and search and rescue.

About the Contributors

Johnny Marques
Johnny Marques

Prof. Dr.

Reader Testimonials

Ana Carolina Lorena
Ana Carolina Lorena

A Foreword

The book effectively balances theory and practice, focusing on the inductive principles underpinning predictive analytics and machine learning. While other texts focus solely on machine learning algorithms or delve deeply into tool-specific details, this book provides a holistic view of every stage of a data science project. It emphasises the importance of robust data handling, sound statistical learning principles, and meticulous model evaluation. The author thoughtfully integrates the mathematical foundations and practical considerations needed to design and execute successful data science projects. Beyond the technical mechanics, this book challenges readers to critically evaluate their models' strengths and limitations. It underscores the importance of semantics in data handling, equipping readers with the skills to interpret and transform data meaningfully.

The Leanpub 60 Day 100% Happiness Guarantee

Within 60 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.

Now, this is technically risky for us, since you'll have the book or course files either way. But we're so confident in our products and services, and in our authors and readers, that we're happy to offer a full money back guarantee for everything we sell.

You can only find out how good something is by trying it, and because of our 100% money back guarantee there's literally no risk to do so!

So, there's no reason not to click the Add to Cart button, is there?

See full terms...

Earn $8 on a $10 Purchase, and $16 on a $20 Purchase

We pay 80% royalties on purchases of $7.99 or more, and 80% royalties minus a 50 cent flat fee on purchases between $0.99 and $7.98. You earn $8 on a $10 sale, and $16 on a $20 sale. So, if we sell 5000 non-refunded copies of your book for $20, you'll earn $80,000.

(Yes, some authors have already earned much more than that on Leanpub.)

In fact, authors have earnedover $14 millionwriting, publishing and selling on Leanpub.

Learn more about writing on Leanpub

Free Updates. DRM Free.

If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).

Most Leanpub books are available in PDF (for computers) and EPUB (for phones, tablets and Kindle). The formats that a book includes are shown at the top right corner of this page.

Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.

Learn more about Leanpub's ebook formats and where to read them

Write and Publish on Leanpub

You can use Leanpub to easily write, publish and sell in-progress and completed ebooks and online courses!

Leanpub is a powerful platform for serious authors, combining a simple, elegant writing and publishing workflow with a store focused on selling in-progress ebooks.

Leanpub is a magical typewriter for authors: just write in plain text, and to publish your ebook, just click a button. (Or, if you are producing your ebook your own way, you can even upload your own PDF and/or EPUB files and then publish with one click!) It really is that easy.

Learn more about writing on Leanpub