Email the Author
You can use this page to email Filipe A. N. Verri about Data Science Project.
About the Book
"Data Science Project: An Inductive Learning Approach" is a comprehensive and methodologically grounded resource designed for students, professionals, and educators seeking a rigorous approach to data science. Rooted in years of academic teaching and practical experience in research and development, this book equips readers with the theoretical and practical tools necessary to conduct end-to-end data science projects with rigor and confidence.
Key Features of the Book
- Comprehensive Project Methodology: Learn a structured approach to data science, emphasizing predictive methods and inductive learning while integrating software engineering principles tailored to the unique challenges of data-driven solutions.
- Foundational Theories and Concepts: Explore the history and definition of data science, with detailed discussions on structured data, tidy data principles, and database normalization — key foundations for effective data handling.
- Advanced Data Handling and Preprocessing: Delve into the mathematics of data operations, ensuring split-invariance and mitigating data leakage, alongside robust preprocessing techniques that align with machine learning requirements.
- Experimental Planning and Validation: Gain insight into the critical role of experimental design in validating data-driven solutions, with a detailed framework for assessing predictive models using relevant performance metrics.
- Agile Methodologies Adapted for Data Science: Discover how to extend Scrum and other agile practices to fit the iterative and exploratory nature of data science projects.
About the Author
The book reflects the author’s extensive experience teaching graduate-level courses, coordinating data science programs, and leading R&D projects for applications in natural language processing, image processing, and spatio-temporal data analysis. This multifaceted background informs a balanced perspective, blending academic rigor with industry relevance.
Why This Book?
While the literature is rich in theoretical works on machine learning and practical guides on data science tools, this book fills a critical gap by providing:
- A focus on the *semantics* of data science tasks, enabling tool-agnostic learning.
- A clear explanation of *why* machine learning works, ensuring a deeper understanding of its principles and limitations.
- Practical guidance for ensuring data integrity and validating solutions in stochastic, real-world environments.
Whether you are looking to teach a course on data science projects or seeking a reference to improve your professional practice, this book provides a formal yet accessible framework for success. It is an indispensable resource for anyone aiming to approach data science with rigor, confidence, and a critical mindset.
Contributions from Dr. Johnny Marques, professor and an expert in critical software development, bring an industry-tested perspective to the software aspects, making this an essential guide for aspiring data scientists, researchers and seasoned professionals alike.
About the Author
Filipe Verri is a Christian, married to Nayara Verri, and father to Tito and Antonio. In his professional career, he graduated at the top of his class with a Bachelor's degree in Computer Science from the Institute of Mathematics and Computer Sciences (ICMC), University of São Paulo (USP), in 2014. He received his Ph.D. from the Computational Mathematics and Computer Science program at ICMC/USP with a FAPESP scholarship in 2018. During his Ph.D., he completed a research internship at the School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, USA. His thesis received an Honorable Mention at the 8th USP Highlight Award in the major area of Earth and Exact Sciences. He was an Assistant Professor in the Computer Science Division (IEC) at the Aeronautics Institute of Technology (ITA) in São José dos Campos, São Paulo, from 2018 to 2024. Currently, he is an Research & Development Consultant and an Affiliate Professor at ITA and Federal University of São Paulo (UNIFESP). His main research area is Data Science, with a special focus on Artificial Intelligence, Machine Learning, Complex Networks, and Complex Systems. He also coordinated several strategic Research and Development projects for the Brazilian Air Force (FAB) on topics such as autonomous navigation, future air traffic control, computer vision, monitoring, and search and rescue.