Hypothesis-Based Collaborative Filtering
Retrieving Like-Minded Individuals Based on the Comparison of Hypothesized Preferences
About the Book
The vast product variety and variation offered by online retailers give individuals an enormous number of options, which makes it challenging for them to find and choose the products that provide the most utility. Consequently, consumers often have to settle for a product that provides sufficient utility. Beyond that, individuals tend to defer product choice altogether, which is known as the overchoice phenomenon.
Recommender systems have emerged in recent years as an effective way to help individuals find interesting products. The increased product variety of online bookstores enhanced consumer welfare by $731 million to $1.03 billion in the year 2000. Consumer welfare refers to consumers’ total satisfaction. This enhancement is 7 to 10 times larger than the consumer welfare gain from increased competition and lower prices in the book market. In other words, recommender systems are essential for increasing consumer welfare, which ultimately leads to an increase in economic and social welfare.
Typically, recommender systems use the collective wisdom of individuals to expose individuals to the products that best fit their preferences, thus maximizing their utility. More precisely, the recommender system considers the product ratings of like-minded individuals when providing recommendations. Commonly, like-minded individuals are retrieved by comparing their ratings for products they have both rated. This filtering technique is commonly referred to as collaborative filtering.
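The comparison of ratings for commonly rated products can be sketched as follows. This is a minimal illustration, not the book's implementation: it assumes Pearson correlation as the similarity measure, and the function name `pearson_on_common`, the users, and the ratings are hypothetical.

```python
from math import sqrt


def pearson_on_common(ratings_a, ratings_b):
    """Pearson correlation of two users' ratings over co-rated products.

    Returns None when fewer than two products are co-rated, or when either
    user's ratings on the co-rated products have zero variance.
    """
    common = sorted(set(ratings_a) & set(ratings_b))
    if len(common) < 2:
        return None
    xs = [ratings_a[p] for p in common]
    ys = [ratings_b[p] for p in common]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return None
    return cov / sqrt(var_x * var_y)


# Hypothetical toy ratings (product id -> rating on a 1..5 scale).
alice = {"m1": 5, "m2": 1, "m3": 4}
bob = {"m1": 4, "m2": 2, "m4": 5}
carol = {"m5": 5, "m6": 1}

sim_alice_bob = pearson_on_common(alice, bob)      # based on m1 and m2 only
sim_alice_carol = pearson_on_common(alice, carol)  # no co-rated products
```

In this toy data, the similarity between `alice` and `bob` rests on just two products, and no similarity can be computed for `alice` and `carol` at all because they share no rated product.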
However, retrieving like-minded individuals based on their ratings for commonly rated products may be inappropriate, because those products are not necessarily a representative sample of the preferences of the two individuals being compared. We show why and when this is the case.
In this dissertation, we present hypothesis-based collaborative filtering (HCF) to expose individuals to the products that best fit their preferences. HCF retrieves like-minded individuals based on the similarity of their hypothesized preferences, which machine learning algorithms derive from their product ratings. Machine learning extracts patterns that generalize from observations and is therefore well suited to hypothesizing individuals’ preferences from their product ratings. We present two frameworks for retrieving like-minded individuals: one compares the composition of hypothesized preferences, and the other compares the predicted utilities individuals receive from products. Furthermore, we provide empirical evidence of the superiority of HCF over baseline collaborative filtering methods.
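The idea of comparing predicted utilities can be sketched as follows. This is a rough illustration under stated assumptions, not the dissertation's method: a per-genre mean rating stands in for the learned model (the book works with classifiers such as decision trees and naïve Bayes), Pearson correlation is assumed as the comparison, and the catalog `GENRES`, the function names, and all ratings are hypothetical.

```python
from math import sqrt

# Hypothetical product catalog: product id -> set of genre features.
GENRES = {
    "m1": {"action"},
    "m2": {"romance"},
    "m3": {"action"},
    "m4": {"romance"},
    "m5": {"action", "comedy"},
    "m6": {"romance", "comedy"},
}


def learn_preferences(ratings):
    """Stand-in learner: mean rating per genre, plus the overall mean."""
    sums, counts = {}, {}
    for product, rating in ratings.items():
        for genre in GENRES[product]:
            sums[genre] = sums.get(genre, 0.0) + rating
            counts[genre] = counts.get(genre, 0) + 1
    overall = sum(ratings.values()) / len(ratings)
    return {g: sums[g] / counts[g] for g in sums}, overall


def predict_utility(model, product):
    """Average the learned genre means; fall back to the overall mean."""
    genre_means, overall = model
    known = [genre_means[g] for g in GENRES[product] if g in genre_means]
    return sum(known) / len(known) if known else overall


def pearson(xs, ys):
    """Pearson correlation; None if either sequence has zero variance."""
    mean_x, mean_y = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return None
    return cov / sqrt(var_x * var_y)


def hypothesized_similarity(ratings_a, ratings_b, products):
    """Correlate both users' predicted utilities over one product set."""
    model_a = learn_preferences(ratings_a)
    model_b = learn_preferences(ratings_b)
    utilities_a = [predict_utility(model_a, p) for p in products]
    utilities_b = [predict_utility(model_b, p) for p in products]
    return pearson(utilities_a, utilities_b)


# Hypothetical users with no co-rated products.
alice = {"m1": 5, "m2": 1}
carol = {"m3": 4, "m4": 2}
dave = {"m3": 1, "m4": 5}

products = sorted(GENRES)
sim_alice_carol = hypothesized_similarity(alice, carol, products)
sim_alice_dave = hypothesized_similarity(alice, dave, products)
```

Because the comparison runs over predicted utilities for a shared product set rather than over co-rated products, a similarity is obtained even though `alice` shares no rated product with `carol` or `dave`.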
I Setting the Scene
1.1 Motivation and Thesis
1.2 Hypothesis-Based Collaborative Filtering in a Nutshell
1.3 Thesis Statement
1.3.1 Research Hypotheses
1.3.2 Research Goals
2 Related Work
2.1 Recommender Systems
2.1.1 Formal Framework
2.2 Collaborative Filtering
2.2.1 General Framework for Collaborative Filtering
2.2.2 Cold-Start Problem
2.3 Machine Learning
II Preference Modeling
3 Conceptualization and Specification of Preferences
3.1 Formalization of Preferences
3.2 Partial Preference Extraction from Machine Learning Models
3.2.1 Partial Preference Extraction from Decision Tree Classifier
3.2.2 Partial Preference Extraction from Naïve Bayesian Classifier
3.3 Ontological Specification of Hypothesized Preferences
3.4 Acceptance of Hypotheses
4 Domain Ontology-Boosted Decision Tree Induction
4.1 Decision Tree Induction
4.1.1 Feature Selection
4.2 SEMTREE Extension to the Decision Tree Model
4.2.1 Basic Idea
4.2.2 Injecting Concept Features to Generalize from Features
4.3 Acceptance of Hypotheses
III Preference Similarity
5 Hypothesized Preference Similarity
5.1 Theoretical Foundation of Hypothesized Preference Similarity
5.1.1 Hypothesized Partial Preference Similarity
5.1.2 Hypothesized Semi-Partial Preference Similarity
5.2 Hypothesized Utility-Based Preference Similarity
5.2.1 Product Set for Utility Prediction
5.2.2 Correlative Predicted Utility-Based Similarity
5.2.3 Probabilistic Predicted Utility-Based Similarity
5.2.4 Probabilistic Predicted Utility-Based Semi-Partial Similarity
5.3 Hypothesis Composition-Based Preference Similarity
5.3.1 Similarity of Hypothesized Partial Preferences
5.3.2 Similarity Computation Based on Partial Preference Similarity Matrix
6.1 Experimental Setting
6.1.1 Performance Metrics
6.2 Candidates for Comparison
6.2.1 Hypothesis-Based Collaborative Filtering Candidates
6.2.2 Baseline Collaborative Filtering Candidates
6.2.3 Baseline Content Filtering Candidates
6.4 Results and Discussion
6.4.1 Rating Prediction Accuracy
6.4.2 Relevance Filtering Quality
6.5 Information Theoretic Reflection of Hypothesized Preferences versus Product Ratings
6.6 Acceptance of Hypotheses
7.1.1 Grounded Theory
7.1.2 Data Collection
7.1.3 Data Analysis
7.2 Theory Development
7.2.2 Comparison of Recommendation Performance
7.3 Theory Consolidation
7.4 Theory Validation
7.4.1 Experimental Setting
7.4.2 Results and Discussion
7.5 Acceptance of Hypotheses
8.1 Conceptual Limitations
8.2 Technical Limitations
9.1 Acceptance of Hypotheses
9.2 Achievements of Research Goals and Thesis
9.3 Opportunities for Future Research
A.4 LiMo Database
A.4.1 Interlinking Movies across Web Pages
B Movie Ontology MO
C MovieLens Dataset
C.1 Genres of MovieLens
C.2 Sparse MovieLens Dataset
D Distribution of Recommendation Performance
E Comparison Between Properties and Recommendation Performance
F Comparison Between Recomm. Perform. regarding Cold-Start Behavior