Hypothesis-Based Collaborative Filtering
Hypothesis-Based Collaborative Filtering
Retrieving Like-Minded Individuals Based on the Comparison of Hypothesized Preferences
About the Book
The vast product variety and product variation offered by online retailers provide an amazing amount of choice options to individuals, thus posing a big challenge to them finding and choosing interesting products which provide them the most utility. Consequently, consumers have to be satisfied with finding a product that provides them sufficient utility. Beyond that, individuals tend to even defer product choice, which is known as overchoice phenomenon.
Recommender systems have emerged in the past years as an effective method to help individuals with finding interesting products. As a result, the consumer welfare enhanced by $731 million to $1.03 billion in the year 2000 due to the increased product variety of online bookstores. Consumer welfare refers to consumers’ total satisfaction. This enhancement in consumer welfare is 7 to 10 times larger than the consumer welfare gain from increased competition and lower prices in the book market. In other words, recommender systems are essential for increasing consumers welfare, which ultimately leads to an increase of economic and social welfare.
Typically, recommender systems use the collective wisdom of individuals for exposing individuals to products which best fits their preferences, thus maximizing their utility. More precisely, the product ratings of like-minded individuals are considered by the recommender system to provide individuals recommendations. Commonly, like-minded individuals are retrieved by comparing their ratings for common rated products. This filtering technology is commonly referred to as collaborative filtering.
However, retrieving like-minded individuals based on their ratings for common rated products may be inappropriate because common rated products may not necessarily be a representative sample of two individuals’ preferences being compared. We show why and when this is the case.
In this dissertation, we present hypothesis-based collaborative filtering (HCF) to expose individuals to products which best fits their preferences. HCF retrieves like-minded individuals based on the similarity of their hypothesized preferences by means of machine learning algorithms hypothesizing individuals’ preferences. Machine learning is a method to extract patterns to generalize from observations, thus being adequate to hypothesize individuals’ preferences from their product ratings. We present two different frameworks which retrieve like-minded individuals comparing the composition of hypothesized preferences and the predicted utilities individuals receive from products. Furthermore, we provide empirical evidence about the superiority of HCF to baseline collaborative filtering methods.
I Setting the Scene
1.1 Motivation and Thesis
1.2 Hypothesis-Based Collaborative Filtering in a Nutshell
1.3 Thesis Statement
1.3.1 Research Hypotheses
1.3.2 Research Goals
2 Related Work
2.1 Recommender Systems
2.1.1 Formal Framework
2.2 Collaborative Filtering
2.2.1 General Framework for Collaborative Filtering
2.2.2 Cold-Start Problem
2.3 Machine Learning
II Preference Modeling
3 Conceptualization and Specification of Preferences
3.1 Formalization of Preferences
3.2 Partial Preference Extraction from Machine Learning Models
3.2.1 Partial Preference Extraction from Decision Tree Classifier
3.2.2 Partial Preference Extraction from Naïve Bayesian Classifier
3.3 Ontological Specification of Hypothesized Preferences
3.4 Acceptance of Hypotheses
4 Domain Ontology-Boosted Decision Tree Induction
4.1 Decision Tree Induction
4.1.1 Feature Selection
4.2 SEMTREE Extension to the Decision Tree Model
4.2.1 Basic Idea
4.2.2 Injecting Concept Features to Generalize from Features
4.3 Acceptance of Hypotheses
III Preference Similarity
5 Hypothesized Preference Similarity
5.1 Theoretical Foundation of Hypothesized Preference Similarity
5.1.1 Hypothesized Partial Preference Similarity
5.1.2 Hypothesized Semi-Partial Preference Similarity
5.2 Hypothesized Utility-Based Preference Similarity
5.2.1 Product Set for Utility Prediction
5.2.2 Correlative Predicted Utility-Based Similarity
5.2.3 Probabilistic Predicted Utility-Based Similarity
5.2.4 Probabilistic Predicted Utility-Based Semi-Partial Similarity
5.3 Hypothesis Composition-Based Preference Similarity
5.3.1 Similarity of Hypothesized Partial Preferences
5.3.2 Similarity Computation Based on Partial Preference Similarity Matrix
6.1 Experimental Setting
6.1.1 Performance Metrics
6.2 Candidates for Comparison
6.2.1 Hypothesis-Based Collaborative Filtering Candidates
6.2.2 Baseline Collaborative Filtering Candidates
6.2.3 Baseline Content Filtering Candidates
6.4 Results and Discussion
6.4.1 Rating Prediction Accuracy
6.4.2 Relevance Filtering Quality
6.5 Information Theoretic Reflection of Hypothesized Preferences versus Product Ratings
6.6 Acceptance of Hypotheses
7.1.1 Grounded Theory
7.1.2 Data Collection
7.1.3 Data Analysis
7.2 Theory Development
7.2.2 Comparison of Recommendation Performance
7.3 Theory Consolidation
7.4 Theory Validation
7.4.1 Experimental Setting
7.4.2 Results and Discussion
7.5 Acceptance of Hypotheses
8.1 Conceptual Limitations
8.2 Technical Limitations
9.1 Acceptance of Hypotheses
9.2 Achievements of Research Goals and Thesis
9.3 Opportunities for Future Research
A.4 LiMo Database
A.4.1 Interlinking Movies across Web Pages
B Movie Ontology MO
C MovieLens Dataset
C.1 Genres of MovieLens
C.2 Sparse MovieLens Dataset
D Distribution of Recommendation Performance
E Comparison Between Properties and Recommendation Performance
F Comparison Between Recomm. Perform. regarding Cold-Start Behavior
The Leanpub 45-day 100% Happiness Guarantee
Within 45 days of purchase you can get a 100% refund on any Leanpub purchase, in two clicks.
See full terms
Free Updates. DRM Free.
If you buy a Leanpub book, you get free updates for as long as the author updates the book! Many authors use Leanpub to publish their books in-progress, while they are writing them. All readers get free updates, regardless of when they bought the book or how much they paid (including free).
Most Leanpub books are available in PDF (for computers), EPUB (for phones and tablets) and MOBI (for Kindle). The formats that a book includes are shown at the top right corner of this page.
Finally, Leanpub books don't have any DRM copy-protection nonsense, so you can easily read them on any supported device.
Algebra-Driven DesignSandy Maguire
A how-to field guide on building leak-free abstractions and algebraically designing real-world applications.
Production HaskellMatt Parsons
Are you excited about Haskell, but don't know where to begin? Are you thrilled by the technical advantages, but worried about the unknown pitfalls? This book has you covered.
The Hundred-Page Machine Learning BookAndriy Burkov
Everything you really need to know in Machine Learning in a hundred pages.
Ansible for DevOpsJeff Geerling
Ansible is a simple, but powerful, server and configuration management tool. Learn to use Ansible effectively, whether you manage one server—or thousands.
Machine Learning EngineeringAndriy Burkov
"If you intend to use machine learning to solve business problems at scale, I'm delighted you got your hands on this book."
—Cassie Kozyrkov, Chief Decision Scientist at Google
"Foundational work about the reality of building machine learning models in production."
—Karolis Urbonas, Head of Machine Learning and Science at Amazon
Cloud StrategyGregor Hohpe
“Strategy is the difference between making a wish and making it come true.” A successful migration to the cloud shouldn’t be driven by wishes, but guided by a sound strategy, frameworks, and decision models. This book tells you how—without becoming superficial nor getting lost in technology and product details.
CCIE Service Provider Version 4 Written and Lab Exam Comprehensive GuideNicholas Russo
The service provider landscape has changed rapidly over the past several years. Networking vendors are continuing to propose new standards, techniques, and procedures for overcoming new challenges while concurrently reducing costs and delivering new services. Cisco has recently updated the CCIE Service Provider track to reflect these changes; this book represents the author's personal journey in achieving that certification.
CCIE SP v4.1 - WorkbookŁukasz Bromirski, Piotr Jablonski, and Nicholas Russo
Are you striving to prepare to and pass CCIE SP lab exam? Take the opportunity and get this workbook! With the attached initial cfg files you will prepare yourself for the CCIE SP exam as well as learn SP technologies applicable to all kinds of today modern networks! This workbook covers blueprint topics and provides challenging examples.
C++ Best PracticesJason Turner
Level up your C++, get the tools working for you, eliminate common problems, and move on to more exciting things!
Introducing EventStormingAlberto Brandolini
The deepest tutorial and explanation about EventStorming, straight from the inventor.
The Node.js Bundle
3 BooksThis bundle combines three bestselling Leanpub Node.js books into a package that gives you everything you need to get started with developing Node.js applications at an unbeatable price.
The Tester's Library
8 BooksThe Tester's Library consists of eight five-star books that every software tester should read and re-read. As bound books, this collection would cost over $200. Even as e-books, their price would exceed $80, but in this bundle, their cost is only $49.99. Here are the books, and why they should be in your library: Perfect Software and Other...
11 BooksIn this bundle, you will find 10 different agile books. They are about different aspects of being agile. - finding a job - doing coding dojo's - Retrospectives - Personal kanban - a non-typical coaching book and even a book that gives you an insight in the lives of some agile people.
WTFlop 6M + HU - Beta Bundle
Fifty Quick Ideas
3 BooksGet all three books for the price of two! Fifty Quick Ideas books are full of practical, real-world techniques that you can use to improve teamwork, build better products and build them in a better way.
Growing Agile: Coach's Guide Series
4 BooksThis bundle provides a collection of training and workshop plans for a variety of agile topics. The series is aimed at agile coaches, trainers and ScrumMasters who often find themselves needing to help teams understand agile concepts. Each book in the series provides the plans, slides, handouts and activity instructions to run a number of...
Marionette.js A to Z
Complete Scala Bundle
3 BooksScala is a general-purpose programming language and it's getting extremely popular these days. Some say that learning Scala could be a challenging task. My experience, however, suggests that this is actually a myth that has very little to do with reality. With the right approach, learning Scala can be easy, fun and rewarding.The first book from...
Build A Better Backbone App
3 BooksThe best way to learn new development skills is through experience, but that takes time you don't have.Get the best of both worlds with this bundle: you'll learn how to produce modern web applications by learning from experienced developers like Derick Bailey and David Sulc. BackboneJS is one of the favorite tools on the web today, but it...