Solving the Cold-Start Problem in Recommender Systems with Social Tags
(1004.3732v2)
Published 21 Apr 2010 in cs.IR and physics.soc-ph
Abstract: In this paper, based on the user-tag-object tripartite graphs, we propose a recommendation algorithm, which considers social tags as an important role for information retrieval. Besides its low cost of computational time, the experiment results of two real-world data sets, \emph{Del.icio.us} and \emph{MovieLens}, show it can enhance the algorithmic accuracy and diversity. Especially, it can obtain more personalized recommendation results when users have diverse topics of tags. In addition, the numerical results on the dependence of algorithmic accuracy indicates that the proposed algorithm is particularly effective for small degree objects, which reminds us of the well-known \emph{cold-start} problem in recommender systems. Further empirical study shows that the proposed algorithm can significantly solve this problem in social tagging systems with heterogeneous object degree distributions.
The paper’s main contribution is a novel diffusion-based algorithm that uses social tags to bridge users and objects, effectively addressing the cold-start problem.
The methodology employs a user-tag-object tripartite graph and evaluates performance on Del.icio.us and MovieLens using ranking and diversity metrics.
Results indicate improved recommendation accuracy for low-degree objects and enhanced diversity through personalized tag usage.
The paper introduces a diffusion-based recommendation algorithm leveraging social tags within a user-tag-object tripartite graph to address the cold-start problem in recommender systems. The algorithm posits that social tags serve as a conduit connecting users to objects, thereby enhancing recommendation accuracy and diversity.
The proposed algorithm is compared against two baseline algorithms:
User-object diffusion
User-object-tag diffusion
In contrast, the proposed algorithm, user-tag-object diffusion, posits that resources are initially located on tags based on their usage frequency by a target user Ui​. These resources are then distributed to neighboring objects. The final resource vector f′′​ is expressed as:
ajl′​ represents the object-tag relation, where ajk′​=1 if object Oj​ has been assigned by tag Tk​, and ajk′​=0 otherwise.
ail′′​ represents the user-tag relation, where aik′′​ is the number of times that user Ui​ has adopted tag Tk​.
k′(Tl​) is the number of neighboring objects for tag Tl​, where k′(Tl​)=∑j=1m​ajl′​.
The algorithm's advantages are its capacity to generate personalized recommendations, reduced computational time, and the explicit modeling of tags as bridges between users and objects.
The performance of the algorithm is evaluated using three metrics:
Ranking Score (RS): Defined as the rank of the object divided by the number of all uncollected objects for the corresponding user.
Inter Diversity (InterD): Measures the differences in recommendation lists between users. Given ORi​ as the set of recommended objects for user Ui​, InterD is calculated as:
where $S_{jl}=\frac{|\Gamma_{O_j}\cap\Gamma_{O_l}|}{\sqrt{|\Gamma_{O_j}|\times |\Gamma_{O_l}|}$ is the cosine similarity between objects Oj​ and Ol​, and ΓOj​​ denotes the set of users having collected object Oj​.