Behavior Sequence Transformer for E-commerce Recommendation in Alibaba (1905.06874v1)

Published 15 May 2019 in cs.IR and cs.LG

Abstract: Deep learning based methods have been widely used in industrial recommendation systems (RSs). Previous works adopt an Embedding&MLP paradigm: raw features are embedded into low-dimensional vectors, which are then fed on to MLP for final recommendations. However, most of these works just concatenate different features, ignoring the sequential nature of users' behaviors. In this paper, we propose to use the powerful Transformer model to capture the sequential signals underlying users' behavior sequences for recommendation in Alibaba. Experimental results demonstrate the superiority of the proposed model, which is then deployed online at Taobao and obtain significant improvements in online Click-Through-Rate (CTR) comparing to two baselines.

Citations (344)

Summary

  • The paper introduces BST, a Transformer-based architecture that captures sequential user behavior to enhance CTR prediction.
  • BST integrates embedding layers, multi-head self-attention, and MLP to extract detailed user interaction patterns from click sequences.
  • Experimental results show a 7.57% CTR improvement on Taobao, demonstrating the model's scalability and practical impact on e-commerce recommendations.

An Expert Review of "Behavior Sequence Transformer for E-commerce Recommendation in Alibaba"

The paper "Behavior Sequence Transformer for E-commerce Recommendation in Alibaba" puts forward a novel approach to enhance the recommendation system (RS) employed by Taobao, a prominent platform under Alibaba. Herein, the paper introduces the Behavior Sequence Transformer (BST), an architecture leveraging the Transformer model, known for its success in natural language processing tasks, to capture sequential dependencies within users' behavior for improved Click-Through-Rate (CTR) prediction.

Objective and Context

Recommender systems play a pivotal role in the success of e-commerce platforms by predicting users' interests and suggesting relevant items. Traditional methods built on the Embedding and Multi-layer Perceptron (MLP) paradigm have been effective, yet they ignore the sequential nature of user interactions. This work addresses that gap by modeling the sequence of user clicks and adapting the Transformer to extract the sequential signals these interactions carry.

Methodology

The architecture proposed in the paper, BST, follows the Embedding&MLP paradigm common to models such as Wide and Deep Learning (WDL). Key components of BST include:

  1. Embedding Layer: Embeds the various features, including the user profile, item characteristics, and cross features, into low-dimensional vectors. Notably, it also embeds each item in the behavior sequence together with positional information that encodes the order of the user's clicks (see the sketch after this list).
  2. Transformer Layer: This layer employs multi-head self-attention mechanisms to learn in-depth representations of each item in a user's sequence of interactions. The design draws on the ability of attention mechanisms to highlight important relationships within the sequence, addressing the sequential nature ignored by traditional models.
  3. MLP: After the Transformer layer, the resulting embeddings are concatenated with the other feature embeddings and fed into a stacked MLP that models the remaining feature interactions and outputs the CTR prediction.
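For the positional feature in item 1, the paper derives each item's position from the time gap between the recommendation request and that click, rather than from a sinusoidal encoding. A minimal sketch of that computation follows; the hour-level bucketing is an illustrative assumption, not the paper's exact preprocessing.

```python
import torch

def time_gap_positions(click_timestamps: torch.Tensor,
                       recommend_time: torch.Tensor) -> torch.Tensor:
    """Position of the i-th clicked item, per the paper's description:
    pos(v_i) = t(v_t) - t(v_i), the gap between the time of the
    recommendation request t(v_t) and the click time t(v_i).

    click_timestamps: (B, seq_len) unix timestamps of past clicks
    recommend_time:   (B,) unix timestamp of the current request
    """
    gaps = recommend_time.unsqueeze(1) - click_timestamps  # (B, seq_len)
    # Bucketing the raw second-level gaps into hours before embedding
    # is an assumption made here to keep the embedding table small.
    return (gaps // 3600).clamp(min=0).long()
```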

Training these components jointly in an end-to-end framework lets the model capture the user-behavior signals that are crucial for accurate recommendations.
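To make the data flow concrete, below is a minimal PyTorch sketch of the architecture just described. Vocabulary size, sequence length, and embedding dimension are illustrative assumptions; the overall structure (embedding layer, a single Transformer block, and a 1024/512/256 MLP with LeakyReLU and a sigmoid output) follows the paper. For simplicity it uses a learned index-based positional embedding in place of the time-gap positions sketched above.

```python
import torch
import torch.nn as nn

class BSTSketch(nn.Module):
    """Minimal sketch of BST. Dimensions and vocabulary sizes are
    illustrative, not the paper's production settings."""
    def __init__(self, num_items=100_000, d_model=64, seq_len=20,
                 other_feat_dim=128):
        super().__init__()
        self.item_emb = nn.Embedding(num_items, d_model)
        # Learned positional embedding over sequence index; +1 slot
        # because the target item is appended to the click sequence.
        self.pos_emb = nn.Embedding(seq_len + 1, d_model)
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=d_model, nhead=8, dim_feedforward=4 * d_model,
            dropout=0.2, batch_first=True)
        # The paper reports that a single Transformer block works best.
        self.transformer = nn.TransformerEncoder(encoder_layer, num_layers=1)
        # Three-layer MLP (1024/512/256 in the paper) with LeakyReLU.
        mlp_in = (seq_len + 1) * d_model + other_feat_dim
        self.mlp = nn.Sequential(
            nn.Linear(mlp_in, 1024), nn.LeakyReLU(),
            nn.Linear(1024, 512), nn.LeakyReLU(),
            nn.Linear(512, 256), nn.LeakyReLU(),
            nn.Linear(256, 1))

    def forward(self, clicked_items, target_item, other_features):
        # clicked_items: (B, seq_len) item ids; target_item: (B,) item id
        # other_features: (B, other_feat_dim) user/context/cross embeddings
        seq = torch.cat([clicked_items, target_item.unsqueeze(1)], dim=1)
        pos = torch.arange(seq.size(1), device=seq.device).unsqueeze(0)
        x = self.item_emb(seq) + self.pos_emb(pos)
        x = self.transformer(x)                       # (B, seq_len+1, d_model)
        x = torch.cat([x.flatten(1), other_features], dim=1)
        return torch.sigmoid(self.mlp(x)).squeeze(1)  # predicted CTR
```

During training, the scalar output would be paired with the binary click label under a cross-entropy loss, matching the paper's stated objective.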

Experimental Results

The experimental evaluation of BST involved both offline tests on a large Taobao dataset and online A/B testing. Offline, BST reached an AUC of 0.7894, outperforming baselines such as WDL and DIN (Deep Interest Network). More importantly, the online deployment of BST achieved a 7.57% increase in CTR, a tangible improvement in user engagement. The model's scalability and acceptable serving latency further affirm the practicality of deploying such architectures in large-scale industrial settings.
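For reference, the offline metric quoted above is the standard ROC AUC over predicted click probabilities. A typical evaluation loop looks like the following; the model interface and batch layout are hypothetical, matching the sketch above rather than the paper's pipeline.

```python
import torch
from sklearn.metrics import roc_auc_score

@torch.no_grad()
def evaluate_auc(model, data_loader):
    """Offline CTR evaluation: collect predicted click probabilities and
    binary click labels, then compute ROC AUC (the paper reports 0.7894
    for BST on the Taobao dataset)."""
    model.eval()
    scores, labels = [], []
    for clicks, target, other, label in data_loader:  # hypothetical batch layout
        scores.append(model(clicks, target, other))
        labels.append(label)
    return roc_auc_score(torch.cat(labels).numpy(),
                         torch.cat(scores).numpy())
```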

Implications and Future Directions

Applying the Transformer within e-commerce RSs marks a significant step forward in capturing intricate behavior patterns and underscores the value of cross-disciplinary AI advances. Practically, improved CTR translates to more relevant recommendations, potentially enhancing user satisfaction and business metrics such as Gross Merchandise Volume.

Future work could explore varying the number of Transformer layers, incorporating other context-enhancing mechanisms, or integrating multi-modal inputs (e.g., visual content) to further enrich the model's understanding of user intent. Addressing computational cost as sequence lengths grow would also improve the model's real-time applicability.

In sum, the paper presents a well-founded application of advanced attention mechanisms to improving recommendation accuracy in a commercial setting, paving the way for richer interactions between consumers and e-commerce platforms.
