- The paper introduces SKEP, a novel approach that integrates sentiment-specific knowledge into language model pre-training for improved sentiment analysis.
- The paper employs sentiment masking together with three pre-training objectives — sentiment word prediction, word polarity prediction, and aspect-sentiment pair prediction — to enhance learning.
- The paper demonstrates significant performance gains over models like RoBERTa on fine-grained sentiment and opinion role labeling tasks.
Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
The paper presents Sentiment Knowledge Enhanced Pre-training (SKEP), a novel approach that improves sentiment analysis by incorporating sentiment knowledge into the pre-training of language models. This departs from traditional sentiment analysis techniques, which either design models for specific tasks independently or employ pre-training without explicitly leveraging sentiment-specific information.
Motivation and Approach
Despite the success of pre-training methods in NLP such as BERT and RoBERTa, sentiment-specific knowledge is often ignored during pre-training. The paper proposes SKEP to fill this gap by integrating sentiment words, word polarity, and aspect-sentiment pairs into the pre-training phase. This is particularly beneficial for sentiment analysis tasks, which vary widely from sentence-level classification to opinion extraction.
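As a rough illustration, the three kinds of sentiment knowledge can be thought of as a lexicon of polarity-tagged words plus a set of aspect-sentiment pairs. The structure and names below are assumptions of this summary, not the paper's implementation:

```python
from dataclasses import dataclass, field

@dataclass
class SentimentKnowledge:
    """Toy container for the sentiment knowledge SKEP integrates:
    sentiment words with polarity, and aspect-sentiment pairs."""
    word_polarity: dict = field(default_factory=dict)  # word -> "POS" / "NEG"
    aspect_pairs: set = field(default_factory=set)     # (aspect term, sentiment word)

    def is_sentiment_word(self, token: str) -> bool:
        # Case-insensitive lexicon lookup
        return token.lower() in self.word_polarity

# Hypothetical entries for illustration only
kb = SentimentKnowledge(
    word_polarity={"great": "POS", "terrible": "NEG", "fast": "POS"},
    aspect_pairs={("battery", "great"), ("screen", "terrible")},
)
print(kb.is_sentiment_word("Great"))  # True
```

In the paper this knowledge is mined automatically from unlabeled data rather than hand-written as above; the sketch only shows the shape of what the pre-training objectives consume.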
The SKEP model introduces sentiment masking, where sentiment-related tokens are selectively masked to guide the model in learning sentiment-specific dependencies. It defines three main objectives:
- Sentiment Word (SW) Prediction: Recovering masked sentiment words.
- Word Polarity (WP) Prediction: Inferring the polarity of these words.
- Aspect-Sentiment Pair (AP) Prediction: Capturing relationships between aspect terms and related sentiments through multi-label classification.
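The masking step behind these objectives can be sketched loosely as follows: sentiment tokens found in a lexicon are replaced with a [MASK] symbol, and prediction targets are recorded for each objective. This is a toy sketch with assumed names (`WORD_POLARITY`, `ASPECT_PAIRS`, `sentiment_mask`); the actual method operates on subword tokens with a masking budget and trains a Transformer encoder:

```python
# Toy sentiment masking: given a token list and a sentiment lexicon,
# mask sentiment words and collect targets for the three SKEP objectives.
WORD_POLARITY = {"great": "POS", "terrible": "NEG"}  # assumed toy lexicon
ASPECT_PAIRS = {("battery", "great")}                # assumed aspect-sentiment pairs

def sentiment_mask(tokens):
    masked, sw_targets, wp_targets, ap_targets = [], {}, {}, []
    for i, tok in enumerate(tokens):
        if tok in WORD_POLARITY:
            masked.append("[MASK]")
            sw_targets[i] = tok                 # Sentiment Word (SW) target
            wp_targets[i] = WORD_POLARITY[tok]  # Word Polarity (WP) target
        else:
            masked.append(tok)
    # Aspect-Sentiment Pair (AP) targets: multi-label over pairs in the sentence
    for aspect, senti in ASPECT_PAIRS:
        if aspect in tokens and senti in tokens:
            ap_targets.append((aspect, senti))
    return masked, sw_targets, wp_targets, ap_targets

tokens = ["the", "battery", "is", "great"]
masked, sw, wp, ap = sentiment_mask(tokens)
print(masked)  # ['the', 'battery', 'is', '[MASK]']
```

The model would then be trained to recover `sw` and `wp` at the masked positions, and to predict `ap` as a multi-label classification from the sentence representation.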
Key Insights and Results
Notably, SKEP goes beyond random word masking by targeting sentiment tokens, thus producing representations more aligned with sentiment analysis tasks. Strong empirical results underscore SKEP’s effectiveness. It demonstrates superior performance over RoBERTa on various sentiment tasks:
- Achieves notable improvements in fine-grained tasks such as aspect-level sentiment classification and opinion role labeling.
- Outperforms previous state-of-the-art models across multiple datasets, delivering robust sentiment-specific representations.
Implications and Future Directions
The integration of sentiment knowledge into pre-training processes holds considerable promise for enhancing sentiment analysis across numerous domains. This work suggests the potential for expanding the pre-training framework to incorporate additional types of structured knowledge. Future research could explore more refined sentiment mining techniques and evaluate SKEP across an even broader array of sentiment tasks to fully assess its generalizability and impact.
Overall, SKEP represents a significant step toward specialized pre-training methods that marry domain-specific knowledge with generic learning frameworks, opening pathways for more nuanced and contextually relevant natural language understanding.