KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier (2110.02523v1)

Published 6 Oct 2021 in cs.CL

Abstract: Pre-trained models are widely used in fine-tuning downstream tasks with linear classifiers optimized by the cross-entropy loss, which might face robustness and stability problems. These problems can be improved by learning representations that focus on similarities in the same class and contradictions in different classes when making predictions. In this paper, we utilize the K-Nearest Neighbors Classifier in pre-trained model fine-tuning. For this KNN classifier, we introduce a supervised momentum contrastive learning framework to learn the clustered representations of the supervised downstream tasks. Extensive experiments on text classification tasks and robustness tests show that by incorporating KNNs with the traditional fine-tuning process, we can obtain significant improvements on the clean accuracy in both rich-source and few-shot settings and can improve the robustness against adversarial attacks. \footnote{all codes is available at https://github.com/LinyangLee/KNN-BERT}

Citations (20)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - LinyangLee/KNN-BERT: Code for paper: KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier (23 stars)

KNN-BERT: Fine-Tuning Pre-Trained Models with KNN Classifier (2110.02523v1)

Summary

Related Papers

GitHub