A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection (2403.00226v3)
Abstract: Detecting temporal semantic changes of words is an important task for various NLP applications that must make time-sensitive predictions. Lexical Semantic Change Detection (SCD) task involves predicting whether a given target word, $w$, changes its meaning between two different text corpora, $C_1$ and $C_2$. For this purpose, we propose a supervised two-staged SCD method that uses existing Word-in-Context (WiC) datasets. In the first stage, for a target word $w$, we learn two sense-aware encoders that represent the meaning of $w$ in a given sentence selected from a corpus. Next, in the second stage, we learn a sense-aware distance metric that compares the semantic representations of a target word across all of its occurrences in $C_1$ and $C_2$. Experimental results on multiple benchmark datasets for SCD show that our proposed method achieves strong performance in multiple languages. Additionally, our method achieves significant improvements on WiC benchmarks compared to a sense-aware encoder with conventional distance functions. Source code is available at https://github.com/LivNLP/svp-sdml .
- Taichi Aida and Danushka Bollegala. 2023a. Swap and predict – predicting the semantic changes in words across corpora by context swapping. In Houda Bouamor, Juan Pino, and Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023. Association for Computational Linguistics, Singapore, pages 7753–7772. https://doi.org/10.18653/v1/2023.findings-emnlp.520.
- Taichi Aida and Danushka Bollegala. 2023b. Unsupervised semantic variation prediction using the distribution of sibling embeddings. In Findings of the Association for Computational Linguistics: ACL 2023. Association for Computational Linguistics, Toronto, Canada, pages 6868–6882. https://doi.org/10.18653/v1/2023.findings-acl.429.
- A comprehensive analysis of PMI-based models for measuring semantic differences. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation. Association for Computational Lingustics, Shanghai, China, pages 21–31. https://aclanthology.org/2021.paclic-1.3.
- Deepmistake: Which senses are hard to distinguish for a word-in-context model. In Computational linguistics and intellectual technologies: Papers from the annual conference Dialogue. volume 20. https://doi.org/10.28995/2075-7182-2021-20-16-30.
- Evaluating the underlying gender bias in contextualized word embeddings. In Proceedings of the First Workshop on Gender Bias in Natural Language Processing. Association for Computational Linguistics, Florence, Italy, pages 33–39. https://doi.org/10.18653/v1/W19-3805.
- Christin Beck. 2020. DiaSense at SemEval-2020 task 1: Modeling sense change via pre-trained BERT embeddings. In Proceedings of the Fourteenth Workshop on Semantic Evaluation. International Committee for Computational Linguistics, Barcelona (online), pages 50–58. https://doi.org/10.18653/v1/2020.semeval-1.4.
- Dallas Card. 2023. Substitution-based semantic change detection using contextual embeddings. In Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Toronto, Canada, pages 590–602. https://doi.org/10.18653/v1/2023.acl-short.52.
- XL-LEXEME: WiC pretrained model for cross-lingual LEXical sEMantic changE. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Toronto, Canada, pages 1577–1585. https://doi.org/10.18653/v1/2023.acl-short.135.
- Yair Censor and Stravoz A. Zenios. 1997. Parellel Optimization. Oxford University Press.
- Unsupervised cross-lingual representation learning at scale. In Dan Jurafsky, Joyce Chai, Natalie Schluter, and Joel Tetreault, editors, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, pages 8440–8451. https://doi.org/10.18653/v1/2020.acl-main.747.
- Paul Cook and Suzanne Stevenson. 2010. Automatically identifying changes in the semantic orientation of words. In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC’10). European Language Resources Association (ELRA), Valletta, Malta.
- Information-theoretic metric learning. In Proceedings of the 24th International Conference on Machine Learning. Association for Computing Machinery, New York, NY, USA, ICML ’07, page 209–216. https://doi.org/10.1145/1273496.1273523.
- Jeffrey Dean and Sanjay Ghemawat. 2004. Mapreduce: Simplified data processing on large clusters. In OSDI’04: Sixth Symposium on Operating System Design and Implementation. San Francisco, CA, pages 137–150.
- Short-term meaning shift: A distributional exploration. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, pages 2069–2075. https://doi.org/10.18653/v1/N19-1210.
- BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, pages 4171–4186.
- Time-out: Temporal referencing for robust modeling of lexical semantic change. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, pages 457–470. https://doi.org/10.18653/v1/P19-1044.
- Do not fire the linguist: Grammatical profiles help language models detect semantic change. In Nina Tahmasebi, Syrielle Montariol, Andrey Kutuzov, Simon Hengchen, Haim Dubossarsky, and Lars Borin, editors, Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change. Association for Computational Linguistics, Dublin, Ireland, pages 54–67. https://doi.org/10.18653/v1/2022.lchange-1.6.
- Dimensionality reduction by learning an invariant mapping. In 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2 (CVPR’06). IEEE, volume 2, pages 1735–1742.
- Diachronic word embeddings reveal statistical laws of semantic change. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Berlin, Germany, pages 1489–1501. https://doi.org/10.18653/v1/P16-1141.
- Temporal analysis of language through neural language models. In Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science. Association for Computational Linguistics, Baltimore, MD, USA, pages 61–65. https://doi.org/10.3115/v1/W14-2517.
- Statistically significant detection of linguistic change. In WWW 2015. pages 625–635.
- Andrey Kutuzov and Mario Giulianelli. 2020. UiO-UvA at SemEval-2020 task 1: Contextualised embeddings for lexical semantic change detection. In Proceedings of the Fourteenth Workshop on Semantic Evaluation. International Committee for Computational Linguistics, Barcelona (online), pages 126–134. https://doi.org/10.18653/v1/2020.semeval-1.14.
- Diachronic word embeddings and semantic shifts: a survey. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, pages 1384–1397. https://aclanthology.org/C18-1117.
- Andrey Kutuzov and Lidia Pivovarova. 2021. Three-part diachronic semantic change dataset for Russian. In Nina Tahmasebi, Adam Jatowt, Yang Xu, Simon Hengchen, Syrielle Montariol, and Haim Dubossarsky, editors, Proceedings of the 2nd International Workshop on Computational Approaches to Historical Language Change 2021. Association for Computational Linguistics, Online, pages 7–13. https://doi.org/10.18653/v1/2021.lchange-1.2.
- Grammatical profiling for semantic change detection. In Arianna Bisazza and Omri Abend, editors, Proceedings of the 25th Conference on Computational Natural Language Learning. Association for Computational Linguistics, Online, pages 423–434. https://doi.org/10.18653/v1/2021.conll-1.33.
- Explaining and improving BERT performance on lexical semantic change detection. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics, Online, pages 192–202. https://doi.org/10.18653/v1/2021.eacl-srw.25.
- Mind the gap: Assessing temporal generalization in neural language models. In A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan, editors, Advances in Neural Information Processing Systems. https://openreview.net/forum?id=73OmmrCfSyy.
- AM2iCo: Evaluating word meaning in context across low-resource languages with adversarial examples. In Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih, editors, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic, pages 7151–7162. https://doi.org/10.18653/v1/2021.emnlp-main.571.
- Ilya Loshchilov and Frank Hutter. 2019. Decoupled weight decay regularization. In International Conference on Learning Representations.
- TimeLMs: Diachronic language models from Twitter. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. Association for Computational Linguistics, Dublin, Ireland, pages 251–260. https://doi.org/10.18653/v1/2022.acl-demo.25.
- SemEval-2021 task 2: Multilingual and cross-lingual word-in-context disambiguation (MCL-WiC). In Alexis Palmer, Nathan Schneider, Natalie Schluter, Guy Emerson, Aurelie Herbelot, and Xiaodan Zhu, editors, Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021). Association for Computational Linguistics, Online, pages 24–36. https://doi.org/10.18653/v1/2021.semeval-1.3.
- Leveraging contextual embeddings for detecting diachronic semantic shift. In Proceedings of the Twelfth Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, pages 4811–4819. https://aclanthology.org/2020.lrec-1.592.
- Quantitative analysis of culture using millions of digitized books. Science 331(6014):176–182. https://doi.org/10.1126/science.1199644.
- Variance matters: Detecting semantic differences without corpus/word alignment. In Houda Bouamor, Juan Pino, and Kalika Bali, editors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Singapore, pages 15609–15622. https://doi.org/10.18653/v1/2023.emnlp-main.965.
- (chat)gpt v bert: Dawn of justice for semantic change detection. In arXiv (accepted to Findings of EACL2024). https://arxiv.org/abs/2401.14040.
- Mohammad Taher Pilehvar and Jose Camacho-Collados. 2019. WiC: the word-in-context dataset for evaluating context-sensitive meaning representations. In Jill Burstein, Christy Doran, and Thamar Solorio, editors, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, pages 1267–1273. https://doi.org/10.18653/v1/N19-1128.
- Maxim Rachinskiy and Nikolay Arefyev. 2021. Zeroshot crosslingual transfer of a gloss language model for semantic change detection. In Computational linguistics and intellectual technologies: Papers from the annual conference Dialogue. volume 20. https://doi.org/10.28995/2075-7182-2021-20-578-586.
- XL-WiC: A multilingual benchmark for evaluating semantic contextualization. In Bonnie Webber, Trevor Cohn, Yulan He, and Yang Liu, editors, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, pages 7193–7206. https://doi.org/10.18653/v1/2020.emnlp-main.584.
- Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, Hong Kong, China, pages 3980–3990.
- Time masking for temporal language models. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. Association for Computing Machinery, New York, NY, USA, WSDM ’22, pages 833–841. https://doi.org/10.1145/3488560.3498529.
- Guy D. Rosin and Kira Radinsky. 2022. Temporal attention for language models. In Findings of the Association for Computational Linguistics: NAACL 2022. Association for Computational Linguistics, Seattle, United States, pages 1498–1508. https://doi.org/10.18653/v1/2022.findings-naacl.112.
- SemEval-2020 task 1: Unsupervised lexical semantic change detection. In Proceedings of the Fourteenth Workshop on Semantic Evaluation. International Committee for Computational Linguistics, Barcelona (online), pages 1–23. https://doi.org/10.18653/v1/2020.semeval-1.1.
- An information-theoretic approach to prompt engineering without ground truth labels. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Dublin, Ireland, pages 819–862.
- Survey of computational approaches to lexical semantic change detection. Computational approaches to semantic change 6:1.
- Can word sense distribution detect semantic changes of words? In Houda Bouamor, Juan Pino, and Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023. Association for Computational Linguistics, Singapore, pages 3575–3590. https://doi.org/10.18653/v1/2023.findings-emnlp.231.
- Elizabeth Closs Traugott and Richard B. Dasher. 2001. Prior and current work on semantic change, Cambridge University Press, page 51–104. Cambridge Studies in Linguistics. https://doi.org/10.1017/CBO9780511486500.004.
- Discovering universal geometry in embeddings with ICA. In Houda Bouamor, Juan Pino, and Kalika Bali, editors, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Singapore, pages 4647–4675. https://doi.org/10.18653/v1/2023.emnlp-main.283.
- Dynamic word embeddings for evolving semantic discovery. In WSDM 2018. page 673–681. https://doi.org/10.1145/3159652.3159703.
- Improving temporal generalization of pre-trained language models with lexical semantic change. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates, pages 6380–6393. https://aclanthology.org/2022.emnlp-main.428.
- Learning sense-specific static embeddings using contextualised word embeddings as a proxy. In Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation. Association for Computational Lingustics, Shanghai, China, pages 493–502. https://aclanthology.org/2021.paclic-1.52.
- Taichi Aida (7 papers)
- Danushka Bollegala (84 papers)