A Comparison of Architectures and Pretraining Methods for Contextualized Multilingual Word Embeddings (1912.10169v1)
Abstract: The lack of annotated data in many languages is a well-known challenge within the field of multilingual NLP. Therefore, many recent studies focus on zero-shot transfer learning and joint training across languages to overcome data scarcity for low-resource languages. In this work we (i) perform a comprehensive comparison of state-of-the-art multilingual word and sentence encoders on the tasks of named entity recognition (NER) and part-of-speech (POS) tagging; and (ii) propose a new method for creating multilingual contextualized word embeddings, compare it to multiple baselines, and show that it performs at or above state-of-the-art level in zero-shot transfer settings. Finally, we show that our method allows for better knowledge sharing across languages in a joint training setting.
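For context, zero-shot transfer here means training a tagger on labeled data in one language and evaluating it directly on other languages through a shared multilingual encoder. The sketch below illustrates that general setup with mBERT via Hugging Face transformers; the model name, tag count, and toy batches are illustrative assumptions, not the paper's own method or code.

```python
# Minimal sketch of zero-shot cross-lingual transfer for POS tagging.
# Assumes a pretrained multilingual encoder (here: mBERT); this shows the
# typical evaluation setup, not the paper's proposed embedding method.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

NUM_POS_TAGS = 17  # size of the Universal POS tag set (assumption)

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-multilingual-cased", num_labels=NUM_POS_TAGS
)

# Fine-tune on source-language (English) data; a single toy batch with
# dummy labels stands in for a full training loop here.
english_batch = tokenizer(["The cat sat on the mat"], return_tensors="pt")
labels = torch.zeros(english_batch["input_ids"].shape, dtype=torch.long)
loss = model(**english_batch, labels=labels).loss
loss.backward()  # one gradient step in place of real training

# Evaluate zero-shot on a target language the model saw no labels for:
# the shared multilingual representation is what carries the transfer.
german_batch = tokenizer(["Die Katze saß auf der Matte"], return_tensors="pt")
with torch.no_grad():
    predicted_tag_ids = model(**german_batch).logits.argmax(dim=-1)
print(predicted_tag_ids)
```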