It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT (2010.08275v1)

Published 16 Oct 2020 in cs.CL

Abstract: Recent works have demonstrated that multilingual BERT (mBERT) learns rich cross-lingual representations, that allow for transfer across languages. We study the word-level translation information embedded in mBERT and present two simple methods that expose remarkable translation capabilities with no fine-tuning. The results suggest that most of this information is encoded in a non-linear way, while some of it can also be recovered with purely linear tools. As part of our analysis, we test the hypothesis that mBERT learns representations which contain both a language-encoding component and an abstract, cross-lingual component, and explicitly identify an empirical language-identity subspace within mBERT representations.

Authors (4)

Hila Gonen (30 papers)
Shauli Ravfogel (38 papers)
Yanai Elazar (44 papers)
Yoav Goldberg (142 papers)

Citations (47)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT (2010.08275v1)

Summary

Related Papers