End-to-end Training and Decoding for Pivot-based Cascaded Translation Model (2305.02261v1)

Published 3 May 2023 in cs.CL and cs.AI

Abstract: Using a pivot language effectively can significantly improve low-resource machine translation. Usually, the two translation models, source-pivot and pivot-target, are trained separately and make no use of the limited (source, target) parallel data. This work proposes an end-to-end training method for the cascaded translation model, together with an improved decoding algorithm. The input to the pivot-target model is replaced by a weighted pivot embedding computed from the probability distribution output by the source-pivot model, which allows the cascade to be trained end-to-end. In addition, we mitigate the inconsistency between tokens and probability distributions that arises when beam search is used in pivot decoding. Experiments demonstrate that our method improves translation quality.
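
The weighted pivot embedding mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy vocabulary, the embedding table, the logits, and the function names `softmax` and `weighted_pivot_embedding` are all assumptions made for illustration. The sketch only shows the general idea of feeding the pivot-target model an expected embedding (a probability-weighted sum over the pivot vocabulary) instead of the embedding of a single decoded token.

```python
import math

def softmax(logits):
    """Turn raw scores into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def weighted_pivot_embedding(probs, embedding_table):
    """Expected embedding: sum over the vocabulary of p(token) * embedding(token)."""
    dim = len(embedding_table[0])
    return [
        sum(p * row[d] for p, row in zip(probs, embedding_table))
        for d in range(dim)
    ]

# Hypothetical pivot vocabulary of 3 tokens with 2-dimensional embeddings.
pivot_embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Hypothetical source-pivot scores for a single pivot position.
probs = softmax([2.0, 1.0, 0.0])

# Soft input: keeps the whole distribution, so it varies smoothly with the scores.
soft_input = weighted_pivot_embedding(probs, pivot_embeddings)

# Hard input: embedding of the single most likely token (a discrete choice).
hard_input = pivot_embeddings[probs.index(max(probs))]
```

In this sketch `soft_input` changes continuously as the source-pivot scores change, whereas `hard_input` only changes when the most likely token changes. That continuity is what would let a training signal from (source, target) pairs reach the source-pivot model in a setup like the one the abstract describes.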

Authors (5)

Hao Cheng
Meng Zhang
Liangyou Li
Qun Liu
Zhihua Zhang
