Segmental Recurrent Neural Networks (1511.06018v2)

Published 18 Nov 2015 in cs.CL and cs.LG

Abstract: We introduce segmental recurrent neural networks (SRNNs) which define, given an input sequence, a joint probability distribution over segmentations of the input and labelings of the segments. Representations of the input segments (i.e., contiguous subsequences of the input) are computed by encoding their constituent tokens using bidirectional recurrent neural nets, and these "segment embeddings" are used to define compatibility scores with output labels. These local compatibility scores are integrated using a global semi-Markov conditional random field. Both fully supervised training -- in which segment boundaries and labels are observed -- as well as partially supervised training -- in which segment boundaries are latent -- are straightforward. Experiments on handwriting recognition and joint Chinese word segmentation/POS tagging show that, compared to models that do not explicitly represent segments such as BIO tagging schemes and connectionist temporal classification (CTC), SRNNs obtain substantially higher accuracies.

Authors (3)

Lingpeng Kong (134 papers)
Chris Dyer (91 papers)
Noah A. Smith (224 papers)

Citations (123)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Segmental Recurrent Neural Networks (1511.06018v2)

Summary

Related Papers