2000 character limit reached
BKTreebank: Building a Vietnamese Dependency Treebank (1710.05519v2)
Published 16 Oct 2017 in cs.CL
Abstract: Dependency treebank is an important resource in any language. In this paper, we present our work on building BKTreebank, a dependency treebank for Vietnamese. Important points on designing POS tagset, dependency relations, and annotation guidelines are discussed. We describe experiments on POS tagging and dependency parsing on the treebank. Experimental results show that the treebank is a useful resource for Vietnamese language processing.