Learning from Partially Annotated Data: Example-aware Creation of Gap-filling Exercises for Language Learning (2306.01584v2)
Abstract: Since performing exercises (including, e.g., practice tests) forms a crucial component of learning, and creating such exercises requires non-trivial effort from the teacher, there is a great value in automatic exercise generation in digital tools in education. In this paper, we particularly focus on automatic creation of gapfilling exercises for language learning, specifically grammar exercises. Since providing any annotation in this domain requires human expert effort, we aim to avoid it entirely and explore the task of converting existing texts into new gap-filling exercises, purely based on an example exercise, without explicit instruction or detailed annotation of the intended grammar topics. We contribute (i) a novel neural network architecture specifically designed for aforementioned gap-filling exercise generation task, and (ii) a real-world benchmark dataset for French grammar. We show that our model for this French grammar gap-filling exercise generation outperforms a competitive baseline classifier by 8% in F1 percentage points, achieving an average F1 score of 82%. Our model implementation and the dataset are made publicly available to foster future research, thus offering a standardized evaluation and baseline solution of the proposed partially annotated data prediction task in grammar exercise creation.
- Naveed Afzal and Ruslan Mitkov. 2014. Automatic generation of multiple choice questions using dependency-based semantic relations. Soft Computing, 18(7):1269–1281.
- Manish Agarwal and Prashanth Mannem. 2011. Automatic gap-fill question generation from text books. In Proceedings of the sixth workshop on innovative use of NLP for building educational applications, pages 56–64.
- Maha Al-Yahya. 2011. Ontoque: a question generation engine for educational assesment based on domain ontologies. In 2011 IEEE 11th International Conference on Advanced Learning Technologies, pages 393–395. IEEE.
- Learning to reuse distractors to support multiple choice question generation in education. IEEE Transactions on Learning Technologies.
- On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258.
- Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901.
- Electra: Pre-training text encoders as discriminators rather than generators. In International Conference on Learning Representations.
- The siette automatic assessment environment. International Journal of Artificial Intelligence in Education, 26(1):270–292.
- Unsupervised cross-lingual representation learning at scale. arXiv preprint arXiv:1911.02116.
- Analyzing students’ perceptions to improve the design of an automated assessment tool in online distributed programming. Computers & Education, 128:159–170.
- Barbara Gross Davis. 2009. Tools for teaching. John Wiley & Sons.
- Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
- Learning to ask: Neural question generation for reading comprehension. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1342–1352, Vancouver, Canada. Association for Computational Linguistics.
- Constructing open cloze tests using generation and discrimination capabilities of transformers. In Findings of the Association for Computational Linguistics: ACL 2022, pages 1263–1273, Dublin, Ireland. Association for Computational Linguistics.
- Automatic generation system of multiple-choice cloze questions and its evaluation. Knowledge Management & E-Learning: An International Journal, 2(3):210–224.
- Jennifer Hill and Rahul Simha. 2016. Automatic generation of context-based fill-in-the-blank exercises using co-occurrence likelihoods and google n-grams. In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, pages 23–30.
- John Lee and Stephanie Seneff. 2007. Automatic generation of cloze items for prepositions. In Eighth Annual Conference of the International Speech Communication Association.
- Anna Malinova and Olga Rahneva. 2016. Automatic generation of english language test questions using mathematica. In CBU International Conference Proceedings, volume 4, pages 906–909.
- Learning to automatically generate fill-in-the-blank quizzes. arXiv preprint arXiv:1806.04524.
- A computer-aided environment for generating multiple-choice test items. Natural language engineering, 12(2):177–194.
- John W Oller Jr. 1973. Cloze tests of second language proficiency and what they measure 1. Language learning, 23(1):105–118.
- Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35:27730–27744.
- Automatic generation of multiple choice questions from domain ontologies. e-Learning, 1:427–434.
- Generating grammar exercises. In The 7th Workshop on Innovative Use of NLP for Building Educational Applications, NAACL-HLT Worskhop 2012, pages 147–157.
- A selection strategy to improve cloze question quality. In Proceedings of the Workshop on Intelligent Tutoring Systems for Ill-Defined Domains. 9th International Conference on Intelligent Tutoring Systems, Montreal, Canada, pages 22–32.
- Using cognitive models to develop quality multiple-choice questions. Medical teacher, 38(8):838–843.
- Exploring the limits of transfer learning with a unified text-to-text transformer. The Journal of Machine Learning Research, 21(1):5485–5551.
- Automatic generation of language exercises based on a universal methodology: An analysis of possibilities. Bulletin of the Transilvania University of Brasov. Series IV: Philology and Cultural Studies, pages 29–48.
- Katherine Stasaski and Marti A Hearst. 2017. Multiple choice question generation utilizing an ontology. In Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, pages 303–312.
- Measuring non-native speakers’ proficiency of english by using a test with automatically-generated fill-in-the-blank questions. In Proceedings of the second workshop on Building Educational Applications Using NLP, pages 61–68.
- Automatic question tagging with deep neural networks. IEEE Transactions on Learning Technologies, 12(1):29–43.
- Evaluation of automatically generated english vocabulary questions. Research and practice in technology enhanced learning, 12(1):1–21.
- Wilson L Taylor. 1953. “cloze procedure”: A new tool for measuring readability. Journalism quarterly, 30(4):415–433.
- Generalizing from a few examples: A survey on few-shot learning. ACM computing surveys (csur), 53(3):1–34.
- Chain of thought prompting elicits reasoning in large language models. In Advances in Neural Information Processing Systems.