Abstract
In this paper, we introduce a dependency treebank of spoken second language (L2) English that is annotated with part of speech (Penn POS) tags and syntactic dependencies (Universal Dependencies). We then evaluate the degree to which the use of this treebank as training data affects POS and UD annotation accuracy for L1 web texts, L2 written texts, and L2 spoken texts as compared to models trained on L1 texts only.
Original language | English |
---|---|
Title of host publication | BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings |
Editors | Ekaterina Kochmar, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Nitin Madnani, Anais Tack, Victoria Yaneva, Zheng Yuan, Torsten Zesch |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 39-45 |
Number of pages | 7 |
ISBN (Electronic) | 9781955917834 |
Publication status | Published - 2022 |
Event | 17th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2022 - Seattle, United States Duration: 2022 Jul 15 → … |
Publication series
Name | BEA 2022 - 17th Workshop on Innovative Use of NLP for Building Educational Applications, Proceedings |
---|
Conference
Conference | 17th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2022 |
---|---|
Country/Territory | United States |
City | Seattle |
Period | 22/7/15 → … |
Bibliographical note
Publisher Copyright:© 2022 Association for Computational Linguistics.
All Science Journal Classification (ASJC) codes
- Language and Linguistics
- Software
- Linguistics and Language