Paragraph specific n-gram approaches to automatically assessing essay quality

Scott Crossley, Caleb DeFore, Kris Kyle, Jianmin Dai, Danielle S. McNamara

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

In this paper, we describe an n-gram approach to automatically assess essay quality in student writing. Underlying this approach is the development of n-gram indices that examine rhetorical, syntactic, grammatical, and cohesion features of paragraph types (introduction, body, and conclusion paragraphs) and entire essays. For this study, we developed over 300 n-gram indices and assessed their potential to predict human ratings of essay quality. A combination of these n-gram indices explained over 30% of the variance in human ratings for essays in a training and testing corpus. The findings from this study indicate the strength of using n-gram indices to automatically assess writing quality. Such indices not only explain text-based factors that influence human judgments of essay quality, but also provide new methods for automatically assessing writing quality.

Original languageEnglish
Title of host publicationProceedings of the 6th International Conference on Educational Data Mining, EDM 2013
EditorsSidney K. D'Mello, Rafael A. Calvo, Andrew Olney
PublisherInternational Educational Data Mining Society
ISBN (Electronic)9780983952527
Publication statusPublished - 2013 Jan 1
Event6th International Conference on Educational Data Mining, EDM 2013 - Memphis, United States
Duration: 2013 Jul 62013 Jul 9

Publication series

NameProceedings of the 6th International Conference on Educational Data Mining, EDM 2013

Conference

Conference6th International Conference on Educational Data Mining, EDM 2013
Country/TerritoryUnited States
CityMemphis
Period13/7/613/7/9

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Information Systems

Fingerprint

Dive into the research topics of 'Paragraph specific n-gram approaches to automatically assessing essay quality'. Together they form a unique fingerprint.

Cite this