The tool for the automatic analysis of lexical sophistication (TAALES): version 2.0

Kristopher Kyle, Scott Crossley, Cynthia Berger

Research output: Contribution to journalArticlepeer-review

124 Citations (Scopus)


This study introduces the second release of the Tool for the Automatic Analysis of Lexical Sophistication (TAALES 2.0), a freely available and easy-to-use text analysis tool. TAALES 2.0 is housed on a user’s hard drive (allowing for secure data processing) and is available on most operating systems (Windows, Mac, and Linux). TAALES 2.0 adds 316 indices to the original tool. These indices are related to word frequency, word range, n-gram frequency, n-gram range, n-gram strength of association, contextual distinctiveness, word recognition norms, semantic network, and word neighbors. In this study, we validated TAALES 2.0 by investigating whether its indices could be used to model both holistic scores of lexical proficiency in free writes and word choice scores in narrative essays. The results indicated that the TAALES 2.0 indices could be used to explain 58% of the variance in lexical proficiency scores and 32% of the variance in word-choice scores. Newly added TAALES 2.0 indices, including those related to n-gram association strength, word neighborhood, and word recognition norms, featured heavily in these predictor models, suggesting that TAALES 2.0 represents a substantial upgrade.

Original languageEnglish
Pages (from-to)1030-1046
Number of pages17
JournalBehavior Research Methods
Issue number3
Publication statusPublished - 2018 Jun 1

Bibliographical note

Publisher Copyright:
© 2017, Psychonomic Society, Inc.

All Science Journal Classification (ASJC) codes

  • Experimental and Cognitive Psychology
  • Developmental and Educational Psychology
  • Arts and Humanities (miscellaneous)
  • Psychology (miscellaneous)
  • Psychology(all)


Dive into the research topics of 'The tool for the automatic analysis of lexical sophistication (TAALES): version 2.0'. Together they form a unique fingerprint.

Cite this