Predicting perceptions of the lexical richness of short French, German, and Portuguese texts using text-based indices

Jan Vanhove; Audrey Bonvin; Amelia Lambelet; Raphael Berthele

doi:10.17239/jowr-2019.10.03.04

Predicting perceptions of the lexical richness of short French, German, and Portuguese texts using text-based indices

Authors

Jan Vanhove
Audrey Bonvin
Amelia Lambelet
Raphael Berthele

DOI:

https://doi.org/10.17239/jowr-2019.10.03.04

Keywords:

human ratings, lexical diversity, lexical richness, lexical sophistication, predictive modelling

Abstract

We investigated how well readers’ perceptions of the lexical richness of short texts can be predicted on the basis of automatically computable indices of the texts’ lexical properties. 3,060 French, German and Portuguese texts (between 9 and 284 words long) written by 8- to 10-year-olds were rated for their lexical richness by between 3 and 18 uninstructed raters, and over 150 indices were derived from these texts. We found that the ratings could to a substantial degree be predicted on the basis of these indices and that the accuracy with which the ratings of shorter texts could be predicted was comparable to that of longer texts. For French and German, the greatest predictive power was attained by opaque models with scores of predictors, but models with fewer predictors based on a 6-dimensional framework of lexical richness perception or even with a single, easily computed predictor, Guiraud’s index, fared only slightly worse.

Downloads

Abstract
PDF

Published

2019-02-15

Issue

Vol. 10 No. 3 (2019)

Section

Articles

License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 Unported License.

How to Cite

Vanhove, J., Bonvin, A., Lambelet, A., & Berthele, R. (2019). Predicting perceptions of the lexical richness of short French, German, and Portuguese texts using text-based indices. Journal of Writing Research, 10(3), 499-525. https://doi.org/10.17239/jowr-2019.10.03.04

Predicting perceptions of the lexical richness of short French, German, and Portuguese texts using text-based indices

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License

How to Cite

Similar Articles