Lexical diversity

Lexical diversity is one aspect of 'lexical richness' and refers to the ratio of different unique word stems (types) to the total number of words (tokens). The term is used in applied linguistics and is quantitatively calculated using numerous different measures including Text-Type Ratio (TTR), vocd[1], and the measure of textual lexical diversity (MTLD)[2].

A common problem with lexical diversity measures, especially TTR, is that text samples containing large number of tokens give lower values for TTR since it is often necessary for the writer or speaker to re-use several function words. One consequence of this is that lexical diversity is better used for comparing texts of equal length.[3] Newer measures of lexical diversity attempt to account for sensitivity to text length.

Definitions

In a 2013 article Scott Jarvis proposed that lexical diversity, similar to diversity in ecology, is a perceptual phenomenon. Lexical redundancy is a positive counterpart of lexical diversity in the same way as lexical variability is the mirror image of repetition. According to Jarvis's model, lexical diversity includes variability, volume, evenness, rarity, dispersion and disparity.[4]

According to Jarvis, the six properties of lexical diversity should be measured by the following indices.

Property Measure
Variability Measure of Textual Lexical Diversity (MTLD)
Volume Total number of words in the text
Evenness Standard deviation of tokens per type
Rarity Mean BNC rank
Dispersion Mean distance between tokens of type
Disparity Mean number of words per sense or Latent Semantic Analysis
gollark: I simply automatically colorize my timetables.
gollark: Still, it seems inefficient. In radio, experimentation can now be done with general purpose SDRs, which is a lot faster than getting hardware built/obtained or something.
gollark: I see.
gollark: Why have dedicated task-specific physical hardware for all the different operations?
gollark: Someone should really develop software defined crystals for this sort of thing.

References

  1. McCarthy, Phillip; Jarvis, Scott (2007). "vocd: A theoretical and empirical evaluation". Language Testing. 24 (4): 459–488. doi:10.1177/0265532207080767.
  2. McCarthy, Phillip (2005). "An assessment of the range and usefulness of lexical diversity measures and the potential of the measure of textual, lexical diversity (MTLD)". Doctoral Dissertation via Proquest Dissertations and Theses. (UMI No. 3199485).
  3. Lexical diversity and lexical density in speech and writing: A developmental perspective - V Johansson - Working Papers in Linguistics, 2009
  4. Jarvis, Scott (2013). "Capturing the Diversity in Lexical Diversity". Language Learning. 63: 87–106. doi:10.1111/j.1467-9922.2012.00739.x.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.