This measure not only improves the parametric properties of the WF values, because it is a logarithmic transformation, but it also offers a scale from 0 to 7 with a more natural and straightforward interpreta- tion. Zipf values were computed for all the corpora consid- ered in the following ...
Following the previous SUBTLEX databases, we collected data for word frequency and contextual diversity. Word frequency (WF) has traditionally been expressed as the number of occurrences per million words in the corpus (relative frequency). Over the years, it has been increasingly common to use the...