Internal functions used in textstat_lexdiv(), for computing
lexical diversity measures on dfms or tokens objects
compute_lexdiv_dfm_stats(x, measure = NULL, log.base = 10)

compute_lexdiv_tokens_stats(
  x,
  measure = c("MATTR", "MSTTR"),
  MATTR_window,
  MSTTR_segment
)

| x | a dfm object (for compute_lexdiv_dfm_stats) or a tokens object (for compute_lexdiv_tokens_stats) |
|---|---|
| measure | a list of lexical diversity measures. |
| log.base | a numeric value defining the base of the logarithm (for measures using logs) |
| MATTR_window | a numeric value defining the size of the moving window for computation of the Moving-Average Type-Token Ratio (Covington & McFall, 2010) |
| MSTTR_segment | a numeric value defining the size of each segment for the computation of the Mean Segmental Type-Token Ratio (Johnson, 1944) |
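
To make the roles of MATTR_window and MSTTR_segment concrete, here is a minimal base R sketch of the two token-based measures for a single document. The helper names mattr_sketch and msttr_sketch are hypothetical and are not part of the package; the sketch also assumes that trailing tokens that do not fill a complete segment are dropped, which may differ from what the internal functions do.

```r
# Hypothetical helpers for illustration only; not the package's implementation.

# Moving-Average TTR: average the type-token ratio over every
# overlapping window of `window` consecutive tokens.
mattr_sketch <- function(toks, window) {
  stopifnot(window >= 1, window <= length(toks))
  starts <- seq_len(length(toks) - window + 1)
  ttrs <- vapply(starts, function(i) {
    length(unique(toks[i:(i + window - 1)])) / window
  }, numeric(1))
  mean(ttrs)
}

# Mean Segmental TTR: average the type-token ratio over consecutive,
# non-overlapping segments of `segment` tokens.
# Assumption: a trailing partial segment is ignored.
msttr_sketch <- function(toks, segment) {
  stopifnot(segment >= 1, segment <= length(toks))
  n_full <- length(toks) %/% segment
  ttrs <- vapply(seq_len(n_full), function(k) {
    idx <- ((k - 1) * segment + 1):(k * segment)
    length(unique(toks[idx])) / segment
  }, numeric(1))
  mean(ttrs)
}

toks <- c("a", "b", "a", "c", "a", "b")
mattr_value <- mattr_sketch(toks, window = 3)   # four overlapping windows
msttr_value <- msttr_sketch(toks, segment = 3)  # two full segments
```

The only difference between the two is how the token sequence is cut up: MATTR slides the window one token at a time, while MSTTR steps forward by a whole segment.
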
a data.frame with a document column containing the
input document name, followed by columns with the lexical diversity
statistic, in the order in which they were supplied as the measure
argument.
compute_lexdiv_dfm_stats is an internal function that
computes the lexical diversity measures from a dfm input.
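
As a rough illustration of count-based measures and of the return shape described above, the sketch below derives two well-known statistics from a plain count matrix standing in for a dfm: the type-token ratio (V / N) and Herdan's C (log V / log N). The matrix, the variable names, and the choice of measures are assumptions for the example, not the internal code.

```r
# Illustrative stand-in for a dfm: rows are documents, columns are types.
counts <- matrix(
  c(2, 1, 0,
    2, 1, 1),
  nrow = 2, byrow = TRUE,
  dimnames = list(c("doc1", "doc2"), c("a", "b", "c"))
)

n_tokens <- unname(rowSums(counts))      # N: total tokens per document
n_types  <- unname(rowSums(counts > 0))  # V: distinct types per document
log.base <- 10                           # mirrors the log.base argument

# One row per document, one column per requested measure.
result <- data.frame(
  document = rownames(counts),
  TTR = n_types / n_tokens,
  C = log(n_types, base = log.base) / log(n_tokens, base = log.base)
)
```

Because C is a ratio of two logarithms, its value does not depend on the base chosen; other log-based measures would change with log.base.
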
compute_lexdiv_tokens_stats is an internal function that
computes the lexical diversity measures from a tokens input.