Get line text metrics from a character vector
Given a character vector, return a data.frame of line-level text-mining metrics. Each row's values describe the corresponding element of the vector provided by the user.
Value
A data.frame whose rows are in the same order as the character vector provided by the user, giving for each element of the vector:
n_char (character): Number of characters, including spaces, in a line.
n_char_wo_space (character): Number of characters, excluding spaces, in a line.
word_count_colname (character): Number of words in a line.
vowel_count_colname (character): Number of vowels in a line.
Examples
if (FALSE) { # \dontrun{
# Example 1: Compute text metrics for two lines of text
text <- c("the lazy lazy lazy rat terrier jump over the quick brown fox",
          "cause lazy rat terrier are the best dog")
compute_nchar_metrics(text)
} # }
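
To make the four metrics described under Value concrete, here is a minimal base R sketch of how they could be approximated. It is not the package's implementation: `sketch_nchar_metrics()` is a hypothetical helper, and it assumes that words are separated by whitespace and that vowels are the letters a, e, i, o, u (upper or lower case). It also returns integer columns, whereas the documentation above lists the columns as character.

```r
# Hypothetical helper, NOT the package's implementation: approximates the
# four documented metrics using base R only.
sketch_nchar_metrics <- function(text) {
  stopifnot(is.character(text))

  # Assumption: words are separated by runs of whitespace.
  # Empty strings produced by leading whitespace are dropped.
  words <- lapply(strsplit(text, "[[:space:]]+"), function(w) w[nzchar(w)])

  data.frame(
    # Characters including spaces
    n_char = nchar(text),
    # Characters after removing all whitespace
    n_char_wo_space = nchar(gsub("[[:space:]]", "", text)),
    # Number of whitespace-separated words
    word_count_colname = lengths(words),
    # Assumption: vowels are a, e, i, o, u (case-insensitive)
    vowel_count_colname = nchar(gsub("[^aeiouAEIOU]", "", text)),
    stringsAsFactors = FALSE
  )
}

text <- c("the lazy lazy lazy rat terrier jump over the quick brown fox",
          "cause lazy rat terrier are the best dog")
sketch_nchar_metrics(text)
```

The result has one row per input element, in input order, which mirrors the ordering guarantee stated under Value. The package's own definitions of a word or a vowel may differ from the assumptions made here.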