Compute a similarity network — compute_similarity

Based on a lines to lines Levenchtein distance (% of similarity) between lines. Assuming lines are from several files. Used for adding stats to corpus (methods_stats)

Usage

compute_similarity_network(df, group_col, text_col)

Arguments

df: data.frame - A data.frame with a vector of char and a group col
group_col: character - Name of the group vector
text_col: character - Name of the text column vector, i.e. the vector used for compute similarity