Skip to contents

Based on a lines to lines Levenchtein distance (% of similarity) between lines. Assuming lines are from several files. Used for adding stats to corpus (methods_stats)

Usage

compute_similarity_network(df, group_col, text_col)

Arguments

df

data.frame - A data.frame with a vector of char and a group col

group_col

character - Name of the group vector

text_col

character - Name of the text column vector, i.e. the vector used for compute similarity