[nlp] How to compute the similarity between two text documents?
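
One common baseline for this is cosine similarity over bag-of-words vectors. The sketch below is an assumption about what "similarity" should mean here, not the only possible answer. The names `tokenize` and `cosine_similarity` are illustrative, and the regex tokenizer is deliberately simplistic. It uses only the Python standard library.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep runs of ASCII letters/digits; a deliberately simple tokenizer.
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(doc_a, doc_b):
    # Represent each document as a bag-of-words term-count vector.
    counts_a = Counter(tokenize(doc_a))
    counts_b = Counter(tokenize(doc_b))
    # Dot product over the shared vocabulary only (all other terms contribute 0).
    shared = counts_a.keys() & counts_b.keys()
    dot = sum(counts_a[t] * counts_b[t] for t in shared)
    norm_a = math.sqrt(sum(c * c for c in counts_a.values()))
    norm_b = math.sqrt(sum(c * c for c in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document has no direction to compare
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    print(cosine_similarity("The cat sat on the mat.", "A cat sat on a mat."))
```

The score ranges from 0 (no shared terms) to 1 (identical term distribution). Raw counts let frequent words such as "the" dominate, so TF-IDF weighting or sentence embeddings are the usual refinements when word overlap alone is too crude.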