document_distance Document distance string comparison model to compute a metric about coherency of context in two document