Speed up views generation #7

benel · 2011-10-07T07:28:04Z

For now, different views have a loop on words (corpus_lexicometrics, document_lexicometrics, kwic, phrase). To improve views generation performance, we could try to emit in the same loop views that have similar keys and reduce functions.

This seems to be the case of phrase and corpus_lexicometrics:

for each word1 {
  get word2 and word3
  if you have word3 {
    emit([word1, word2, word3])
  } else {
    emit(word1)
  }
}

The text was updated successfully, but these errors were encountered:

benel · 2011-10-07T08:20:33Z

Note: This algorithm is just an illustration. There is a more optimized way to do that by getting words only once and remembering the 2 immediate previous words.

benel · 2011-10-10T18:00:18Z

No gain in generation time with the all-in-one view (c844940) compared to corpus_lexicometrics + document_lexicometrics + phrase (e46c11e). :'(

benel closed this as completed Oct 29, 2011

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up views generation #7

Speed up views generation #7

benel commented Oct 7, 2011

benel commented Oct 7, 2011

benel commented Oct 10, 2011

Speed up views generation #7

Speed up views generation #7

Comments

benel commented Oct 7, 2011

benel commented Oct 7, 2011

benel commented Oct 10, 2011