
Caching Resource #23

Closed
pratikpoddar opened this issue May 11, 2014 · 1 comment

Comments

@pratikpoddar

Running

import langid
langid.classify("India is a country") ## statement number 1

takes a long time at "statement number 1".

but,

import langid
langid.classify("I like cricket")
langid.classify("India is a country") ## statement number 2

runs quickly at "statement number 2".

So, does langid.classify cache some information? Can we control that? Thanks
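
For reference, a minimal way to reproduce the timing difference (the timing code below is illustrative, not part of the original report):

import time
import langid

start = time.time()
langid.classify("I like cricket")       # first call: slow
print("first call:  %.2f s" % (time.time() - start))

start = time.time()
langid.classify("India is a country")   # second call: fast
print("second call: %.4f s" % (time.time() - start))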

@saffsd
Owner

saffsd commented May 11, 2014

The module-level function 'classify' is just a convenience wrapper around a LanguageIdentifier instance: it checks whether a global instance exists and, if not, unpacks one, which is why the first call takes time. If you want to trigger the unpacking yourself, call langid.load_model() first. This loads the model into memory, where it can then be accessed as langid.identifier. No other caching occurs.
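
For example, a minimal sketch based on the above (the print calls and comments are just illustrative):

import langid

# Pay the one-time unpacking cost at startup rather than on the first classify call.
langid.load_model()

# The loaded model now lives in the module-level langid.identifier and is
# reused by every subsequent call; no other caching happens.
print(langid.classify("India is a country"))   # fast, model already in memory
print(langid.classify("I like cricket"))       # also fast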

@saffsd closed this as completed May 11, 2014