100% CPU load when instantiating simhash.Corpus #2

Closed
jnieuviarts opened this issue Mar 20, 2013 · 6 comments

Comments

@jnieuviarts

@dlecocq (following up on my Twitter message), here are more precise details to help diagnose the problem.

Install steps

Install package:
python setup.py install
/usr/lib/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'dependencies'
warnings.warn(msg)
running install
running build
running build_py
running build_ext
skipping 'simhash/table.cpp' Cython extension (up-to-date)
running install_lib
running install_egg_info
Removing /home/jerry/temp/simhash/lib/python2.7/site-packages/simhash-0.1.0-py2.7.egg-info
Writing /home/jerry/temp/simhash/lib/python2.7/site-packages/simhash-0.1.0-py2.7.egg-info

(I tried an install with and without a virtualenv)

Package list

pip freeze
Cython==0.18
argparse==1.2.1
-e git://github.com/seomoz/simhash-py.git@1e2039d#egg=simhash-dev
wsgiref==0.1.2

Judy installed without any error messages.

@baojie

baojie commented Jun 20, 2014

Same issue here. CPU is at 100% and the program seems to hang:

>>> import simhash
>>> corpus = simhash.Corpus(6, 3)
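To capture the hang without losing the interactive session, the call can be run in a separate interpreter with a timeout. This is only a sketch: `finishes_within` is a hypothetical helper name, not part of simhash-py, and it assumes Python 3.5+ for `subprocess.run` (pass the path of another interpreter as `python` to test an older install).

```python
import subprocess
import sys


def finishes_within(code, timeout_seconds, python=sys.executable):
    """Run `code` in a fresh interpreter (hypothetical helper).

    Returns False if the child is still running after `timeout_seconds`,
    True otherwise. A child that exits with an error also counts as
    "finished", since only the timeout is checked here.
    """
    try:
        # subprocess.run kills the child and raises TimeoutExpired on timeout.
        subprocess.run([python, "-c", code], timeout=timeout_seconds)
    except subprocess.TimeoutExpired:
        return False
    return True
```

For the case above, `finishes_within("import simhash; simhash.Corpus(6, 3)", 10)` would be expected to return False on an affected system.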

@dlecocq
Contributor

dlecocq commented Jun 30, 2014

What OS are you running this on?

@baojie

baojie commented Jul 1, 2014

Ubuntu 13.04, 32-bit

@dlecocq
Contributor

dlecocq commented Jul 1, 2014

I'm able to reproduce this in a vagrant image now. I hope to have a fix soon.

@dlecocq
Contributor

dlecocq commented Jul 1, 2014

I'm sorry to report that until we migrate away from libJudy, we will be unable to support 32-bit systems. A more detailed explanation can be found in the commits in #7.
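For anyone who wants to fail fast instead of hitting the hang, a check along these lines could run before `simhash.Corpus` is instantiated. It is a sketch under assumptions: `pointer_bits` and `require_64_bit` are hypothetical helper names, not part of simhash-py, and the check relies only on the 32-bit limitation described above.

```python
import struct


def pointer_bits():
    """Return the pointer width of the running interpreter, in bits."""
    # struct.calcsize("P") is the size in bytes of a C void* for this build.
    return struct.calcsize("P") * 8


def require_64_bit(bits=None):
    """Raise RuntimeError on a non-64-bit interpreter (hypothetical helper)."""
    bits = pointer_bits() if bits is None else bits
    if bits != 64:
        raise RuntimeError(
            "%d-bit Python detected; simhash.Corpus is reported to hang "
            "on 32-bit systems" % bits
        )
    return bits
```

Calling `require_64_bit()` ahead of `import simhash` turns the silent 100% CPU loop into an immediate error on 32-bit installs.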

@baojie

baojie commented Jul 1, 2014

I see. For the time being, I'm using a pure Python implementation from https://github.com/liangsun/simhash. I hope we can switch back to this faster implementation later. Thank you.
