Skip to content
This repository has been archived by the owner on May 24, 2018. It is now read-only.

errors when running kmeans algorithm #4

Closed
SiNZeRo opened this issue May 21, 2015 · 6 comments
Closed

errors when running kmeans algorithm #4

SiNZeRo opened this issue May 21, 2015 · 6 comments

Comments

@SiNZeRo
Copy link

SiNZeRo commented May 21, 2015

I run this code in windows, rabit.lib is complied with VS2010.
Program output 'Socket Connect Error:No error', could you suggest a possible way to solve this?
Does this problem cause by socket?

python ../tracker/dmlc_local.py -n 3 kmeans.exe ../data/data.svm 10 10 .output
2015-05-21 21:29:52,278 INFO start listen on 137.189.56.119:9091
2015-05-21 21:29:52,509 INFO @tracker All of 3 nodes getting started
[21:29:52] d:\data\wt\tools\dmlc-core\src\data/basic_row_[i2t1e:r2.9h::572[9]2: 1 d:f:2i\9nd:ia5st2ha] \ rwdet:at\didiaanntg\a t\aowtot lt5si\4ad.
nm5\l4t5co5-o clMosBr\/eds\meslcrc
c-\cdoartea\/sbracs\idca_trao/wb_aistiecr_.rho:w7_9i:t efri.nhi:s7h9 :r efaidniinsgh  arte a4d6i.n1g5 3a8t  MB5/4s.e5c4
55 MB/sec
Socket Connect Error:No error
Socket Connect Error:No error
Socket Connect Error:No error
Exception in thread Thread-2:
Traceback (most recent call last):
  File "C:\Users\wt\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.3.0.1715.win-x86_64\lib\threading.py", line 810, in __bootstrap_inner
    self.run()
  File "C:\Users\wt\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.3.0.1715.win-x86_64\lib\threading.py", line 763, in run
    self.__target(*self.__args, **self.__kwargs)
  File "../tracker/dmlc_local.py", line 69, in exec_cmd
    os.exit(-1)
AttributeError: 'module' object has no attribute 'exit'

Exception in thread Thread-3:
Traceback (most recent call last):
  File "C:\Users\wt\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.3.0.1715.win-x86_64\lib\threading.py", line 810, in __bootstrap_inner
    self.run()
  File "C:\Users\wt\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.3.0.1715.win-x86_64\lib\threading.py", line 763, in run
    self.__target(*self.__args, **self.__kwargs)
  File "../tracker/dmlc_local.py", line 69, in exec_cmd
    os.exit(-1)
AttributeError: 'module' object has no attribute 'exit'

Exception in thread Thread-4:
Traceback (most recent call last):
  File "C:\Users\wt\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.3.0.1715.win-x86_64\lib\threading.py", line 810, in __bootstrap_inner
    self.run()
  File "C:\Users\wt\AppData\Local\Enthought\Canopy\App\appdata\canopy-1.3.0.1715.win-x86_64\lib\threading.py", line 763, in run
    self.__target(*self.__args, **self.__kwargs)
  File "../tracker/dmlc_local.py", line 69, in exec_cmd
    os.exit(-1)
AttributeError: 'module' object has no attribute 'exit'
@tqchen
Copy link
Member

tqchen commented May 21, 2015

seems to be the problem of windows port of rabit. Can you try run the basic examples of rabit in rabit repo, and see if the problem occurs again?

@SiNZeRo
Copy link
Author

SiNZeRo commented May 22, 2015

@tqchen Hi, thanks for your reply.
There is not error when running cc files in /guide.
For kmeans, when run with -n 1 or 2, it runs without error. Also, when num_cluster=1 and n=1,2,3,4, it runs without error.
I also try to set hostip as 127.0.0.1, but it does not work.

@tqchen
Copy link
Member

tqchen commented May 28, 2015

@SiNZeRo There was a platform dependent issue that I resolved recently, mainly because I tested on linux and there are some difference between window's and linux's socket, can you try again if the thing work for you now? Thanks

@SiNZeRo
Copy link
Author

SiNZeRo commented May 29, 2015

@tqchen hi, it works for k-means example. thanks for help.

@tqchen tqchen closed this as completed May 29, 2015
@SiNZeRo
Copy link
Author

SiNZeRo commented Jun 2, 2015

hi, @tqchen , I run kmeans with 40 clusters using 20 local process, which provides 8x speed up over single process.
Is this result reasonable? Could you suggest me some settings to understanding the capability of rabit.

Thanks a lot!

@tqchen
Copy link
Member

tqchen commented Jun 2, 2015

Hi @SiNZeRo it depends on filesize, communication vs computation etc. So you might want to run some timing experiment to check where the bottleneck are

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants