Parallel backend is broken for doParallel >1.0.6 #7
Thanks for reporting this.
I recently hit the same error when running NMF on Linux, but not on Windows. You indicated above that the problem has been fixed. Could you please be more explicit and give an example? Here is my code:
Here is my system information: other attached packages: I would appreciate your help fixing the parallel run issue on Linux. Thanks. Regards,
Have you tried using the latest version on CRAN (0.20.5)?
library(NMF)
x <- rmatrix(100, 20)
res <- nmf(x, rank=4, method="ns", theta=0.7, seed = 123456, nrun=8, .opt = "vP4")
Results:
I was finally able to run the model with the recent NMF 0.20.5 using multiple cores on Linux, without the earlier error. Thank you for responding.
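For anyone else checking whether they are on the suggested release before retrying, here is a minimal sketch in base R. The helper name `has_min_version` is hypothetical (it is not part of NMF); the only version number used is the 0.20.5 mentioned above.

```r
# Hypothetical helper, not part of NMF: TRUE if `pkg` is installed
# at `min_version` or later, FALSE otherwise (including "not installed").
has_min_version <- function(pkg, min_version) {
  # system.file() returns "" when the package is not installed
  if (!nzchar(system.file(package = pkg))) {
    return(FALSE)
  }
  utils::packageVersion(pkg) >= min_version
}

if (!has_min_version("NMF", "0.20.5")) {
  message("NMF is missing or older than 0.20.5; consider updating from CRAN.")
}
```

This only reports the installed version; it does not install or update anything.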
My model ran successfully with nrun=4 but, on the same data set, stopped just before finishing with 50 runs. Here is the successful run case:
Failed run: My two attempts yielded the following messages: How do I debug the cause of this failure? Note that I ran this with 45 million rows and 15 variables. Memory did not seem to be an issue. I would appreciate your suggestions. Thanks,
The debug option does not generate useful information for identifying the cause of the failure:
Running rollback clean up ...
Restoring RNG settings ...
RNG kind: Mersenne-Twister / Inversion
RNG state: 403L, 7L, ..., -1289165921L [4de1642ab154e963c6ea7ef488e195d8]
OK
Restoring NMF options ... OK
Restoring previous foreach backend '' ... OK
Deleting temporary directory '/XXXX/XXXXX/XXXXX/XXXXX/NMF_1d015bc581a' ... OK
Timing stopped at: 2698.415 345.833 549.012
I recently ran the same model with nrun=50 using 4 cores in batch mode, and it completed successfully! I am now experimenting with 16 cores and will update you as soon as I get results. At this point, I do not think there is a bug in the NMF source code. Here are a few lines from the log of my 50th run:
Processing partial results ... OK
NMF computation exit status ... OK
Running normal exit clean up ...
Restoring NMF options ... OK
Restoring previous foreach backend '' ... OK
Updating RNG settings ... OK
RNG kind: Mersenne-Twister / Inversion
RNG state: 403L, 1L, ..., 425501564L [c7f400f3798e6384ca89b63934b32173]
Deleting temporary directory '/home/XXXXXX/XXXXXX/XXXXX/NMF_53456e0662dc' ... OK
446964.134 8829.809 156572.488
Thanks,
I ran a model with nrun=100, but it failed to complete the compilation after finishing all the runs. Please give me your email address so I can send you my debug log for analysis. I tried twice with different algorithms, and both gave the same error at the end.
doParallel versions > 1.0.6 fail to load as the parallel backend.
The error message refers to this line:
https://github.com/renozao/NMF/blob/master/R/parallel.R#L316
The internal variable doParallel:::info was removed in version 1.0.7.
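To check whether a given installation is affected, the following sketch reports the installed doParallel version and whether its namespace still contains an internal object named `info`. The helper `doparallel_status` is hypothetical (not part of NMF or doParallel), and it only inspects the namespace; it does not reproduce the failing NMF code path.

```r
# Hypothetical diagnostic helper; not part of NMF or doParallel.
doparallel_status <- function() {
  # system.file() returns "" when the package is not installed
  if (!nzchar(system.file(package = "doParallel"))) {
    return(list(installed = FALSE, version = NA_character_, has_info = NA))
  }
  # Load the namespace without attaching the package to the search path
  ns <- loadNamespace("doParallel")
  list(
    installed = TRUE,
    version   = as.character(utils::packageVersion("doParallel")),
    # inherits = FALSE restricts the lookup to doParallel's own namespace
    has_info  = exists("info", envir = ns, inherits = FALSE)
  )
}

str(doparallel_status())
```

If `has_info` is `FALSE`, code that accesses `doParallel:::info` would be expected to fail, which matches the report above for versions after 1.0.6.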