You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The workload is using Naive Bayes to classify spam. The feature table is 600 x 262144 sparse table.
If I used a small table such as 16 x 262144, it can pass.
I am not sure if I can save DAL sparse numeric table to file. If that works, I can provide dataset.
Expected behavior
Not hang or print error message
Output/Screenshots
Features row x column: 150 x 262144
oneDAL (native): Number of CPU threads used: 1
oneDAL (native): training model with fastCSR method
NaiveBayes (native): start local step compute ... should print something here if localAlgorithm.compute is finished. but the process hang forever.
The following is the correct output if I am using another small local table (row x column: 16 x 262144)
Features row x column: 16 x 262144
oneDAL (native): Number of CPU threads used: 1
oneDAL (native): training model with fastCSR method
NaiveBayes (native): start local step compute
local step compute finished
NaiveBayes (native): local step compute took 0.022 secs
NaiveBayes (native): start ccl::gather
NaiveBayes (native): ccl::gather took 0.228 secs
NaiveBayes (native): start master step compute
NaiveBayes (native): master step compute took 0.041 secs
oneDAL (native): training model finished
training took 1.058 secs
NaiveBayesDAL compute took 1.063481252 secs
NaiveBayesDAL result conversion took 0.109307419 secs
@lordoz234 Thanks for the work. We found the root cause is from our sparse data conversion code. We will let you know if there is additional problem. The issue is closed.
Describe the bug
Each local table is row x column 150x 262144 sparse table, hang in this line:
https://github.com/oap-project/oap-mllib/blob/master/mllib-dal/src/main/native/NaiveBayesDALImpl.cpp#L46
To Reproduce
Steps to reproduce the behavior:
The workload is using Naive Bayes to classify spam. The feature table is 600 x 262144 sparse table.
If I used a small table such as 16 x 262144, it can pass.
I am not sure if I can save DAL sparse numeric table to file. If that works, I can provide dataset.
Expected behavior
Not hang or print error message
Output/Screenshots
Features row x column: 150 x 262144
oneDAL (native): Number of CPU threads used: 1
oneDAL (native): training model with fastCSR method
NaiveBayes (native): start local step compute
... should print something here if localAlgorithm.compute is finished. but the process hang forever.
The following is the correct output if I am using another small local table (row x column: 16 x 262144)
Features row x column: 16 x 262144
oneDAL (native): Number of CPU threads used: 1
oneDAL (native): training model with fastCSR method
NaiveBayes (native): start local step compute
local step compute finished
NaiveBayes (native): local step compute took 0.022 secs
NaiveBayes (native): start ccl::gather
NaiveBayes (native): ccl::gather took 0.228 secs
NaiveBayes (native): start master step compute
NaiveBayes (native): master step compute took 0.041 secs
oneDAL (native): training model finished
training took 1.058 secs
NaiveBayesDAL compute took 1.063481252 secs
NaiveBayesDAL result conversion took 0.109307419 secs
Environment:
The text was updated successfully, but these errors were encountered: