Hello,
I'm running dask on a PBSCluster, and was getting "Unknown job_id" errors when workers attach to the scheduler. It doesn't seem to be causing any problems (jobs still process ok), but wanted to see how I could go about getting rid of these messages. I turned on debugging and get this:
DEBUG:job id for new worker: 11202854.server.address.domain
DEBUG:11202854.server.address.domain is a new job or restarting worker
ERROR:Unknown job_id: 11202854.server.address.domain for worker tcp://ip-address:37826
(actual addresses replaced by "server.address.domain", actual ip address replaced by ip-address)
What's happening here, and how can I stop the "Unknown job_id" errors? Thanks!
Note: I think it might have to do with the regex of the job_id? Earlier in the log I get:
DEBUG:started job: 11202854
So I think perhaps it's looking for "11202854.server.address.domain" when it should be looking for just "11202854"
Hello,
I'm running dask on a PBSCluster, and was getting "Unknown job_id" errors when workers attach to the scheduler. It doesn't seem to be causing any problems (jobs still process ok), but wanted to see how I could go about getting rid of these messages. I turned on debugging and get this:
(actual addresses replaced by "server.address.domain", actual ip address replaced by ip-address)
What's happening here, and how can I stop the "Unknown job_id" errors? Thanks!
Note: I think it might have to do with the regex of the job_id? Earlier in the log I get:
So I think perhaps it's looking for "11202854.server.address.domain" when it should be looking for just "11202854"