-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
daemon stopping without reporting errors #158
Comments
Are there any files of the form |
Hi. Yes, this is the content of the corresponding /tmp/sdt_stacktrace_*.log Trace function called from '/fas_c/UTENTI/franco/miniconda/miniconda2/envs/franco-synda-environment/lib/python2.7/site-packages/synda-3.11-py2.7.egg-info/scripts/sddaemon.py' file in 'start' function at line 118 I am quite new to synda, Any suggestions? |
failed_url is a table which newly has to be in the database.
Now I realize that we need an automated way to update the database thus, or to function without it, or at least issue a warning when it isn't there. I will work on that. |
BTW, the table "failed_url" is needed so that if a data node fails to supply data, Synda can go try another data node. |
I've just tried that but when I launch the daemon again I still get the same error as before: |
I'm sorry, I gave the wrong sqlite command. It should be |
Now it worked.Thanks a lot. |
I’m looking for how this could happen. Was it a brand new database, nothing in the file table?
Jeff
From: Rafael Abreu <notifications@github.com>
Sent: Wednesday, October 21, 2020 8:55 AM
To: Prodiguer/synda <synda@noreply.github.com>
Cc: Painter, Jeff <painter1@llnl.gov>; Mention <mention@noreply.github.com>
Subject: Re: [Prodiguer/synda] daemon stopping without reporting errors (#158)
I am having the same problem. Used the command provided by @painter1<https://github.com/painter1> but after that, another problem seems to occur:
============= Trace function called from '/vol0/rafaleca/miniconda3/envs/synda-env/lib/python2.7/site-packages/synda-3.12-py2.7.egg-info/scripts/sddaemon.py' file in 'start' function at line 118 Exception occured at 2020-10-21 09:59:03.228342 Traceback (most recent call last): File "/vol0/rafaleca/miniconda3/envs/synda-env/lib/python2.7/site-packages/synda-3.12-py2.7.egg-info/scripts/sddaemon.py", line 115, in start main_loop() File "/vol0/rafaleca/miniconda3/envs/synda-env/lib/python2.7/site-packages/synda-3.12-py2.7.egg-info/scripts/sddaemon.py", line 66, in main_loop sdtaskscheduler.event_loop() File "/vol0/rafaleca/miniconda3/envs/synda-env/lib/python2.7/site-packages/synda-3.12-py2.7.egg-info/scripts/sdtaskscheduler.py", line 166, in event_loop sdfiledao.highest_waiting_priority( True, True ) #initializes cache of max priorities File "/vol0/rafaleca/miniconda3/envs/synda-env/lib/python2.7/site-packages/synda-3.12-py2.7.egg-info/scripts/sdfiledao.py", line 228, in highest_waiting_priority return (highest_waiting_priority.vals).get(data_nodes[0],None) IndexError: list index out of range
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub<#158 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AAVLQMJIDYNXX3T4MUKKRNLSL373VANCNFSM4SCSH7PA>.
|
This error involving sdfiledao.py line 228 (list index out of range) can be bypassed by editing sdconst.py to set GET_FILES_CACHING to False. However, if you have a large database and are simultaneously downloading from several data nodes, you will take a performance hit. |
@francocatalano and Rafael Abreu: the problem involving failed_urls was supposedly fixed about a year and a half ago. There was about a one-month window in which this problem was clearly possible - although I can't certainly exclude the possibility of a bug in that fix. So I have some questions:
Thank you! |
Thanks for the update @painter1. I deleted my original comment because I was able to get it working by running the daemon after running As for the versions, I am using synda version 3.12 installed with conda and sqlite3 version 3.33.0. |
@rafaelcabreu, the database itself has a version number, different from the sqlite3 version number. Would you please check that? |
After updating synda to v3.11 I got into the following problem.
I define a selection file to get some datasets from CMIP6, like this:
project=CMIP6
source_id=IPSL-CM6A-LR
experiment_id=historical
member_id=r1i1p1f1
table_id=Amon
frequency=mon
variable_id=tas psl pr hfls hfss evspsbl rlds rlus rsds rsus prw
The search command seems to work properly:
synda search -s selection/my_test_selfile.txt
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.hfss.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.tas.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.rlus.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.pr.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.rsus.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.psl.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.rlds.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.evspsbl.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.rsds.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.hfls.gr.v20180803
new CMIP6.CMIP.IPSL.IPSL-CM6A-LR.historical.r1i1p1f1.Amon.prw.gr.v20180803
Then I launch install:
synda install -s selection/my_test_selfile.txt
11 file(s) will be added to the download queue.
Once downloaded, 1.3 GB of additional disk space will be used.
Do you want to continue? [Y/n] Y
11 file(s) enqueued
You can follow the download using 'synda watch' and 'synda queue' commands
The daemon is not running. To start it, use 'synda daemon start'.
But when I launch the daemon it starts and stops immediately without apparently giving any error:
synda daemon start
Handing over to daemon process, you can check the daemons logs at /sgi_specs/a/.synda/log/transfer.log
synda watch
Daemon not running
and the transfer log file reports only the following info:
2020-10-03 12:18:38,583 INFO SDDAEMON-001 Daemon starting ...
INFO: Connected to /sgi_specs/a/.synda/db/sdt.db
2020-10-03 12:18:38,584 INFO SDTSCHED-533 Connected to /sgi_specs/a/.synda/db/sdt.db
2020-10-03 12:18:38,584 INFO SDTSCHED-993 Starting watchdog..
2020-10-03 12:18:38,585 INFO SDFILDAO-200 get_files time is 0.000953, search select * from file where status=:status ORDER BY priority DESC, checksum with {'status': 'running'}
Daemon successfully started
Anyone knows what's going on?
Thanks a lot for your help.
The text was updated successfully, but these errors were encountered: