
Reduce artimed extras #172

Merged
merged 27 commits into from
Jul 7, 2016
Conversation

mvdbeek
Collaborator

@mvdbeek mvdbeek commented Jun 15, 2016

Summary of changes:

  • Use startup.sh script for launching ansible and starting up supervisor
  • Strip artimed_extras to the data manager functionality only
  • Simplify get_tool_list_from_galaxy.py script (no need for admin api key anymore)
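The startup flow in the first bullet can be sketched as a minimal entrypoint script. This is a hypothetical sketch, not the startup.sh from this PR; the inventory default, playbook name, and supervisord invocation are assumptions:

```shell
#!/bin/sh
# Hypothetical sketch of a startup.sh entrypoint; the inventory default and
# file names are assumptions, not the actual script from this PR.
set -e
INVENTORY="${1:-inventory_files/artimed}"

# Provision with the requested inventory (guarded so the sketch also runs
# where ansible or the playbook is absent).
if command -v ansible-playbook >/dev/null 2>&1 && [ -f galaxy.yml ]; then
    ansible-playbook -i "$INVENTORY" galaxy.yml -c local
fi

# `exec` replaces this shell, so the supervisor daemon inherits PID 1 and
# receives termination signals directly, which allows graceful stopping of
# the managed processes. A real script would `exec supervisord -n`; the
# echo below is a runnable stand-in.
exec echo "would exec: supervisord -n (inventory: $INVENTORY)"
```

The key point is the final exec: without it, the shell stays as PID 1 and supervisord never sees the termination signal sent on container or VM shutdown.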

@drosofff
Member

We should rename the artimed_extras role to galaxykickstart

@mvdbeek
Collaborator Author

mvdbeek commented Jun 15, 2016

We should rename the artimed_extras role to galaxykickstart

I think we should remove it completely and move the data managers part to a new data managers role (If we decide we want to keep this functionality)

@drosofff
Member

I am OK with completely removing it and adding a data_managers role.

@mvdbeek mvdbeek force-pushed the reduce_artimed_extras branch 3 times, most recently from 1e144ff to 8b9e6b6 on June 17, 2016 11:36
@mvdbeek
Collaborator Author

mvdbeek commented Jun 17, 2016

I propose to create a script folder at the root, with install_tool_shed_tools.py, generate_tool_list_from_ga_workflow_files.py and other scripts to come (I will work on a script that creates a group_vars file, inventory, etc., from a workflow and/or a tool list).

Can you outline this in an issue? I am looking at this as well.

Truncated commit message fragments from this push:

  • …therefore move the action up from roles/galaxy.movedata/tasks/import.yml to roles/set_supervisor_env_vars/tasks/main.yml
  • This startup script takes the inventory to be used as an argument and passes the process on to supervisor as PID 1 (which allows graceful stopping of processes).
  • …to work without an admin API key for Galaxy newer than 16.01.
  • …so that an empty list simply causes the task to be skipped.
  • …and rename artimed_extras to data_managers.
  • update galaxy role and switch back to galaxyproject galaxy-extras role
  • …galaxy_tools_admin_user_password to group_vars
@mvdbeek
Collaborator Author

mvdbeek commented Jun 18, 2016

@drosofff this is ready for review!

@@ -1,9 +1,9 @@
 [submodule "roles/galaxyprojectdotorg.galaxy-tools"]
 	path = roles/galaxyprojectdotorg.galaxy-tools
-	url = https://github.com/galaxyproject/ansible-galaxy-tools.git
+	url = https://github.com/mvdbeek/ansible-galaxy-tools.git
Member

ansible-galaxy-tools is forked in the ARTbio repo. Why not rely on that fork? (If rolling back the changeset revision to the one you are providing is required, that is not a problem for me.)

Collaborator Author

@mvdbeek mvdbeek Jun 19, 2016

Because we're automatically synchronizing this from upstream in a crontab, I'm afraid we may break that synchronization. The changes are already in galaxyproject/ansible-galaxy-tools#31, so as soon as that gets merged we will have them in the ARTbio fork as well (which I consider to be a backup, as discussed in #62).

Member

OK. When you say in #62:

I put this script in my crontab:
https://gist.github.com/mvdbeek/37b77326e6921f963993

with this we are updating our forks every 2 hours.

Where is your crontab running?

Collaborator Author

My iMac in the lab.

Member

Well, I am sure we can do better for the sustainability of the synchronization.

@drosofff
Member

OK, looks nice 👍
I would like to run a couple of installations with the reorganized roles before merging.

@drosofff
Member

I get this error from the ansible-playbook run on the branch:

TASK [galaxyprojectdotorg.galaxy-tools : Install Tool Shed tools] **************
failed: [localhost] => (item=extra-files/artimed/artimed_tool_list.yml) => {"changed": true, "cmd": ["/tmp/venv/bin/python", "install_tool_shed_tools.py", "-t", "artimed_tool_list.yml", "-a", "admin", "-g", "localhost"], "delta": "0:00:00.182402", "end": "2016-06-19 14:49:10.523772", "failed": true, "item": "extra-files/artimed/artimed_tool_list.yml", "rc": 1, "start": "2016-06-19 14:49:10.341370", "stderr": "Traceback (most recent call last):\n  File \"install_tool_shed_tools.py\", line 590, in <module>\n    install_tools(options)\n  File \"install_tool_shed_tools.py\", line 471, in install_tools\n    itl = installed_tool_revisions(gi)  # installed tools list\n  File \"install_tool_shed_tools.py\", line 170, in installed_tool_revisions\n    itl = tsc.get_repositories()\n  File \"/tmp/venv/local/lib/python2.7/site-packages/bioblend/galaxy/toolshed/__init__.py\", line 36, in get_repositories\n    return Client._get(self)\n  File \"/tmp/venv/local/lib/python2.7/site-packages/bioblend/galaxy/client.py\", line 147, in _get\n    raise ConnectionError(msg)\nbioblend.galaxy.client.ConnectionError: GET: error 403: '{\"err_msg\": \"Provided API key is not valid.\", \"err_code\": 403001}', 0 attempts left: None", "stdout": "", "stdout_lines": [], "warnings": []}

Apparently an API key issue.

However, I had to sync/update the submodules that were changed in the branch, and I cannot guarantee that this is not the problem. From my git session:

From https://github.com/mvdbeek/ansible-galaxy-tools
 * [new branch]      install_individual_tools -> origin/install_individual_tools
 + 036bb22...c259caa master     -> origin/master  (forced update)
 * [new branch]      predefined_api_key -> origin/predefined_api_key
 * [new branch]      timeout    -> origin/timeout
Submodule path 'roles/galaxyprojectdotorg.galaxy-tools': checked out 'c259caa75621dadea8280dfa7d06db9df1c122bd'

@mvdbeek
Collaborator Author

mvdbeek commented Jun 19, 2016

Submodule path 'roles/galaxyprojectdotorg.galaxy-tools': checked out 'c259caa75621dadea8280dfa7d06db9df1c122bd'

Yep, this should be 188e7cd136052f1e00efa3d19ffbcd9fe8f29dd5. In the ansible-artimed repo try a git submodule sync && git submodule update.
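For anyone hitting the same checked-out-commit mismatch, the recovery can be demonstrated end to end. Everything below except the two recovery commands is scaffolding with made-up repository names; inside an actual ansible-artimed checkout you would only run git submodule sync and git submodule update:

```shell
# Self-contained demo of the recovery using throwaway repositories.
set -e
demo=$(mktemp -d)

# Toy "role" repository standing in for ansible-galaxy-tools:
git init -q "$demo/role"
git -C "$demo/role" -c user.email=demo@example.org -c user.name=demo \
    commit -q --allow-empty -m "initial commit"

# Toy superproject standing in for ansible-artimed:
git init -q "$demo/playbook"
git -C "$demo/playbook" -c protocol.file.allow=always \
    submodule --quiet add "$demo/role" roles/galaxy-tools
git -C "$demo/playbook" -c user.email=demo@example.org -c user.name=demo \
    commit -q -m "add submodule"

# The recovery itself: re-read submodule URLs from .gitmodules, then check
# out the exact commit recorded in the superproject tree.
git -C "$demo/playbook" submodule --quiet sync
git -C "$demo/playbook" -c protocol.file.allow=always submodule --quiet update --init

# Verify which commit each submodule now sits on:
git -C "$demo/playbook" submodule status
```

The sync step matters here because this branch changed the fork URL in .gitmodules; update alone would keep fetching from the stale URL recorded in .git/config.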

@mvdbeek
Collaborator Author

mvdbeek commented Jun 19, 2016

@mvdbeek please test vagrant up, too.

Works for me, but I ran this on a new machine. I suspect the problem comes from updating docker.

"rmtree failed: [Errno 16] Device or resource busy: '/var/lib/docker/devicemapper'"}
    to retry, use: --limit @galaxy.retry

This is a lockup; can you do a supervisorctl stop docker and see if it passes through?

@drosofff
Member

TASK [data_managers : Run data managers] ***************************************
failed: [localhost] => (item=extra-files/artimed/artimed_data_manager_tasks.yml) => {"changed": true, "cmd": ["/tmp/venv/bin/python", "install_tool_shed_tools.py", "-d", "extra-files/artimed/artimed_data_manager_tasks.yml", "-a", "admin", "-g", "localhost"], "delta": "0:00:00.116701", "end": "2016-06-19 19:10:39.312228", "failed": true, "item": "extra-files/artimed/artimed_data_manager_tasks.yml", "rc": 1, "start": "2016-06-19 19:10:39.195527", "stderr": "Traceback (most recent call last):\n  File \"install_tool_shed_tools.py\", line 654, in <module>\n    run_data_managers(options)\n  File \"install_tool_shed_tools.py\", line 415, in run_data_managers\n    kl = load_input_file(dbkeys_list_file)  # Input file contents\n  File \"install_tool_shed_tools.py\", line 126, in load_input_file\n    with open(tool_list_file, 'r') as f:\nIOError: [Errno 2] No such file or directory: 'extra-files/artimed/artimed_data_manager_tasks.yml'", "stdout": "", "stdout_lines": [], "warnings": []}

RUNNING HANDLER [galaxyprojectdotorg.galaxy : restart galaxy] ******************

RUNNING HANDLER [galaxyprojectdotorg.galaxy : email administrator with changeset id] ***

PLAY RECAP *********************************************************************
localhost                  : ok=156  changed=92   unreachable=0    failed=1

@mvdbeek
Collaborator Author

mvdbeek commented Jun 20, 2016

@drosofff
This should work now (and it has been broken ever since we moved these files to extra-files, if it ever worked). The reason we only notice this now is that I have removed all the run_* variables (like run_data_managers).

@drosofff
Member

It is not:

commit 5822fbc5a645bb0a4310d1051c540955b1dd29e8
Author: Marius van den Beek <m.vandenbeek@gmail.com>
Date:   Mon Jun 20 09:17:31 2016 +0200

    When copying task lists, only pass basename to install_tool_shed_tools.py

ansible-playbook -i inventory_files/artimed galaxy.yml on a fresh IFB instance

TASK [data_managers : Remove data manager task file] ***************************
failed: [localhost] => (item=extra-files/artimed/artimed_data_manager_tasks.yml) => {"failed": true, "item": "extra-files/artimed/artimed_data_manager_tasks.yml", "msg": "rmtree failed: [Errno 2] No such file or directory: '/tmp/ccRRi7Mf.s'"}

RUNNING HANDLER [galaxyprojectdotorg.galaxy : restart galaxy] ******************

RUNNING HANDLER [galaxyprojectdotorg.galaxy : email administrator with changeset id] ***

PLAY RECAP *********************************************************************
localhost                  : ok=158  changed=94   unreachable=0    failed=1

Please test your commits yourself, because I have no time to do it anymore this week.

@mvdbeek
Collaborator Author

mvdbeek commented Jun 20, 2016

Please test your commits yourself cause I have no time to do it anymore this week

I have tested this in vagrant, where it works. If you don't have time to test it this week then don't test it, I will move forward.

@drosofff
Member

drosofff commented Jul 1, 2016

Coming back to this PR after numerous rounds of testing and retesting in vagrant, the IFB cloud, the AWS cloud...

First issue

How does this branch currently diverge from the gcc2016 branch? Are the modifications in the Vagrantfile the only changes? If yes, the question is: shall we merge this branch, or gcc2016?

Second issue

I think that the role ansible-galaxy-tools/tasks/main.yml should contain additional code such as

- include: restart_galaxy.yml
  when: galaxy_tools_install_tools  # this condition is even optional in my opinion

otherwise, the new playbook implies that you have to manually restart Galaxy, which is a regression from the current master. I understand that this comes from a notify statement in another role, whose notification should also be removed if we restart Galaxy in our playbook.
I have tested this additional code and it seems to work.

Third (most important) issue

There is a complex issue (at least for me) with the /tmp directory: probably with the permissions of /tmp, but it could also be its deletion... or its non-deletion.
The fact is that /tmp is implicated in various errors when you play or replay the playbook with different inventory files.

Here is an example:

TASK [galaxyprojectdotorg.galaxy-tools : Create Galaxy bootstrap user] *********
fatal: [localhost]: FAILED! => {"changed": true, "cmd": ["/home/galaxy/galaxy/.venv/bin/python", "manage_bootstrap_user.py", "-c", "/home/galaxy/galaxy/config/galaxy.ini", "create", "-e", "admin@galaxy.org", "-u", "cloud", "-p", "admin", "-a", "admin"], "delta": "0:00:02.506624", "end": "2016-06-29 16:51:12.106424", "failed": true, "rc": 1, "start": "2016-06-29 16:51:09.599800", "stderr": "Traceback (most recent call last):\n  File \"manage_bootstrap_user.py\", line 230, in <module>\n    log = _setup_global_logger()\n  File \"manage_bootstrap_user.py\", line 86, in _setup_global_logger\n    file_handler = logging.FileHandler('/tmp/galaxy_tools_bootstrap_user.log')\n  File \"/usr/lib/python2.7/logging/__init__.py\", line 903, in __init__\n    StreamHandler.__init__(self, self._open())\n  File \"/usr/lib/python2.7/logging/__init__.py\", line 928, in _open\n    stream = open(self.baseFilename, self.mode)\nIOError: [Errno 13] Permission denied: '/tmp/galaxy_tools_bootstrap_user.log'", "stdout": "", "stdout_lines": [], "warnings": []}
    to retry, use: --limit @galaxy.retry

PLAY RECAP *********************************************************************
localhost                  : ok=120  changed=18   unreachable=0    failed=1

But it can also happen that restarting supervisorctl (galaxy:uwsgi) fails due to the absence of /tmp (probably deleted in a previous playbook round). You can recover just by running mkdir /tmp && chmod 777 /tmp.

And last, but not least, I finally figured out why the installation of the deseq2 package systematically fails with the new simplified playbook:
it is precisely the absence of /tmp, and/or overly restrictive access rights if it already exists.
From the Galaxy admin panel, the repair of this tool (which includes reinstalling libxml) won't work until you manually run mkdir /tmp && chmod 777 /tmp.
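A small aside on the mkdir /tmp && chmod 777 /tmp workaround: the conventional mode for /tmp is 1777 (world-writable plus the sticky bit), so chmod 1777 is slightly safer. A runnable sketch, demonstrated on a scratch directory rather than the real /tmp:

```shell
# Recreate a deleted /tmp with conventional permissions. Demonstrated on a
# scratch directory here; on the affected machine the target would be /tmp.
target=$(mktemp -d)/tmp   # stand-in for /tmp in this sketch
mkdir -p "$target"
# 1777 = rwx for everyone plus the sticky bit, so users can only delete
# their own files; plain 777 would let any user delete anyone's files.
chmod 1777 "$target"
ls -ld "$target"
```

With plain 777, any process (including the galaxy user's tool installs) could delete another user's temporary files, which is one plausible source of the flaky behavior described above.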

In summary, the behavior of the /tmp directory over the course of the playbook run is not clear to me, because I understand it can be manipulated by several submodules, including our galaxy-tools submodule. But I feel this important /tmp handling is still a bit wobbly (not well automated yet).

Fourth issue

The data_managers role is not crystal clear to me either. Is it really an important feature, or just a leftover from the previous playbook?


Finally, I would very much like to merge this PR (or a PR from gcc2016 if equivalent) into master, to move forward, but without regressions in the automation. As Bjorn said, we are working for usability, not for geeks.

file: dest={{ galaxy_tools_base_dir }}/install_tool_shed_tools.py state=absent

- name: Remove data manager task file
file: src={{ item }} dest={{ galaxy_tools_base_dir }}/ state=absent
Member

should be

  file: dest={{ galaxy_tools_base_dir }}/{{ item|basename }} state=absent

otherwise, /tmp is deleted and replaying the playbook fails
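To make the effect of the |basename filter concrete, here is a small shell illustration; the paths come from the failure logs above, and treating galaxy_tools_base_dir as /tmp is an assumption based on this thread:

```shell
base=/tmp                  # assumed value of galaxy_tools_base_dir
item=extra-files/artimed/artimed_data_manager_tasks.yml

# Buggy task: dest={{ galaxy_tools_base_dir }}/ resolves to the base
# directory itself, so state=absent removes /tmp recursively.
echo "buggy dest: ${base}/"

# Fixed task: dest={{ galaxy_tools_base_dir }}/{{ item|basename }} targets
# only the file that was copied into the base directory.
echo "fixed dest: ${base}/$(basename "$item")"
```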

Member

That said, the data_managers role seems obsolete.

Collaborator Author

Absolutely, I agree.

@mvdbeek
Collaborator Author

mvdbeek commented Jul 4, 2016

First issue

How does this branch currently diverge from the gcc2016 branch? Are the modifications in the Vagrantfile the only changes? If yes, the question is: shall we merge this branch, or gcc2016?

Yes, those are the only changes... I wanted to demo ansible without the automatic provisioning that vagrant up does. I would prefer to merge the gcc2016 branch, but ultimately I don't think this is important.

Second issue

I think that the role ansible-galaxy-tools/tasks/main.yml should contain additional code such as:

  • include: restart_galaxy.yml
    when: galaxy_tools_install_tools # this condition is even optional in my opinion

otherwise, the new playbook implies that you have to manually restart Galaxy, which is a regression from the current master. I understand that this comes from a notify statement in another role, whose notification should also be removed if we restart Galaxy in our playbook.
I have tested this additional code and it seems to work.

I am intentionally removing these things, as they should be done once and only once when the play has finished, from inside the play, not the role. (The role should only notify of a necessary restart, while the play implements the restart; I'll add this before merging the PR.) All this restarting unnecessarily slows down the playbook, which in turn limits the amount of testing we can do in Travis.

For the third issue, it comes down to #172 (comment), which should solve most of these problems. The underlying problem is that the tool installation script is copied and removed, which doesn't really make sense. It should become part of ephemeris, and then we just install ephemeris.

@@ -1,6 +1,9 @@
- name: start galaxy
supervisorctl: name='galaxy:' state=started

- name: restart galaxy
supervisorctl: name='galaxy:' state=restarted

- name: restart galaxy handler
Member

Is this one never just "start"?

@mvdbeek mvdbeek force-pushed the reduce_artimed_extras branch 2 times, most recently from 38e21e7 to d3cd54b on July 5, 2016 18:29
@@ -1 +1 @@
-Subproject commit fda11a2e1fe72c5a425079fcf50369f318287217
+Subproject commit 48750d41f75de22dbd4059ae72fee2e7f6f4673e
Member

Shouldn't this be updated to 34ec0ce959c8b4a2f95c1c7f19104cffbd73696e?

@drosofff drosofff merged commit f95df15 into master Jul 7, 2016
@drosofff drosofff deleted the reduce_artimed_extras branch June 8, 2017 09:16