Ansible Playbook for BMI Installation #153

djfinn14 · 2017-10-27T17:52:09Z

No description provided.

naved001

mostly looks good. @chemistry-sourabh will do a comprehensive review after he learns ansible.

naved001 · 2017-10-27T17:55:44Z

scripts/install/production/README.md

+      $ sudo apt-get install software-properties-common  
+      $ sudo apt-add-repository ppa:ansible/ansible  
+      $ sudo apt-get update  
+      $ sudo apt-get install ansible  


if you put these commands in a code block and remove the dollar sign, we could just copy-paste the all commands into a terminal.

command 1 command 2

naved001 · 2017-10-27T18:04:50Z

scripts/install/production/roles/tgt/tasks/main.yml

+    protocol: tcp
+    match: tcp
+    destination_port: 3260
+    jump: ACCEPT


This rule wouldn't persist through a reboot. We should document this somewhere. For centos 7 and up, firewalld is what's used to manage iptables rules; there's no iptables service that can save the rules (it can be installed separately, but it didn't save the configuration for me :/)

is there a way to setup these rules using firewalld here?

https://www.rootusers.com/how-to-open-a-port-in-centos-7-with-firewalld/

I removed iptables and added firewalld

naved001 · 2017-10-27T18:09:25Z

scripts/install/production/roles/tgt/tasks/main.yml

+  become: true
+  when: ansible_distribution == 'CentOS' or ansible_distribution == 'Red Hat Enterprise Linux'
+
+- name: Change SELinux to permissive for CentOS


This wouldn't be permanent. Do we only want to set this during the installation? to make it permanent we need to edit some file and save it.

From my testing, when using the selinux task in ansible it actually does change the file and it persists across reboots.

apoorvemohan · 2017-10-29T17:33:18Z

@pgrosu Could you please review this one?

radonm · 2017-11-02T17:34:57Z

Readme needs update - The Bare Metal Imaging (BMI) is a core component of the Massachusetts Open Cloud - it is mass open cloud now...

pgrosu · 2017-11-02T20:38:37Z

Hi Dan,

This is a great start, and if you prefer you can send my requests via email in private. So as this not part of Travis could you please provide me with the two Ubuntu and CentOS preconfiguration environments for both - these can be VMs on a specific deployment - and the step-by-step details on how the settings for all the necessary configurations and minimum version restrictions. Will the Yaml files (main.yml, site.yml, etc) run as is without any changes? Where was this tested on? For which environment were the DHCP ranges created? Either some step-by-step documentation or information would be needed for me to add to the appropriate configuration entries of the UAT framework for each of these test scenarios. As Rado indicated if we look at the README file, statements like the following don't give me confidence in what I should do:

Modify bmi_config.cfg to match whatever your current HIL and Ceph setup is.
Modify dnsmasq.conf within roles/dhcp/tasks/main.yml to match your requirements.
Comment out any of the roles you don't want run in site.yml

( The above was taken from: https://github.com/djfinn14/ims/blob/0eb117f424bc94e86cefbd16dc1dd9aa69aa41f9/scripts/install/production/README.md )

I am happy to test, but I would like some documentation similar to how I provided with my manuals to perform validations for deployment. I'm not trying to be a pain, but I'm swamped and would not like to start guessing. We need to maintain the nice predictability we initiated this summer, where we were only document-driven. In fact, if there is no clear documentation we should not not accept PRs.

I attached the two BMI manuals as a guide and reference:

Thanks,
Paul

radonm

See comments about ceph packages install

radonm · 2017-11-02T21:33:58Z

scripts/install/production/roles/bmi/tasks/main.yml

+  become: true
+
+- name: Install cephlibs
+  pip:


Why are you using pip for this ? Packages should be coming from yum repos on centos you can get it by
yum -y install http://download.ceph.com/rpm-luminous/el7/noarch/ceph-release-1-1.el7.noarch.rpm
and then yum install whatever you need from ceph

On rhel there is some code in the dev scripts using rhcs 1.2, if you change that to 2x you will get the up to date packages or you can use upstream the same as above on CentOS

Does pip install latest dev code for ceph? Latest dev != production

As far as I can tell, that package actually isn't a ceph package, it is a package created by a 3rd party that provides rados and rbd python bindings to connect BMI to an exisiting ceph cluster. I also believe the only way to install it is through pip.

https://pypi.python.org/pypi/python-cephlibs/ is what you guys use? hmmm... two years old code marked as deprecated? you should probably use the official rados/rbd bindings (whose source code is here https://github.com/ceph/ceph/tree/master/src/pybind and listed on pypi simply as "rados" and "rbd"). I'll make an issue about it. (But for now, what you're doing is fine)

djfinn14 · 2017-11-07T15:38:50Z

@pgrosu I sent you an email with this but also want to have it here:

Could you please provide me with the two Ubuntu and CentOS preconfiguration environments for both - these can be VMs on a specific deployment - and the step-by-step details on how the settings for all the necessary configurations and minimum version restrictions

I am not entirely sure what you are want from me here. You need a clean VM (CentOS, RHEL or Ubuntu) that is set up to communicate to a Ceph cluster and HIL. I personally tested it in PRB by geting a clone of the bmi-dev vm, doing my best to wipe all of the packages and bmi setup within it, and then running the playbook.

Will the Yaml files (main.yml, site.yml, etc) run as is without any changes?

Yes, the YAML files can run without any changes, but like is stated you will want to make changes to the files I recommended so that it installs correctly according to your environment.

Where was this tested on? For which environment were the DHCP ranges created?

I first tested this on my kumo VMs. I had a CentOS and Ubuntu VM that I could rebuild. Those tests just were to make sure things like the tgt and dnsmaq services were getting started. You can technically run the dev install scripts for Ceph and HIL and then copy bmi_config.cfg.test into bmi_config.cfg and run the Ansible playbook if you want to have a self contained "toy" setup to see how it runs. My real test came on that cloned BMI-dev VM I mentioned earlier. I saved the bmiconfig file and dnsmasq config file and did my best to wipe everything else, then ran the playbook and tested to make sure I could run the normal BMI commands such as adding an image to the database, listing the database, provision/deprovisioning a node.

Modify bmi_config.cfg to match whatever your current HIL and Ceph setup is.

If you actually look at the bmi_config.cfg file, you can see it has instructions on what to put for each field, and there is a bmi_config.cfg.test file that has example settings.

Modify dnsmasq.conf within roles/dhcp/tasks/main.yml to match your requirements.

If you look at roles/dhcp/tasks/main.yml you can see there are pre-filled in setting for the dnsmasq.conf. You can keep the defaults, or you may want to change things like the interface you are using.

Comment out any of the roles you don't want run in site.yml

If you open the site.yml file you can see there are 3 roles listed. I tried to make each role self contained, so if you already had tgt setup, for example, you comment out "- tgt" and then run the playbook and you would only install the dhcp and bmi.

Let me know if this answers your questions.

pgrosu · 2017-11-08T18:52:39Z

Hi Dan (@djfinn14),

I am in the middle of a couple of hard research problems I working through and are taking most of my time, so I'll give a quick overview behind what I am asking. You have done a lot of great work here, but now there is one more bridge that needs crossing. So in my experience through different software projects, the easiest way I have found them to grow their user-base is by having a clearly guided transition to implementation from a minimal starting point. This means that you have to think like a new user, and thus educate and guide your perspective users from start to finish. That undoubtably takes time and work beyond a set of configurations and a Readme file. Imagine you are a new user who sees our MOC/IMS Github location, and wants to better understand why such an Ansible playbook implementation important, how to test it from a minimal starting point and how it will help them. I'm not saying explain everything, but if you pick a person who is new to our project or the MOC, and provide him/her with your set of instructions, would they be able to reproduce them without Googling or inquiring other resources? Do they understand the connection to the rest of the project? Would all users get the same result? This is a foundation of system validation. Since this not yet part of a smoke-test on a continuous-integration platform, this is even more pertinent.

Hope it makes sense and is helpful,
Paul

naved001 · 2017-11-08T18:56:09Z

@pgrosu You could still review the ansible script nonetheless. Everything doesn't have to be blocked on just one thing.

pgrosu · 2017-11-10T20:20:37Z

@naved001 I understand what you are saying, but we want spend a bit more time at the beginning to save us simple, overlooked gaps as the project grows - otherwise this becomes more internal knowledge, which has a high-probability of shrinking the user-base over time. It is okay to have a human check as a secondary check, driven by a set of SOPs (Standard Operating Procedures) as a primary set of operational semantics when performing functional testing in order to guarantee repeatability. That is why we initiated that process through a first set of manuals/guides we created over the summer. Over time we want those to become automated as a large set of tests/scenarios for continuous integration that is more thorough than Travis, which would encompass things such as system validation.

@djfinn14 If you have time after today's meeting we can sit together for some of this.

naved001 · 2017-11-15T03:13:21Z

@pgrosu
Could you make a list of things that you want @djfinn14 to do to get your approval on this PR? Please be as specific as you can and keep it simple. Once you pin down an exact set of requirements, we can work on it one at a time. Meanwhile, you could review at the ansible script itself (the main meat of this PR).

Just keep in mind that this script is aimed at people with sufficient/reasonable know-how of the linux world.

apoorvemohan

iPXE and PXE are not being setup with this ansible scripts

apoorvemohan · 2017-11-30T23:51:34Z

scripts/install/production/README.md

+      sudo yum install ansible
+      ```
+
+2. Add your hosts to the ansible hosts file (/etc/ansible/hosts)


How about having an example here? On how to append the hostname to /etc/ansible/hosts. Not sure if it makes sense to have an example here?

e.g.
#ungrouped localhost for BMI installation
192.168.122.76

apoorvemohan · 2017-12-10T22:26:06Z

scripts/install/production/roles/bmi/tasks/main.yml

+  environment:
+    HIL_USERNAME: hil
+    HIL_PASSWORD: secret
+  with_items:


For Python 2.7.5 I had to execute "pip install requests urllib3 pyOpenSSL --force --upgrade" on "CentOS Linux release 7.4.1708 (Core)" to install BMI.

apoorvemohan · 2017-12-10T22:49:26Z

scripts/install/production/README.md

+3. Modify bmi_config.cfg to match whatever your current HIL and Ceph setup is.
+
+4. Modify dnsmasq.conf within roles/dhcp/tasks/main.yml to match your requirements.
+


add instruction to modify HIL credentials in scripts/install/production/roles/bmi/tasks/main.yml

apoorvemohan · 2017-12-10T23:00:12Z

scripts/install/production/roles/bmi/tasks/main.yml

+- name: Bootstrap the database
+  command: "{{ item }}"
+  environment:
+    HIL_USERNAME: hil


The HIL environment variables were not set for me using after the installation completed successfully

apoorvemohan · 2017-12-11T21:02:16Z

scripts/install/production/roles/tgt/tasks/main.yml

+
+- name: Change SELinux to permissive for CentOS
+  selinux:
+    policy: targeted


selinux needs to "disabled" for BMI to work

Permissive is basically disabled with warnings on. No need to disable it.

Again, BMI pro not working with permissive. Tested in Kumo.

from this page here

in permissive mode SELinux does not enforce its policy, but only logs what it would have blocked (or granted)

applications that are SELinux-aware might still behave differently with permissive mode than when SELinux is completely disabled

Based on the second point, I'll defer to you on this one. But do you know what selinux aware app we have that needs it disabled? TGT?

cc: @chemistry-sourabh

I don't completely understand the meaning of "selinux aware". I'll have to read on it.

apoorvemohan · 2017-12-11T21:02:44Z

scripts/install/production/roles/tgt/tasks/main.yml

+   - gcc
+   - cpan
+   - make
+   - firewalld


firewalld needs to be "disabled" for BMI to work

No. Firewalld just manages iptables rules for you in CentOS. Why would you outright disable it? And Dan tested this setup, so it definitely works.

it is dropping DHCP request during BMI provision

I tested in kumo

but that has nothing to do with firewalld itself. It only manages iptables rules.

If the firewalld service is disabled on the machine (by default, it's enabled on centos), then we have to directly make changes to iptables. But the problem with that is those changes aren't saved since there's no iptables.service on centos anymore (you have to run the iptables command everytime on boot, or add to rc.local).

See if firewalld is running, and then see if iptables has rules for dhcp port(s)?

I suppose allowing port 67 and 68 should when firewalld is running. Needs to be tested tough.

Updated the README to include instructions on modifying the hosts file, the HIL credentials and bashrc. Also modified firewalld and selinux.

djfinn14 and others added 12 commits September 15, 2017 16:01

Initial commit for ansible production install

5aa7c15

Translated necessary parts of install_packages to ansible

a971804

Created Roles and added dhcp install

a55e0b8

Added pxe role and modified bmi install tasks

53aa517

Added lines for Ubuntu Installation

8f046f6

Added to bmi main.yml and renamed iscsi to tgt

faac70b

Added tgt file, small changes to the other 3 roles

bbe77a3

Fixed DB issues with the install and added a README

1b9fd29

Removed pxe role and made changes to file paths

c2e95e0

Added more meaningful titles to tasks, updated README.

505deaa

Fixed Directory permissions issue and fixed small typo.

302bfcf

Removed unecessary hosts file.

0eb117f

naved001 reviewed Oct 27, 2017

View reviewed changes

apoorvemohan requested a review from pgrosu October 29, 2017 17:33

Updated README to have code blocks and replaced iptables with firewalld

2cae815

apoorvemohan requested a review from radonm November 2, 2017 17:29

naved001 approved these changes Nov 2, 2017

View reviewed changes

radonm suggested changes Nov 2, 2017

View reviewed changes

This was referenced Nov 15, 2017

Use official rados/rbd bindings for our ceph client #155

Open

Ansible vs Shell Script for BMI Install #145

Closed

apoorvemohan self-requested a review December 17, 2017 20:38

apoorvemohan suggested changes Dec 17, 2017

View reviewed changes

Addressed Apoorve's review comments

5a90c58

Updated the README to include instructions on modifying the hosts file, the HIL credentials and bashrc. Also modified firewalld and selinux.

apoorvemohan removed the request for review from pgrosu February 13, 2018 22:15

apoorvemohan approved these changes Apr 13, 2018

View reviewed changes

Merge branch 'master' into production_install

3ae4aa3

naved001 merged commit 02679ad into CCI-MOC:master Apr 27, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ansible Playbook for BMI Installation #153

Ansible Playbook for BMI Installation #153

djfinn14 commented Oct 27, 2017

naved001 left a comment

naved001 Oct 27, 2017

naved001 Oct 27, 2017

djfinn14 Nov 2, 2017

naved001 Oct 27, 2017

djfinn14 Nov 2, 2017

apoorvemohan commented Oct 29, 2017

radonm commented Nov 2, 2017

pgrosu commented Nov 2, 2017

radonm left a comment

radonm Nov 2, 2017

djfinn14 Nov 3, 2017

jeremyfreudberg Nov 3, 2017

djfinn14 commented Nov 7, 2017

pgrosu commented Nov 8, 2017

naved001 commented Nov 8, 2017

pgrosu commented Nov 10, 2017

naved001 commented Nov 15, 2017 •

edited

apoorvemohan left a comment

apoorvemohan Nov 30, 2017

apoorvemohan Dec 10, 2017

apoorvemohan Dec 10, 2017

apoorvemohan Dec 10, 2017

apoorvemohan Dec 11, 2017

naved001 Dec 17, 2017

apoorvemohan Dec 17, 2017

naved001 Dec 17, 2017

apoorvemohan Dec 17, 2017

apoorvemohan Dec 11, 2017

naved001 Dec 17, 2017

apoorvemohan Dec 17, 2017

apoorvemohan Dec 17, 2017

naved001 Dec 17, 2017

apoorvemohan Dec 17, 2017 •

edited

		3. Modify bmi_config.cfg to match whatever your current HIL and Ceph setup is.

		4. Modify dnsmasq.conf within roles/dhcp/tasks/main.yml to match your requirements.

Ansible Playbook for BMI Installation #153

Ansible Playbook for BMI Installation #153

Conversation

djfinn14 commented Oct 27, 2017

naved001 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

apoorvemohan commented Oct 29, 2017

radonm commented Nov 2, 2017

pgrosu commented Nov 2, 2017

radonm left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

djfinn14 commented Nov 7, 2017

pgrosu commented Nov 8, 2017

naved001 commented Nov 8, 2017

pgrosu commented Nov 10, 2017

naved001 commented Nov 15, 2017 • edited

apoorvemohan left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

apoorvemohan Dec 17, 2017 • edited

Choose a reason for hiding this comment

naved001 commented Nov 15, 2017 •

edited

apoorvemohan Dec 17, 2017 •

edited