build on cfncluster 16.04 LTS release once it is available for AWS #78

Open
aculich opened this issue Apr 11, 2016 · 3 comments

Comments

@aculich
Contributor

aculich commented Apr 11, 2016

I've filed a feature request on the cfncluster issue tracker asking them to release a 16.04 LTS-based AMI, so that we can generate a custom build of our AWS image using their cookbook scripts with Packer (rather than their other AMI-snapshot method of customization).

@paciorek
Contributor

Note: BCE-2016-spring.json now supports building BCE on top of a CFN Ubuntu image as the base image. Currently working with the CFN 14.04 LTS image. So we don't need to use their cookbook scripts; we can just work off their AMIs.

@aculich
Contributor Author

aculich commented Aug 29, 2016

@dougalb says, "[16.04 LTS cfncluster AMI] has been released as part of cfncluster-1.3".

@paciorek
Contributor

paciorek commented Dec 2, 2016

I've just built and tested this as BCE-2016-fall-cfncluster-preview, which should (shortly) be public on AWS in the Oregon region. This was done using our Packer build system, building on top of the CFN Ubuntu 16.04 base image (as done previously for 14.04).

I verified that simple multi-node submissions via either SGE or SLURM work and invoke auto-scaling if needed.

The only major issue was that I had to prevent installation of Ubuntu's lightdm package, because it pulls in the upstart package. The resulting presence of /sbin/start caused a failure when a virtual cluster was instantiated via 'cfncluster create': a cfncluster Chef recipe tries to start the gmetad service and somehow attempts to do so via upstart. For future reference, the error message from /var/log/cfn-init.log (on the instantiated master node) is given below.
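As a rough way to spot this condition on a running instance before trying 'cfncluster create', something like the following could be used. This is only a sketch: the function name `upstart_conflict` is made up for illustration, and it assumes that the existence of the /run/systemd/system directory means the machine was booted with systemd (the default on Ubuntu 16.04). That directory only exists on a booted system, so the check is meant for an instance launched from the image, not for the Packer build step itself.

```python
import os


def upstart_conflict(root="/"):
    """Return True if /sbin/start exists on a systemd-booted system.

    `root` lets the check be pointed at a different filesystem root;
    this is a hypothetical helper, not part of cfncluster or BCE.
    """
    # /sbin/start is the Upstart client binary that the Chef run invoked.
    has_start = os.path.exists(os.path.join(root, "sbin/start"))
    # Assumption: this directory is present only when systemd is the init system.
    systemd_booted = os.path.isdir(os.path.join(root, "run/systemd/system"))
    return has_start and systemd_booted


if __name__ == "__main__":
    if upstart_conflict():
        print("Warning: /sbin/start is present on a systemd system")
    else:
        print("No upstart/systemd conflict detected")
```

A True result does not prove that the gmetad restart will fail, only that the combination described above is present.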

On the branch cfn_fall_2016 (this can probably be safely merged into master), I updated a bit of the R/Python packaging (e.g., R packages and the Miniconda version) but did not update the Python packages (there's some manual labor there). It's worth discussing whether we are going to release a new general BCE version in the near future, and whether to coordinate the CFNcluster version with that.

service[gmetad] action enable (up to date)
  * service[gmetad] action restart

    ================================================================================
    Error executing action `restart` on resource 'service[gmetad]'
    ================================================================================

    Mixlib::ShellOut::ShellCommandFailed
    ------------------------------------
    Expected process to exit with [0], but received '1'
    ---- Begin output of /sbin/start gmetad ----
    STDOUT:
    STDERR: start: Unable to connect to Upstart: Failed to connect to socket /com/ubuntu/upstart: Connection refused
    ---- End output of /sbin/start gmetad ----
    Ran /sbin/start gmetad returned 1
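
If this comes up again, a small script can pull the failing resource and command out of /var/log/cfn-init.log rather than reading through the whole file. The sketch below assumes the failure block is laid out as in the excerpt above (an "Error executing action" line followed by a "Begin output of" line); other Chef versions may format this differently, and the function name `failed_chef_actions` is hypothetical.

```python
import re

# Patterns match the layout seen in the excerpt above; this is an assumption
# about the log format, not a documented Chef interface.
ERROR_RE = re.compile(r"Error executing action `(\w+)` on resource '([^']+)'")
COMMAND_RE = re.compile(r"Begin output of (.+?) ----")


def failed_chef_actions(log_text):
    """Return a list of (action, resource, command) tuples for failed actions.

    `command` is None if no "Begin output of" line follows the error line.
    """
    failures = []
    current = None
    for line in log_text.splitlines():
        error = ERROR_RE.search(line)
        if error:
            current = [error.group(1), error.group(2), None]
            failures.append(current)
            continue
        command = COMMAND_RE.search(line)
        # Attach only the first command seen after each error line.
        if command and current is not None and current[2] is None:
            current[2] = command.group(1)
    return [tuple(failure) for failure in failures]


if __name__ == "__main__":
    with open("/var/log/cfn-init.log") as handle:
        for action, resource, command in failed_chef_actions(handle.read()):
            print(f"{resource}: action {action} failed running {command}")
```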
