
Deploying a unicorn app causes 502s. Can we use sockets instead of a port? #386

Closed
ryana opened this issue Dec 16, 2013 · 20 comments

@ryana

ryana commented Dec 16, 2013

Hey there,

Every time I deploy, I get a 502 for ~20 seconds. I believe the issue is that nginx is pointed at the new port before the new unicorn is up. This might be fixed if nginx pointed at a socket instead of a port. However, if I understand how dokku works, the whole idea of using Unicorn's graceful restart may not work out of the box. That is, nothing tells dokku to send a USR2 to Unicorn (http://unicorn.bogomips.org/SIGNALS.html) to make it restart.

Can anyone with Unicorn/dokku experience chime in? Thanks!

@tspacek

tspacek commented Dec 19, 2013

I have the same issue. I'll let you know if I work it out, but if anyone with more experience with this has any info, it would be much appreciated.

@ryana
Author

ryana commented Dec 19, 2013

After noodling on this for a bit, I think this is less about using sockets vs. ports and more about managing when the host nginx is reloaded. The bottom line is that with dokku, we cannot use USR2 to reload Unicorn. However, Unicorn still has the forking niceties that effectively let us "scale dynos" -- something that appears to be lacking in dokku right now.

I think fixing this will require removing the nginx-vhosts plugin and adding a unicorn-nginx plugin (which I'm going to write) that will:

On initial deploy:

  1. Start a new container
  2. Write the nginx config so that it uses a socket on the host OS (container port proxied to a host socket file) -- see the sketch at the end of this comment

On redeploy:

  1. Start a new container
  2. In a predeploy hook, copy the current CONTAINER file to a safe place and delete the file (dokku looks for this file and, if it exists, automatically kills the container it refers to).
    2a. This stops the current container from being killed
  3. Wait until the new container responds to a GET
  4. Rewrite the host nginx to forward from the correct port
  5. Kill old container

Let me know if this makes sense.
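
For that socket-based host nginx config (step 2 of the initial deploy above), a minimal sketch of what such a plugin's hook could write -- the socket path, domain, and reload command here are assumptions for illustration, not dokku's actual layout:

```bash
#!/usr/bin/env bash
# Hypothetical hook for the proposed unicorn-nginx plugin.
# APP is passed in by dokku; the socket path and server_name are made up here.
APP="$1"
SOCKET="/var/run/dokku/$APP.sock"

cat > "$DOKKU_ROOT/$APP/nginx.conf" <<EOF
upstream $APP {
  server unix:$SOCKET fail_timeout=0;
}

server {
  listen 80;
  server_name $APP.example.com;

  location / {
    proxy_set_header Host \$http_host;
    proxy_set_header X-Forwarded-For \$remote_addr;
    proxy_pass http://$APP;
  }
}
EOF

# Reload the host nginx so the new vhost takes effect
sudo /etc/init.d/nginx reload
```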

@tspacek

tspacek commented Dec 19, 2013

This is the same conclusion I came to over the last few hours. It sounds like, the socket aside, this would be applicable to most deploys though?

@ryana
Author

ryana commented Dec 19, 2013

Yeah this is more of a general "oh crap that's right we can't use the zero-downtime features of unicorn when we're using containers" -- so actually, it could (should) probably be generalized to not be unicorn specific at all. Something like:

  1. start the new container
  2. don't kill old container
  3. wait until a GET succeeds on the new container $PORT
  4. rewrite host nginx conf & reload it
  5. kill old container
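
A rough bash sketch of that sequence, assuming the app exposes port 5000 inside the container the way dokku's standard buildpack apps do (file names, the 30-second cap, and the reload command are illustrative):

```bash
#!/usr/bin/env bash
set -e
APP="$1"

# 2. remember the old container instead of killing it
OLD_CONTAINER=$(cat "$DOKKU_ROOT/$APP/CONTAINER")

# 1. start the new container
NEW_CONTAINER=$(docker run -d "dokku/$APP" /bin/bash -c "/start web")
NEW_PORT=$(docker port "$NEW_CONTAINER" 5000 | cut -d: -f2)

# 3. wait until a GET on the new container's $PORT succeeds
up=""
for i in $(seq 1 30); do
  if curl -fs "http://localhost:$NEW_PORT/" > /dev/null; then up=1; break; fi
  sleep 1
done
if [ -z "$up" ]; then
  echo "new container never answered; keeping the old one" >&2
  docker kill "$NEW_CONTAINER" > /dev/null
  exit 1
fi

# 4. rewrite the host nginx conf to proxy to $NEW_PORT, then reload it
echo "$NEW_PORT" > "$DOKKU_ROOT/$APP/PORT"
echo "$NEW_CONTAINER" > "$DOKKU_ROOT/$APP/CONTAINER"
# ... regenerate $DOKKU_ROOT/$APP/nginx.conf here ...
sudo /etc/init.d/nginx reload

# 5. kill the old container
docker kill "$OLD_CONTAINER" > /dev/null
```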

@tspacek

tspacek commented Dec 19, 2013

This would be really useful for my cases, and I suspect for others too. 

Sounds conceptually similar to https://devcenter.heroku.com/articles/labs-preboot



@ryana
Author

ryana commented Dec 19, 2013

Yeah. I haven't contributed much to open source lately... I think I'm due ;) Give me the weekend and I should have something worked out.

@plietar
Contributor

plietar commented Dec 19, 2013

That sounds good.
The call to the post-deploy hook should only happen once the container is ready.
Killing the old container/replacing CONTAINER should be moved to the post-deploy step.

The problem is getting this to work in a generic manner.
A GET / might not be best for all apps.
Also, what happens if an app crashes/hangs? We would need a (configurable) timeout.

All of that could probably be implemented as an extension to the dokku ps command (PR #298).

@tspacek

tspacek commented Dec 19, 2013

Heroku seems to just wait 3 mins before switching over (with preboot), which is still better than the current sequential stop then start.

The downside is if you're running any workers in the same container they'll be ticking away doubled up for a while.


@ryana
Author

ryana commented Dec 19, 2013

Yeah, what I have in mind:

  • Configurable timeout on the new container start
  • Configurable GET string
  • Another (also configurable) timeout to wait for & make sure the old container actually dies, so that resources aren't getting eaten up

I don't like the 3 minute thing. I'd rather do it right w/ a GET.
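
As a minimal sketch of how those knobs could be exposed, assuming the plugin reads them from the app's environment with sane defaults (the variable names below are hypothetical, not an existing dokku convention):

```bash
# Hypothetical per-app overrides, read from the app's environment
CHECK_PATH="${DOKKU_CHECK_PATH:-/}"          # GET string to probe on the new container
CHECK_TIMEOUT="${DOKKU_CHECK_TIMEOUT:-30}"   # seconds to wait for the new container to answer
KILL_TIMEOUT="${DOKKU_KILL_TIMEOUT:-10}"     # seconds to wait for the old container to die
```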

@plietar when you talk about moving the container kill to the post-deploy, are you suggesting that I look at making changes to dokku? Or should I start w/ wrapping all this up in a plugin that follows the current hook interface?

@tspacek

tspacek commented Jan 13, 2014

Anything I can do to help on this?

@ryana
Author

ryana commented Jan 17, 2014

Finally getting around to it now... hoping for a PR you can look at this evening...

@ryana
Author

ryana commented Jan 17, 2014

Alright here's what I'm proposing: https://github.com/ryana/dokku/compare/zero-downtime?expand=1

You need to disable the nginx-vhosts plugin, but I just tested this a few times and it works. Actually, after thinking about it for a few minutes, this could probably just replace the nginx-vhosts plugin.... What do we think about that?

@tspacek

tspacek commented Jan 17, 2014

I'll give this a whirl over the weekend.

I don't know enough about everything else going on to say whether replacing nginx-vhosts would be a good idea.

@ryana
Author

ryana commented Jan 17, 2014

Awesome. I'll probably be pulling it out into a standalone plugin and putting it into production today. The reason I said it could replace nginx-vhosts is that it's very heavily based on nginx-vhosts. It's really just that plugin with a bunch of code moved around and a looping curl to wait until the new container is up. It needs some configuration options, but I want to make sure it's working first.

@plietar
Contributor

plietar commented Jan 17, 2014

@ryana There's already a lot of good work being done on the nginx integration by @mikexstudios (multiple domains support, ...)
Rather than modifying it, you could make an nginx-prereload hook and loop with curl in there.

You might also want to have a look at #298 for better process management, even though not much work has been done there.

@ryana
Author

ryana commented Jan 17, 2014

Oh cool. Somehow I missed when you mentioned #298 in your earlier comment.

I saw the nginx-prereload hook, but due to its placement after re-writing nginx.conf, if the new container doesn't start you're left with an nginx.conf on the FS that doesn't reflect nginx's current state. That feels wrong, no?

Also, regarding the dokku ps stuff, any idea on the timeline for getting that pulled in?

@plietar
Contributor

plietar commented Jan 18, 2014

This is certainly wrong, I hadn't thought about it.
Also, the nginx part should be kept as separate as possible from the rest of dokku.

I'd suggest adding a deploy-check hook that would run between the actual deploy and the post-deploy hook. (Or nested in a post-deploy hook with high priority, so that it runs before the nginx stuff, e.g. 00_dokku-standard.)

Based on the exit code of that hook, either continue with post-deploy (exit code 0), wait a few seconds and try again, or fail.

This way we can extend the criteria for detection with new plugins. If at least one plugin returns a non-zero value, pluginhook aborts all plugins and returns it.
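
Purely as an illustration, the core deploy path could look something like this -- the hook name check-deploy, the retry count, and the sleep are assumptions here, not an existing dokku interface:

```bash
# After the new container has been started, before nginx is touched:
ATTEMPTS=5
for attempt in $(seq 1 $ATTEMPTS); do
  # every plugin implementing the hook gets a say; pluginhook stops and
  # returns non-zero as soon as any plugin fails
  if pluginhook check-deploy "$APP" "$NEW_CONTAINER" "$NEW_PORT"; then
    break                     # exit code 0: continue with post-deploy
  fi
  if [ "$attempt" -eq "$ATTEMPTS" ]; then
    echo "deploy checks never passed, failing the deploy" >&2
    exit 1
  fi
  sleep 5                     # wait a few seconds and try again
done

pluginhook post-deploy "$APP" "$NEW_PORT"
```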

@plietar
Contributor

plietar commented Jan 18, 2014

Concerning dokku ps, @slaskis put in a lot of effort initially, but hasn't updated it much since.
So there's no timeline for now, until slaskis or someone else has time to complete it.

@slaskis

slaskis commented Jan 18, 2014

Oh this looks really interesting @ryana. Zero downtime is an issue with ps that I didn't get around to solving.

Mainly because of the way I used named docker containers, which was really nice and convenient, but it doesn't allow two containers with the same name (and renaming them was not possible). So to get it to work I'm thinking it would have to move away from named docker containers and store the container id in a file, something like $APP/ps/$name, which would simply contain the container id.

But I have been running ps in production for a few months now and I have found some other kinks that need to be fixed as well. And I'm curious to try it with a newer docker too; that might solve a few instabilities.

@josegonzalez
Member

You can now specify checks in a CHECKS file as of #562 (which was released in v0.3.0). Change the DOKKU_CHECKS_WAIT environment variable to your desired amount (~30 seconds in this case).
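
For reference, the CHECKS file lives in the root of the app repository and lists request paths, optionally followed by content that must appear in the response; the paths and expected strings below are only examples, and the exact semantics are in the dokku checks documentation:

```
# CHECKS -- one check per line: <path> <expected content (optional)>
/             Welcome to My App
/healthcheck  ok
```

The wait can then be tuned per app with something like `dokku config:set myapp DOKKU_CHECKS_WAIT=30` (the app name is just an example).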

Closing.
