
Unicorn does not come up (error 502) after hard restart of Docker server #1305

Open

IlyaSemenov opened this issue Jul 27, 2017 · 27 comments

@IlyaSemenov

IlyaSemenov commented Jul 27, 2017

Steps to reproduce

  1. Run GitLab using the guide
  2. Power cycle the server running Docker

Actual result

GitLab will never come up fully, showing error 502.

The docker container logs will have this:

2017-07-26 23:20:38,558 INFO spawned: 'unicorn' with pid 612
2017-07-26 23:20:39,160 INFO exited: unicorn (exit status 1; not expected)
...
2017-07-26 23:20:46,864 INFO spawned: 'unicorn' with pid 647
2017-07-26 23:20:47,312 INFO exited: unicorn (exit status 1; not expected)
2017-07-26 23:20:48,313 INFO gave up: unicorn entered FATAL state, too many start retries too quickly

unicorn_stderr.log will have this:

...
/home/git/gitlab/vendor/bundle/ruby/2.3.0/gems/unicorn-5.1.0/lib/unicorn/http_server.rb:195:in `pid=': Already running on PID:601 (or pid=/home/git/gitlab/tmp/pids/unicorn.pid is stale) (ArgumentError)
        from /home/git/gitlab/vendor/bundle/ruby/2.3.0/gems/unicorn-5.1.0/lib/unicorn/http_server.rb:127:in `start'
        from /home/git/gitlab/vendor/bundle/ruby/2.3.0/gems/unicorn-5.1.0/bin/unicorn_rails:209:in `<top (required)>'
        from /home/git/gitlab/vendor/bundle/ruby/2.3.0/bin/unicorn_rails:22:in `load'
        from /home/git/gitlab/vendor/bundle/ruby/2.3.0/bin/unicorn_rails:22:in `<main>'

Workaround

The only way to bring GitLab up is to docker exec into the container, manually delete the stale pid file, and restart the container:

docker exec -it gitlab rm /home/git/gitlab/tmp/pids/unicorn.pid && docker restart gitlab
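
For anyone scripting this from the Docker host, a slightly safer variant is to remove the pid file only when the recorded PID is no longer alive inside the container. This is just a sketch built around the command above (the container name gitlab and the pid path are the ones used in this thread):

PIDFILE=/home/git/gitlab/tmp/pids/unicorn.pid
PID=$(docker exec gitlab cat "$PIDFILE" 2>/dev/null)
# kill -0 only checks whether the process exists; it sends no signal
if [ -n "$PID" ] && ! docker exec gitlab kill -0 "$PID" 2>/dev/null; then
    docker exec gitlab rm -f "$PIDFILE"
    docker restart gitlab
fi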

Expected result

GitLab comes up without manual intervention.

@asbjornenge

Any progress here? I'm trying to run gitlab:latest in a Docker swarm and getting stuck on this. Is the pid file still located there in the latest version? What version of GitLab were you trying, @IlyaSemenov?

@lucpolak

lucpolak commented Oct 11, 2017

Hello,
I have exactly the same issue.
Sometimes GitLab doesn't start successfully after a server reboot.
I'm interested in a resolution of this issue.

@asbjornenge

@lucpolak I finally got it working just by using a beefier server. I was trying to run on a g1-small on GCP, but upgrading to an n1-standard-2 did the trick 👍

@lucpolak

Hey @asbjornenge, my server is pretty good. The VM is hosted on ESXi with a 4-core Intel CPU and 32 GB RAM.
It is provided by OVH.
The VM runs Ubuntu with Docker installed and 4 GB RAM allocated.

I have another VM with the same config running gitlab-ce without Docker, and everything works fine ;-(

@arthurkrupa

We had the same issue on a DigitalOcean 4-core VPS with 8GB RAM (~30 regular users and a lot of CI pipelines).

What helped was reducing the number of unicorn workers from 8 to 6 (using the UNICORN_WORKERS variable).
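
A minimal sketch of that mitigation, i.e. passing the variable when creating the container (all other required options, volumes, and environment settings are omitted for brevity; the image tag is only illustrative):

docker run --name gitlab -d -e UNICORN_WORKERS=6 sameersbn/gitlab:latest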

@Mario-Eis

Exact same issue on a Synology NAS. Reducing the workers did not solve the issue.
It should perhaps be mentioned that it had already been working fine. The issues started about 1-2 months ago, maybe with 10.2.x or 10.3.x.

@Mario-Eis

Is there a workaround like automatically removing the pid file at startup?

@HengCC

HengCC commented Apr 22, 2018

I ran into the same problem.

INFO exited: unicorn (exit status 1; not expected)
2018-04-22 06:08:53,643 INFO spawned: 'unicorn' with pid 587
2018-04-22 06:08:54,647 INFO success: unicorn entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)

@bsakweson

bsakweson commented Apr 22, 2018

sameersbn/gitlab:10.6.4

I am seeing this same behavior at the moment and can hardly figure out how to resolve it. I am in the process of deploying GitLab in our on-prem Kubernetes cluster. Some googling shows that some people have had success beefing up memory for the running instance. I bumped the pod spec to use up to 4 GB RAM, but that has also been futile. Here is what I see in the log before Kubernetes restarts the container in an effort to repair it. In essence, it hangs here:

2018-04-22 09:01:04,820 CRIT Supervisor running as root (no user in config file)
2018-04-22 09:01:04,820 WARN Included extra file "/etc/supervisor/conf.d/cron.conf" during parsing
2018-04-22 09:01:04,820 WARN Included extra file "/etc/supervisor/conf.d/gitaly.conf" during parsing
2018-04-22 09:01:04,820 WARN Included extra file "/etc/supervisor/conf.d/gitlab-workhorse.conf" during parsing
2018-04-22 09:01:04,820 WARN Included extra file "/etc/supervisor/conf.d/mail_room.conf" during parsing
2018-04-22 09:01:04,820 WARN Included extra file "/etc/supervisor/conf.d/nginx.conf" during parsing
2018-04-22 09:01:04,820 WARN Included extra file "/etc/supervisor/conf.d/sidekiq.conf" during parsing
2018-04-22 09:01:04,820 WARN Included extra file "/etc/supervisor/conf.d/sshd.conf" during parsing
2018-04-22 09:01:04,820 WARN Included extra file "/etc/supervisor/conf.d/unicorn.conf" during parsing
2018-04-22 09:01:04,824 INFO RPC interface 'supervisor' initialized
2018-04-22 09:01:04,825 CRIT Server 'unix_http_server' running without any HTTP authentication checking
2018-04-22 09:01:04,825 INFO supervisord started with pid 1
2018-04-22 09:01:05,827 INFO spawned: 'gitaly' with pid 592
2018-04-22 09:01:05,829 INFO spawned: 'sidekiq' with pid 593
2018-04-22 09:01:05,831 INFO spawned: 'unicorn' with pid 594
2018-04-22 09:01:05,833 INFO spawned: 'gitlab-workhorse' with pid 595
2018-04-22 09:01:05,835 INFO spawned: 'cron' with pid 600
2018-04-22 09:01:05,853 INFO spawned: 'nginx' with pid 601
2018-04-22 09:01:05,855 INFO spawned: 'sshd' with pid 603
2018-04-22 09:01:07,564 INFO success: gitaly entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2018-04-22 09:01:07,564 INFO success: sidekiq entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2018-04-22 09:01:07,564 INFO success: unicorn entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2018-04-22 09:01:07,564 INFO success: gitlab-workhorse entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2018-04-22 09:01:07,564 INFO success: cron entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2018-04-22 09:01:07,564 INFO success: nginx entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2018-04-22 09:01:07,564 INFO success: sshd entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2018-04-22 09:06:03,655 WARN received SIGTERM indicating exit request
2018-04-22 09:06:03,656 INFO waiting for sshd, gitlab-workhorse, sidekiq, cron, nginx, gitaly, unicorn to die
2018-04-22 09:06:03,657 INFO stopped: sshd (exit status 0)
2018-04-22 09:06:03,662 INFO stopped: nginx (exit status 0)
2018-04-22 09:06:03,663 INFO stopped: cron (terminated by SIGTERM)
2018-04-22 09:06:03,665 INFO stopped: gitlab-workhorse (terminated by SIGTERM)
2018-04-22 09:06:05,094 INFO stopped: unicorn (exit status 0)
2018-04-22 09:06:07,097 INFO waiting for sidekiq, gitaly to die
2018-04-22 09:06:07,669 INFO stopped: sidekiq (exit status 0)
2018-04-22 09:06:07,676 INFO stopped: gitaly (exit status 1)

See these two lines where it dies and look at how long it took for it to stop:

2018-04-22 09:01:07,564 INFO success: sshd entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2018-04-22 09:06:03,655 WARN received SIGTERM indicating exit request

It just sits at this point until the container is restarted by Kubernetes. I have also increased initialDelaySeconds to 300, a relatively high number, to see if that resolves it, but no luck.
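
In case the same stale pid file is the culprit inside the pod, the manual workaround from this thread translates roughly to the following (the pod name gitlab-0 and the pid path are placeholders/assumptions, not verified values):

kubectl exec gitlab-0 -- rm -f /home/git/gitlab/tmp/pids/unicorn.pid
kubectl delete pod gitlab-0   # let the Deployment/StatefulSet recreate the pod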

@LM1LC3N7

docker exec -it gitlab rm /home/git/gitlab/tmp/pids/unicorn.pid && docker restart gitlab

This solved my issue, thanks :-)

@fover0932

docker exec -it gitlab rm /home/git/gitlab/tmp/pids/unicorn.pid && docker restart gitlab

This solved my issue, thanks :-)

Me too, thanks!!!

@compurator

compurator commented Jun 2, 2018 via email

@herrmanthegerman

I'm seeing this issue just now on my GitLab installation on a Synology NAS.

I installed GitLab via Package Center, i.e., I'm using the package provided by Synology, which is based on an old version (sameersbn/gitlab:9.4.4).

Fixed the issue by removing the stale PID file. Thanks!

@sharkymcdongles

Any actual solution rather than a mitigation? Is it just that my docker container doesn't have enough memory assigned or is there something misconfigured?

@StefanCristian

In my case, the problem was that I was using wrongly signed SSL certificates, and in particular the wrong dhparam.pem file.
Nginx didn't recognize them and failed because of that.
The bad part of the story is that it didn't show up in the logs anywhere.
@bsakweson @sharkymcdongles did you try with self-signed certificates for a short test?

@jcberthon

We experienced a full file system and had to restart GitLab. After the restart we also got an error 502.

I did:

# gitlab-ctl status
run: alertmanager: (pid 551) 1449s; run: log: (pid 545) 1449s
run: gitaly: (pid 593) 1449s; run: log: (pid 589) 1449s
run: gitlab-monitor: (pid 597) 1449s; run: log: (pid 592) 1449s
run: gitlab-pages: (pid 558) 1449s; run: log: (pid 556) 1449s
run: gitlab-workhorse: (pid 553) 1449s; run: log: (pid 548) 1449s
run: logrotate: (pid 596) 1449s; run: log: (pid 591) 1449s
run: nginx: (pid 579) 1449s; run: log: (pid 578) 1449s
run: node-exporter: (pid 552) 1449s; run: log: (pid 547) 1449s
run: postgres-exporter: (pid 563) 1449s; run: log: (pid 560) 1449s
run: postgresql: (pid 561) 1449s; run: log: (pid 557) 1449s
run: prometheus: (pid 594) 1449s; run: log: (pid 590) 1449s
run: redis: (pid 549) 1449s; run: log: (pid 543) 1449s
run: redis-exporter: (pid 550) 1449s; run: log: (pid 544) 1449s
run: registry: (pid 542) 1449s; run: log: (pid 540) 1449s
run: sidekiq: (pid 541) 1449s; run: log: (pid 539) 1449s
run: sshd: (pid 20) 1480s; run: log: (pid 19) 1480s
run: unicorn: (pid 33646) 1s; run: log: (pid 559) 1449s

All services were up except unicorn, which kept restarting.

I checked unicorn's log files and they stated:

ArgumentError: Already running on PID:777 (or pid=/opt/gitlab/var/unicorn/unicorn.pid is stale)

So, as already mentioned above, a simple rm /opt/gitlab/var/unicorn/unicorn.pid was enough. In fact, because GitLab (omnibus installation) kept restarting unicorn, I did not have to restart anything. After a second, unicorn was up and running and GitLab was healthy again! :-)

@gjrtimmer
Contributor

Removing the PID file and restarting also solved my issue.
It was caused by a reboot of my Synology NAS.

@solidnerd @sameersbn
Can we fix this permanently by adding a cleanup step in the entrypoint?

Example:

#!/bin/bash

# Define cleanup procedure
cleanup() {
    echo "Container stopped, performing cleanup..."
}

# Run cleanup when the container receives SIGTERM
trap 'cleanup' SIGTERM

# Execute the passed command in the background
"${@}" &

# Wait for it to exit (the trap can fire while we wait)
wait $!

# Run cleanup after a normal exit as well
cleanup
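
A complementary sketch that attacks the stale pid file directly: wrap the image's entrypoint and remove the pid file before GitLab starts. The pid path is the one reported in this thread; the /sbin/entrypoint.sh path is an assumption about the image and should be verified:

#!/bin/bash
# Remove a unicorn pid file left over from a hard stop,
# then hand off to the image's original entrypoint (path assumed).
rm -f /home/git/gitlab/tmp/pids/unicorn.pid
exec /sbin/entrypoint.sh "$@"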

@JMLX42

JMLX42 commented Jun 21, 2019

Could it be that Docker kills the gitlab container before unicorn has had enough time to shut down?
Maybe we could try setting Docker's --stop-timeout option to a higher value.
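
For example (a sketch only, with all other required options omitted; the equivalent docker-compose setting would be stop_grace_period):

docker run --name gitlab --stop-timeout 120 -d sameersbn/gitlab:latest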

@mgscreativa

Same here. This command worked for me, but the PID path is different in my case: /opt/gitlab/var/unicorn/unicorn.pid

docker exec -it gitlab rm /opt/gitlab/var/unicorn/unicorn.pid && docker restart gitlab

I've put that in my cron file and it works!
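
One way to write that as a root crontab entry on the Docker host (the -it flags are dropped because cron has no TTY; the @reboot schedule, the sleep, and the container name are assumptions, so adjust to taste):

@reboot sleep 120 && docker exec gitlab rm -f /opt/gitlab/var/unicorn/unicorn.pid && docker restart gitlab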

@abulka

abulka commented Jan 29, 2020

The 502 problem happens when I stop and then start GitLab from the Synology package manager UI, which I don't consider to be a "hard restart". As such, it's a problem every Docker GitLab Synology deployment is going to run into very quickly.

Users will have to be lucky enough to find this thread and learn to run a docker command (the rm and restart mentioned above, which works for me) to fix the problem. And running a docker command on a Synology is not straightforward via the UI; you have to do the following:

(screenshot: gitlab synology restart fix)

This command has to be issued every time the NAS restarts, etc., unless they use the cron job fix mentioned above, which I'm not sure how to do on a Synology. @mgscreativa can you please elaborate?

I think this issue is pretty serious and needs a proper fix.

@mgscreativa

Hi @abulka, sorry, I don't have Synology hardware!

@hannes-ucsc

hannes-ucsc commented Jan 31, 2020

Same here on an EC2 instance booting from a RancherOS AMI. So this is not specific to Synology. This occurred after sudo reboot. The workaround of running docker exec -it gitlab rm /opt/gitlab/var/unicorn/unicorn.pid && docker restart gitlab worked.

12.4.5 (539f5fc0384)

@stale

stale bot commented May 6, 2020

This issue has been automatically marked as stale because it has not had any activity for the last 60 days. It will be closed if no further activity occurs during the next 7 days. Thank you for your contributions.

@stale stale bot added the wontfix label May 6, 2020
@IlyaSemenov
Author

(image)

@stale stale bot removed the wontfix label May 7, 2020
@sameersbn
Owner

Sorry about this. Does this issue still exist with the newer releases?

@sameersbn
Owner

Ah, I see it's present in 12.4.5 too. Will make a fix soon.

@solidnerd
Collaborator

@sameersbn I think we need to do this with puma now instead of unicorn.
