Is there a way to delay container startup to support dependent services with a longer startup time #374

dancrumb opened this Issue Aug 4, 2014 · 301 comments

@dancrumb
dancrumb commented Aug 4, 2014

I have a MySQL container that takes a little time to start up as it needs to import data.

I have an Alfresco container that depends upon the MySQL container.

At the moment, when I use fig, the Alfresco service inside the Alfresco container fails when it attempts to connect to the MySQL container... ostensibly because the MySQL service is not yet listening.

Is there a way to handle this kind of issue in Fig?

@d11wtq
Contributor
d11wtq commented Aug 4, 2014

At work we wrap our dependent services in a script that checks whether the link is up yet. I know one of my colleagues would be interested in this too! Personally I feel it's a container-level concern to wait for services to be available, but I may be wrong :)

@nubs
Contributor
nubs commented Aug 4, 2014

We do the same thing with wrapping. You can see an example here: https://github.com/dominionenterprises/tol-api-php/blob/master/tests/provisioning/set-env.sh

@bfirsh
Collaborator
bfirsh commented Aug 4, 2014

It'd be handy to have an entrypoint script that loops over all of the links and waits until they're working before starting the command passed to it.

This should be built into Docker itself, but the solution is a way off. A container shouldn't be considered started until the link it exposes has opened.

@dancrumb
dancrumb commented Aug 4, 2014

@bfirsh that's more than I was imagining, but would be excellent.

A container shouldn't be considered started until the link it exposes has opened.

I think that's exactly what people need.

For now, I'll be using a variation on https://github.com/aanand/docker-wait
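
For reference, the core of that approach is a small wrapper entrypoint that polls the linked port before handing off to the real command. A minimal sketch (not the actual docker-wait code; the variable names assume Docker's legacy link environment variables for a link called mysql, and an nc that supports -z):

#!/bin/sh
# Poll the linked MySQL port until it accepts TCP connections, then exec the real command.
until nc -z "$MYSQL_PORT_3306_TCP_ADDR" "$MYSQL_PORT_3306_TCP_PORT"; do
    echo "$(date) - waiting for mysql..."
    sleep 1
done
exec "$@"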

@silarsis
silarsis commented Aug 4, 2014

Yeah, I'd be interested in something like this - meant to post about it earlier.

The smallest-impact pattern I can think of that would fix this use case for us would be the following:

Add "wait" as a new key in fig.yml, with similar value semantics to link. Fig would treat this as a prerequisite and wait until this container has exited prior to carrying on.

So, my fig.yml would look something like:

db:
  image: tutum/mysql:5.6

initdb:
  build: /path/to/db
  link:
    - db:db
  command: /usr/local/bin/init_db

app:
  link:
    - db:db
  wait:
    - initdb

On running app, it will start up all the linked containers, then run the wait container, and only progress to the actual app container once the wait container (initdb) has exited. initdb would run a script that waits for the database to be available, then runs any initialisations/migrations/whatever, then exits.

That's my thoughts, anyway.

@dnephin
Member
dnephin commented Aug 5, 2014

(revised, see below)

@dsyer
dsyer commented Aug 14, 2014

+1 here too. It's not very appealing to have to do this in the commands themselves.

@jcalazan

+1 as well. Just ran into this issue. Great tool btw, makes my life so much easier!

@arruda
arruda commented Aug 16, 2014

+1 would be great to have this.

@prologic

+1 also. Recently ran into the same set of problems.

@chymian
chymian commented Aug 19, 2014

+1 also. Any statement from the Docker folks?

@codeitagile

I am writing wrapper scripts as entrypoints to synchronise at the moment. I'm not sure having a mechanism in fig is wise if you have other targets for your containers that perform orchestration a different way. It seems very application-specific to me, and as such the responsibility of the containers doing the work.

@prologic

After some thought and experimentation I do kind of agree with this.

As such, an application I'm building basically has a synchronous waitfor(host, port) function that lets me wait for the services the application depends on (either detected via the environment or configured explicitly via CLI options).

cheers
James


@shuron
Contributor
shuron commented Aug 31, 2014

Yes, some basic "depends on" is needed here...
So if you have 20 containers, you just want to run fig up and have everything start in the correct order...
However, it should also have a timeout option or other failure-catching mechanisms.

@ahknight

Another +1 here. I have Postgres taking longer than Django to start so the DB isn't there for the migration command without hackery.

@dnephin
Member
dnephin commented Oct 23, 2014

@ahknight interesting, why is migration running during run ?

Don't you want to actually run migrate during the build phase? That way you can start up fresh images much faster.

@ahknight

There's a larger startup script for the application in question, alas. For now, we're doing non-DB work first, using nc -w 1 in a loop to wait for the DB, then doing DB actions. It works, but it makes me feel dirty(er).

@dnephin
Member
dnephin commented Oct 23, 2014

I've had a lot of success doing this work during the fig build phase. I have one example of this with a django project (still a work in progress, though): https://github.com/dnephin/readthedocs.org/blob/fig-demo/dockerfiles/database/Dockerfile#L21

No need to poll for startup. Although I've done something similar with mysql, where I did have to poll for startup because the mysqld init script wasn't doing it already. This postgres init script seems to be much better.

@arruda
arruda commented Oct 24, 2014

Here is what I was thinking:

Using the idea from docker/docker#7445, could we implement a "wait_for_health_check" attribute in fig?
That way it would be a fig issue, not a Docker issue?

Is there any way of making fig check the TCP status on the linked container? If so, then I think this is the way to go. =)

@docteurklein

@dnephin can you explain a bit more what you're doing in your Dockerfiles to help with this?
Isn't the build phase unable to influence the runtime?

@dnephin
Member
dnephin commented Nov 10, 2014

@docteurklein I can. I fixed the link from above (https://github.com/dnephin/readthedocs.org/blob/fig-demo/dockerfiles/database/Dockerfile#L21)

The idea is that you do all the slower "setup" operations during the build, so you don't have to wait for anything during container startup. In the case of a database or search index, you would:

  1. start the service
  2. create the users, databases, tables, and fixture data
  3. shutdown the service

all as a single build step. Later when you fig up the database container it's ready to go basically immediately, and you also get to take advantage of the docker build cache for these slower operations.
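
To make that concrete, here is a rough sketch of the kind of build-time initialisation script a Dockerfile RUN step could invoke (file names and the postgres specifics are assumptions, not taken from the linked Dockerfile):

#!/bin/sh
# Run once at image build time so the initialised data directory is baked into a cached layer.
set -e
su postgres -c 'pg_ctl -D "$PGDATA" -w start'        # -w blocks until the server accepts connections
su postgres -c 'psql -f /build/schema.sql'
su postgres -c 'psql -f /build/fixtures.sql'
su postgres -c 'pg_ctl -D "$PGDATA" -m fast -w stop'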

@docteurklein

nice! thanks :)

@arruda
arruda commented Nov 11, 2014

@dnephin nice, hadn't thought of that.

@oskarhane

+1 This is definitely needed.
An ugly time delay hack would be enough in most cases, but a real solution would be welcome.

@dnephin
Member
dnephin commented Dec 5, 2014

Could you give an example of why/when it's needed?

@dacort
dacort commented Dec 5, 2014

In the use case I have, I have an Elasticsearch server and then an application server that's connecting to Elasticsearch. Elasticsearch takes a few seconds to spin up, so I can't simply do a fig up -d because the application server will fail immediately when connecting to the Elasticsearch server.

@ddossot
ddossot commented Dec 5, 2014

Say one container starts MySQL and the other starts an app that needs MySQL and it turns out the other app starts faster. We have transient fig up failures because of that.

@oskarhane

crane has a way around this by letting you create groups that can be started individually. So you can start the MySQL group, wait 5 secs, and then start the other stuff that depends on it.
It works on a small scale, but it's not a real solution.

@arruda
arruda commented Dec 6, 2014

@oskarhane I'm not sure this "wait 5 secs" helps; in some cases you might need to wait longer (or you just can't be sure it won't go over the 5 secs)... it isn't very safe to rely on a fixed delay.
Also, you would have to do this waiting and loading of the other group manually, and that's kind of lame; fig should do that for you =/

@aanand
Contributor
aanand commented Dec 6, 2014

@oskarhane, @dacort, @ddossot: Keep in mind that, in the real world, things crash and restart, network connections come and go, etc. Whether or not Fig introduces a convenience for waiting on a TCP socket, your containers should be resilient to connection failures. That way they'll work properly everywhere.

@ddossot
ddossot commented Dec 6, 2014

You are right, but until we fix all pre-existing apps to do things like gracefully recovering from the absence of their critical resources (like the DB) on start (which is a Great Thing™ but unfortunately seldom supported by frameworks), we should use fig start to start individual containers in a certain order, with delays, instead of fig up.

I can see a shell script coming to control fig to control docker 😉

@anentropic

I am OK with this not being built into fig, but some advice on best practices for waiting on readiness would be good.

I saw in some code linked from an earlier comment this was done:

while ! exec 6<>/dev/tcp/${MONGO_1_PORT_27017_TCP_ADDR}/${MONGO_1_PORT_27017_TCP_PORT}; do
    echo "$(date) - still trying to connect to mongo at ${TESTING_MONGO_URL}"
    sleep 1
done

In my case there is no /dev/tcp path though; maybe it's a different Linux distro(?) - I'm on Ubuntu.

I found instead this method which seems to work ok:

until nc -z postgres 5432; do
    echo "$(date) - waiting for postgres..."
    sleep 1
done

This seems to work, but I don't know enough about such things to know if it's robust... does anyone know if there's a possible race condition between the port showing up to nc and the postgres server really being able to accept commands?

I'd be happier if it was possible to invert the check - instead of polling from the dependent containers, is it possible instead to send a signal from the target (ie postgres server) container to all the dependents?

Maybe it's a silly idea, anyone have any thoughts?

@aanand
Contributor
aanand commented Dec 29, 2014

@anentropic Docker links are one-way, so polling from the downstream container is currently the only way to do it.

does anyone know if there's a possible race condition between the port showing up to nc and the postgres server really being able to accept commands?

There's no way to know in the general case - it might be true for postgres, it might be false for other services - which is another argument for not doing it in Fig.

@mindnuts
mindnuts commented Jan 8, 2015

@aanand I tried using your docker/wait image approach, but I am not sure what is happening. Basically I have this "orientdb" container which a lot of other NodeJS app containers link to. This orientdb container takes some time to start listening on the TCP port, and this makes the other containers get a "Connection refused" error.

I hoped that by linking the wait container to orientdb I would not see this error, but unfortunately I am still getting it randomly. Here is my setup (Docker version 1.4.1, fig 1.0.1 on an Ubuntu 14.04 box):

orientdb:
    build: ./Docker/orientdb
    ports:
        -   "2424:2424"
        -   "2480:2480"
wait:
    build: ./Docker/wait
    links:
        - orientdb:orientdb
....
core:
    build:  ./Docker/core
    ports:
        -   "3000:3000"
    links:
        -   orientdb:orientdb
        -   nsqd:nsqd

Any help is appreciated. Thanks.

@aanand
Contributor
aanand commented Jan 8, 2015

@mindnuts the wait image is more of a demonstration; it's not suitable for use in a fig.yml. You should use the same technique (repeated polling) in your core container to wait for the orientdb container to start before kicking off the main process.

@MrMMorris

+1, just started running into this as I am pulling custom-built images instead of building them in the fig.yml. Node app failing because mongodb is not ready yet...

@kennu
kennu commented Jan 17, 2015

I just spent hours debugging why MySQL was reachable when starting WordPress manually with Docker, and why it was offline when starting with Fig. Only now did I realize that Fig always restarts the MySQL container whenever I start the application, so the WordPress entrypoint.sh dies because it can't yet connect to MySQL.

I added my own overridden entrypoint.sh that waits for 5 seconds before executing the real entrypoint.sh. But clearly this is a use case that needs a general solution, if it's supposed to be easy to launch a MySQL+WordPress container combination with Docker/Fig.

@dnephin
Member
dnephin commented Jan 18, 2015

so the WordPress entrypoint.sh dies because it can't yet connect to MySQL.

I think this is an issue with the WordPress container.

While I was initially a fan of this idea, after reading docker/docker#7445 (comment), I think such a feature would be the wrong approach, and actually encourages bad practices.

There seem to be two cases which this issue aims to address:

A dependency service needs to be available to perform some initialization.

Any container initialization should really be done during build. That way it is cached, and the work doesn't need to be repeated by every user of the image.

A dependency service needs to be available so that a connection can be opened

The application should really be resilient to connection failures and retry the connection.

@kennu
kennu commented Jan 19, 2015

I suppose the root of the problem is that there are no ground rules as to whose responsibility it is to wait for services to become ready. But even if there were, I think it's a bit unrealistic to expect that developers would add database connection retrying to every single initialization script. Such scripts are often needed to prepare empty data volumes that have just been mounted (e.g. create the database).

The problem would actually be much less obtrusive if Fig didn't always restart linked containers (i.e. the database server) when restarting the application container. I don't really know why it does that.

@aanand
Contributor
aanand commented Jan 19, 2015

The problem would actually be much less obtrusive if Fig didn't always restart linked containers (i.e. the database server) when restarting the application container. I don't really know why it does that.

Actually it doesn't just restart containers, it destroys and recreates them, because it's the simplest way to make sure changes to fig.yml are picked up. We should eventually implement a smarter solution that can compare "current config" with "desired config" and only recreate what has changed.

Getting back to the original issue, I really don't think it's unrealistic to expect containers to have connection retry logic - it's fundamental to designing a distributed system that works. If different scripts need to share it, it should be factored out into an executable (or language-specific module if you're not using shell), so each script can just invoke waitfor db at the top.
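
A minimal sketch of such a shared waitfor executable (the name, arguments, and nc-based check are illustrative, not a prescribed implementation):

#!/bin/sh
# waitfor: block until host:port accepts TCP connections, or give up after a timeout.
# Usage: waitfor HOST PORT [TIMEOUT_SECONDS]
host="$1"; port="$2"; timeout="${3:-60}"
until nc -z "$host" "$port" 2>/dev/null; do
    timeout=$((timeout - 1))
    if [ "$timeout" -le 0 ]; then
        echo "waitfor: timed out waiting for $host:$port" >&2
        exit 1
    fi
    sleep 1
done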

@docteurklein

@kennu what about --no-recreate ? /cc @aanand

@kennu
kennu commented Jan 19, 2015

@aanand I meant the unrealism comment from the point of view that the Docker Hub is already full of published images that probably don't handle connection retrying in their initialization scripts, and that it would be quite an undertaking to get everybody to add it. But I guess it could be done if Docker Inc published some kind of official guidelines/requirements.

Personally I'd rather keep containers/images simple though and let the underlying system worry about resolving dependencies. In fact, Docker's restart policy might already solve everything (if the application container fails to connect to the database, it will restart and try again until the database is available).

But relying on the restart policy means that it should be enabled by default, or otherwise people spend hours debugging the problem (like I just did). E.g. Kubernetes defaults to RestartPolicyAlways for pods.
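
For illustration, the restart-policy workaround looks roughly like this with plain docker run (image names are placeholders; whether your Fig version exposes an equivalent option is a separate question):

docker run -d --name db mysql:5.6
# If the app exits because the database isn't accepting connections yet,
# Docker simply starts it again until it eventually succeeds.
docker run -d --name app --link db:db --restart=always example/app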

@MrMMorris

any progress on this? I would like to echo that expecting all docker images to change and the entire community to implement connection-retry practices is not reasonable. Fig is a Docker orchestration tool and the problem lies in the order it does things, so the change needs to be made in Fig, not Docker or the community.

@dnephin
Member
dnephin commented Jan 24, 2015

expecting all docker images to change and the entire community to implement connection-retry practices is not reasonable

It's not that an application should need to retry because of docker or fig. Applications should be resilient to dropped connections because the network is not reliable. Any application should already be built this way.

I personally haven't had to implement retries in any of my containers, and I also haven't needed any delay or waiting on startup. I believe most cases of this problem fall into these two categories (my use of "retry" is probably not great here, I meant more that it would re-establish a connection if the connection was closed, not necessarily poll for some period attempting multiple times).

If you make sure that all initialization happens during the "build" phase, and that connections are re-established on the next request you won't need to retry (or wait on other containers to start). If connections are opened lazily (when the first request is made), instead of eagerly (during startup), I suspect you won't need to retry at all.

the problem lies in the order [fig] does things

I don't see any mention of that in this discussion so far. Fig orders startup based on the links specified in the config, so it should always start containers in the right order. Can you provide a test case where the order is incorrect?

@thaJeztah
Member

I have to agree with @dnephin here. Sure, it would be convenient if compose/fig was able to do some magic and check availability of services, however, what would the expected behavior be if a service doesn't respond? That really depends on the requirements of your application/stack. In some cases, the entire stack should be destroyed and replaced with a new one, in other cases a failover stack should be used. Many other scenarios can be thought of.

Compose/Fig cannot make these decisions, and monitoring services should be the responsibility of the applications running inside the container.

@kennu
kennu commented Jan 25, 2015

I would like to suggest that @dnephin has merely been lucky. If you fork two processes in parallel, one of which will connect to a port that the other will listen to, you are essentially introducing a race condition; a lottery to see which process happens to initialize faster.

I would also like to repeat the WordPress initialization example: It runs a startup shell script that creates a new database if the MySQL container doesn't yet have it (this can't be done when building the Docker image, since it's dependent on the externally mounted data volume). Such a script becomes significantly more complex if it has to distinguish generic database errors from "database is not yet ready" errors and implement some sane retry logic within the shell script. I consider it highly likely that the author of the image will never actually test the startup script against the said race condition.

Still, Docker's built-in restart policy provides a workaround for this, if you're ready to accept that containers sporadically fail to start and regularly print errors in logs. (And if you remember to turn it on.)

Personally, I would make Things Just Work, by making Fig autodetect which container ports are exposed to a linked container, ping them before starting the linked container (with a sane timeout), and ultimately provide a configuration setting to override/disable this functionality.

@thaJeztah
Member

this can't be done when building the Docker image, since it's dependent on the externally mounted data volume

True. An approach here is to start just the database container once (if needed, with a different entrypoint/command), to initialise the database, or use a data-only container for the database, created from the same image as the database container itself.

Such a script becomes significantly more complex if it has to distinguish generic database errors from "database is not yet ready" errors

Compose/Fig will run into the same issue there; How to check if MySQL is up, and accepting connections? (and PostgreSQL, and (insert your service here)). Also, where should the "ping" be executed from? Inside the container you're starting, from the host?

As far as I can tell, the official WordPress image includes a check to see if MySQL is accepting connections in the docker-entrypoint.sh

@kennu
kennu commented Jan 25, 2015

@thaJeztah "Add some simple retry logic in PHP for MySQL connection errors" authored by tianon 2 days ago - Nice. :-) Who knows, maybe this will become a standard approach after all, but I still have my doubts, especially about this kind of retry implementations actually having being tested by all image authors.

About the port pinging - I can't say offhand what the optimal implementation would be. I guess maybe simple connection checking from a temporary linked container and retrying while getting ECONNREFUSED. Whatever solves 80% (or possibly 99%) of the problems, so users don't have to solve them by themselves again and again every time.

@thaJeztah
Member

@kennu Ah! Thanks, wasn't aware it was just added recently, just checked the script now because of this discussion.

To be clear, I understand the problems you're having, but I'm not sure Compose/Fig would be able to solve them in a clean way that works for everyone (and reliably). I understand many images on the registry don't have "safeguards" in place to handle these issues, but I doubt it's Compose/Fig's responsibility to fix that.

@thaJeztah
Member

Having said the above; I do think it would be a good thing to document this in the Dockerfile best practices section.

People should be made aware of this and some examples should be added to illustrate how to handle service "outage". Including a link to the Wikipedia article that @dnephin mentioned (and possibly other sources) for reference.

@soupdiver

I ran into the same problem and like this idea from @kennu

Personally, I would make Things Just Work, by making Fig autodetect which container ports are exposed to a linked container, ping them before starting the linked container (with a sane timeout), and ultimately provide a configuration setting to override/disable this functionality.

I think this would solve a lot of typical use cases, like mine when depending on the official mongodb container.

@MrMMorris

I agree with @soupdiver. I am also having trouble in conjunction with a mongo container, and although I have it working with a start.sh script, the script is not very dynamic and adds another file I need to keep in my repo (I would like to just have a Dockerfile and docker-compose.yml in my node repo). It would be nice if there were some way to just Make It Work, but I think something simple like a wait timer won't cut it in most cases.

@schmunk42
Contributor

IMO pinging is not enough, because the basic network connection may be available while the service itself is still not ready.
This is the case with the MySQL image, for example; using curl or telnet for the connection check on the exposed ports would be safer, although I don't know whether even that would be enough. Most containers don't have these tools installed by default, though.

Could docker or fig handle these checks?

@thaJeztah
Member

Could docker or fig handle these checks?

In short: no. For various reasons;

  • Performing a "ping" from within a container would mean running a second process. Fig/Compose cannot automatically start such a process, and I don't think you'd want Fig/Compose to modify your container by installing software (such as curl or telnet) in it.
  • (As I mentioned in a previous comment), each service requires a different way to check whether it is accepting connections / ready for use. Some services may need credentials or certificates to establish a connection. Fig/Compose cannot automatically invent how to do that.
@schmunk42
Contributor

and I don't think you'd want Fig/Compose to modify your container by installing software (such as curl or telnet) in it.

No, for sure not.

Fig/Compose cannot automatically invent how to do that.

Not invent. I was thinking more about an instruction for fig or docker on how to check it, e.g.:

web:
    image: nginx
    link: db
db:
    is_available: "curl DB_TCP_ADDR:DB_TCP_PORT"

The telnet command would be executed on the Docker host, not in the container.
But I am just thinking out loud; I know that this is not the perfect solution. But the current way of using custom check scripts for the containers could be improved.

@thaJeztah
Member

The telnet command would be executed on the Docker host, not in the container.

Then curl or <name a tool that's needed> would have to be installed on the host. This could even have huge security issues (e.g. someone wants to be funny and uses is_available: "rm -rf /"). Apart from that, being able to access the database from the host is no guarantee that it's also accessible from inside the container.

But I am just thinking loud, ...

I know, and I appreciate it. I just don't think there's a reliable way to automate this that would serve most use cases. In many cases you'd end up with something complex (take, for example, the curl example: how long should it try to connect? Retry?). Such complexity is better moved inside the container, which would also be useful if the container were started with plain Docker, not Fig/Compose.

@schmunk42
Contributor

@thaJeztah I totally agree with you. And it's very likely that there will be no 100% solution.

@silarsis
silarsis commented Feb 9, 2015

I'm going to repeat a suggestion I made earlier: it would be sufficient for me if I could state in the fig.yml "wait for this container to exit before running this other container".

This would allow me to craft a container that knows how to wait for all its dependencies - check ports, initialise databases, whatever - and would require fig to know as little as possible.

I would see it configured as something like:

app:
  links:
    - db:db
  prereqs:
    - runthisfirst

runthisfirst:
  links:
    - db:db

runthisfirst has a link that means the database starts up so it can check access. app will only run once runthisfirst has exited (bonus points if runthisfirst has to exit successfully).

Is this feasible as an answer?


@jgeiger
jgeiger commented Feb 27, 2015

I've just tried migrating my shell script launchers and ran into this issue. It would be nice even just to have a simple sleep/wait key that sleeps for that number of seconds before launching the next container.

db:
  image: tutum/mysql:5.6
  sleep: 10
app:
  link:
    - db:db
@prologic

I really don't like this for a number of reasons.

a) I think it's the wrong place for this
b) How long do you sleep for?
c) What if the timeout is not long enough?

Aside from the obvious issues, I really don't think infrastructure should care about what the application is, and vice versa. IMHO the app should be written to be more tolerant and/or smarter about its own requirements.

That being said, existing and legacy applications will need something -- but it should probably be more along the lines of:

a docker-compose.yml:

db:
  image: tutum/mysql:5.6
app:
  wait: db
  link:
    - db:db

Where wait waits for the "exposed" services on db to become available.

The problem is: how do you determine that?

In the simplest cases, you wait until you can successfully open a TCP or UDP connection to the exposed services.

@mattwallington

This might be overkill for this problem but what would be a nice solution is if docker provided an event triggering system where you could initiate a trigger from one container that resulted in some sort of callback in another container. In the case of waiting on importing data into a MySQL database before starting another service, just monitoring whether the port was available isn't enough.

Having an entrypoint script set an alert to Docker from inside the container (setting a pre-defined environment variable, for example) that triggered an event in another container (perhaps setting the same synchronized environment variable) would enable scripts on both sides to know when certain tasks are complete.

Of course we could set up our own socket server or other means but that's tedious to solve a container orchestration issue.

@n3llyb0y

@aanand I almost have something working using your wait approach as the starting point. However, there is something else happening between docker-compose run and docker run where the former appears to hang whilst the latter works a charm.

example docker-compose.yml:

db:
  image: postgres
  ports:
    - "5432"
es:
  image: dockerfile/elasticsearch
  ports:
    - "9200"
wait:
  image: n3llyb0y/wait
  environment:
    PORTS: "5432 9200"
  links:
    - es
    - db

then using...

docker-compose run wait

However, this is not to be. The linked services start and it looks like we are about to wait, only for it to choke (at least within my VirtualBox env; I get to the nc loop and we get a single dot, then... nothing).

However, with the linked services running I can use this method (which is essentially what I have been doing for our CI builds)

docker run -e PORTS="5432 9200" --links service_db_1:wait1 --links service_es_1:wait2 n3llyb0y/wait

It feels like docker-compose run should work in the same way. The difference is that when using docker-compose run with the detach flag -d you get no wait benefit, as the wait container goes to the background, and I think (at this moment in time) that not using the flag causes the wait to choke on the other non-backgrounded services. I am going to take a closer look.

@n3llyb0y

After a bit of trial and error it seems the above approach does work! It's just that the busybox base doesn't have a netcat util that works very well. My modified version of @aanand's wait utility does work against docker-compose 1.1.0 when using docker-compose run <util label> instead of docker-compose up. Example usage is in the link.

Not sure if it can handle chaining situations as per the original question though. Probably not.

Let me know what you think.

@adrianhurt

This is a very interesting issue. I think it would be really interesting to have a way for one container to wait until another one is ready. But as everybody says, what does "ready" mean? In my case I have a container for MySQL, another one that manages its backups and is also in charge of importing an initial database, and then the containers for each app that needs the database. It's obvious that waiting for the ports to be exposed is not enough. First the mysql container must be started, and then the rest should wait until the mysql service is ready to use, not before. To get that, I needed to implement a simple script to be executed on reboot that uses the docker exec functionality. Basically, the pseudo-code would be like:

run mysql
waitUntil "docker exec -t mysql mysql -u root -prootpass database -e \"show tables\""
run mysql-backup
waitUntil "docker exec -t mysql mysql -u root -prootpass database -e \"describe my_table\""
run web1
waitUntil "dexec web1 curl localhost:9000 | grep '<h1>Home</h1>'"
run web2
waitUntil "dexec web2 curl localhost:9000 | grep '<h1>Home</h1>'"
run nginx

Where the waitUntil function has a loop with a timeout that evals the docker exec … command and checks whether the exit code is 0.

With that I ensure that every container waits until its dependencies are ready to use.
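
A possible shape for that waitUntil helper, as a hedged shell sketch (illustrative only, not the actual script used above):

waitUntil() {
    # Retry the given command every second until it exits 0, or give up after 60 attempts.
    attempts=60
    until eval "$1" > /dev/null 2>&1; do
        attempts=$((attempts - 1))
        if [ "$attempts" -le 0 ]; then
            echo "waitUntil: giving up on: $1" >&2
            return 1
        fi
        sleep 1
    done
}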

So I think it could be an option to integrate into the compose utility. Maybe something like this, where wait_until declares a list of other dependencies (containers) and waits for each one until it responds OK to the corresponding command (or maybe with an optional pattern or regex to check whether the result matches something you expect, even though using the grep command could be enough).

mysql:
  image: mysql
  ...
mysql-backup:
  links:
   - mysql
  wait_until:
   - mysql: mysql -u root -prootpass database -e "show tables"
  ...
web1:
  links:
   - mysql
  wait_until:
   - mysql: mysql -u root -prootpass database -e "describe my_table"
  ...
web2:
  links:
   - mysql
  wait_until:
   - mysql: mysql -u root -prootpass database -e "describe my_table"
  ...
nginx:
  links:
   - web1
   - web2
  wait_until:
   - web1: curl localhost:9000 | grep '<h1>Home</h1>'
   - web2: curl localhost:9000 | grep '<h1>Home</h1>'
  ...
@robsonpeixoto

Why not a simple wait for the port, like this?
http://docs.azk.io/en/azkfilejs/wait.html#

@mattwallington

@robsonpeixoto: Waiting for the port isn't sufficient for a lot of use cases. For example, let's say you are seeding a database with data on creation and don't want the web server to start and connect to it until the data operation has completed. The port will be open the whole time so that wouldn't block the web server from starting.

@mattwallington

Something like AWS CloudFormation's WaitCondition would be nice. http://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/using-cfn-waitcondition.html

@deanpcmad

+1 I'm having the same issue when using Docker for testing my Rails apps which depend on MySQL

@AwokeKnowing

+1 I have this issue too. I like @adrianhurt's idea, where you actually supply the condition to be evaluated to determine whether the wait is complete. That way you still have a nice declarative yml, and you don't have to have an arbitrary definition of "ready".

@rkettelerij

+1

@anentropic

I've had this tab open for a while: http://crosbymichael.com/docker-events.html ...seems relevant

@tuscland
tuscland commented May 4, 2015

+1

@kkamkou
kkamkou commented May 4, 2015

+1 for simple timeout

@ryneeverett

+1 for a ready condition


@fdellavedova

+1

@schmunk42
Contributor

I have been solving this very reliably at the application level for a while now, as was recommended in this thread.

Just to give you an idea how this can be implemented for MySQL + PHP here's my code.

From igorw/retry :)

Since the network is reliable, things should always work. Am I right? For those cases when they don't, there is retry.

@rfink
rfink commented May 20, 2015

+1

@aanand
Contributor
aanand commented May 20, 2015

@schmunk42 Nice stuff - I like that it's a good example of both establishing the connection and performing an idempotent database setup operation.

@thaJeztah
Member

Might be good to create a (some) basic example(s) for inclusion in the docs, for different cases, e.g. NodeJS, Ruby, PHP.

@yeasy
yeasy commented May 21, 2015

+1, it should at least provide some option to add a delay before the container is considered successfully started.

@Silex
Silex commented Jun 3, 2015

+1

@robsonpeixoto

How do you solve this when the services you're trying to connect to aren't your own code?
For example, if I have a service Service and the database InfluxDB: Service requires InfluxDB, and InfluxDB has a slow startup.

How can docker-compose wait for InfluxDB to be ready?

If the code is mine, I can solve it by adding a retry. But for a third-party app I can't change the code.

@artem-sidorenko

@robsonpeixoto there are some examples in this ticket with netcat or similar approaches. You can take a look at my MySQL example in another ticket: docker/docker#7445 (comment)

@adrianhurt

That's the reason I think each container should have the optional ability to indicate its own readiness. For a DB, for example, I want to wait until the service is completely ready, not just until the process is created. I solve this with customized checks using docker exec, checking whether it can resolve a simple query, for example.

An optional flag for docker run to indicate an internal check command would be great, so you could later link to it from another container using a special flag for the link.

Something like:

$ sudo docker run -d --name db training/postgres --readiness-check /bin/sh -c "is_ready.sh"
$ sudo docker run -d -P --name web --link db:db --wait-for-readiness db training/webapp python app.py

Where is_ready.sh is a simple boolean test in charge of deciding when the container is considered ready.
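
For the postgres example above, a hypothetical is_ready.sh could be as small as the following (an assumption; the proposal deliberately leaves the check up to the image author):

#!/bin/sh
# Exit 0 only once the server answers a trivial query, i.e. is actually usable.
psql -U postgres -c 'SELECT 1;' > /dev/null 2>&1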

@ringanta

+1

@zheli
zheli commented Jun 26, 2015

@schmunk42 nice quote!

@realulim

To me it's not a good idea to hard-code an arbitrary collection of "availability checks". There are numerous situations that are specific to one kind of deployment, and you can never cover them all. Just as an example, in my multi-container app I need to wait for a certain log message to appear in a certain log file - only then will the container service be ready.
Instead, what's needed is an SPI that I can implement. If Docker provides some example implementations for the most frequent use cases (e.g. TCP connect), that's fine. But there needs to be a way for me to plug in my own functionality and have Docker call it.
Docker Compose is pretty much useless to me as a whole product if I can't get my containers up and running dependably. So a stable and uniform "container service readiness SPI" is needed. And "ready" should not be a boolean, as there are possibly more levels of readiness (such as "now you can read" and "now you can write").

@gittycat

@realulim Good writeup. I fully agree with the idea of letting us define what a service's "ready" state means via plugins. I also think it's a good idea to have a default plugin that only checks that a service is listening on an HTTP/TCP connection. That would cover the majority of cases right there.

@kulbida
kulbida commented Feb 27, 2016

This is what I came up with, in entrypoint file;

until netcat -z -w 2 database 5432; do sleep 1; done
# do the job here, database host on port 5432 accepts connections
@pgporada

@kulbida ,
I do something very similar with MySQL. "database" in this case is a link in a compose file.

if [[ "$APP_ENV" == "local" ]]; then
    while ! mysqladmin ping -h database --silent; do
        sleep 1
    done
    # Load in the schema or whatever else is needed here.
fi
@mglasgow42

There have been some comments in this thread which claim that startup ordering is only a subset of application level error recovery, which your application should be handling anyway. I would like to offer up one example to illustrate where this might not always be the case. Consider if some services depend on a clustered database, and whenever a quorum is lost due to a crash etc, you do not want to automatically retry from the app. This could be the case for example if database recovery requires some manual steps, and you need services to remain unambiguously down until those steps are performed.

Now the app's error handling logic may be quite different from the startup logic:

  • If the db is down because we're just starting up, wait for it to become available.
  • If the db is down because it crashed, log a critical error and die.

It may not be the most common scenario, but you do see this pattern occasionally. In this case, clustering is used to solve the "network is unreliable" problem in the general case, which changes some of the expectations around which error conditions should be retried in the app. Cluster crashes can be rare enough, and automatically restarting them can be risky enough, that manually restarting services is preferred to retrying in the application. I suspect there are other scenarios as well which might challenge assumptions around when to retry.

More generally, I'm claiming that startup ordering and error handling are not always equivalent, and that it's appropriate for a framework to provide (optional) features to manage startup order. I do wonder if this belongs in docker-engine, though, rather than compose. It could be needed anytime docker starts up, regardless of whether compose is used.

@dnephin
Member
dnephin commented Mar 14, 2016

There is a discussion starting on the docker engine repo in proposal docker/docker#21142 to add support for health checking. Once this support is available it will be possible for Compose to provide a way to configure it, and use it for a delayed start up.


@konobi
konobi commented Mar 25, 2016

How about using the filesystem to check for the existence of a file?

ready_on: /tmp/this_container_is_up_and_ready

That way it's up to the container developer to decide when things are UP, but compose can wait until the container declares itself ready. It's an explicit convention, but it could easily be added as an additional layer to images that don't have that behaviour.
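
As a concrete illustration of that convention (hypothetical, since ready_on is only a proposal), a MySQL-flavoured entrypoint might look something like this:

#!/bin/sh
# Start the real service in the background, wait until it is genuinely usable,
# then touch the agreed-upon ready file so an external watcher can proceed.
mysqld_safe &
until mysqladmin ping --silent; do sleep 1; done
touch /tmp/this_container_is_up_and_ready
wait    # keep the container alive for as long as mysqld runs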


@alexch
alexch commented Apr 13, 2016

Built-in support for health checks will be good; in the meantime here's the hack I got working in my local docker-compose setup:

    nginx:
        image: nginx:latest
        command: /bin/bash -c "sleep 2 && echo starting && nginx -g 'daemon off;'"
        ...

(In production, my app proxies to a few already-running upstream servers using proxy_pass; in local dev and test, I start docker instances of these, and nginx needs to wait a bit for them to start, else it crashes and dies. The daemon off thing keeps nginx in a single process, else docker will stop the container as soon as the parent process spawns its daemon child.)


@1ma
1ma commented Apr 20, 2016 edited

Just to add my two cents: if you happen to be using the ANT build tool, it comes with built-in support for delaying execution until a certain socket is open.

Our Jenkins CI server spins up the project containers with Docker Compose and then runs ANT from within the main container, like this:

docker-compose up -d
docker exec -it projectx-fpm-jenkins ant -f /var/www/projectX/build.xml

This is the relevant piece of configuration from the docker-compose.yml file. Note that, as discussed above, making fpm depend on mysql is not enough to guarantee that the MySQL service will be ready when it is actually needed.

version: '2'
services:
  nginx:
    build: ./docker/nginx
    depends_on:
      - fpm
  fpm:
    build: ./docker/fpm
    depends_on:
      - mysql
  mysql:
    image: mysql:5.7
    environment:
      - MYSQL_ROOT_PASSWORD=projectx
      - MYSQL_DATABASE=projectx

But you can wait for it during the ANT task:

<!-- other targets... -->

<target name="setup db">
    <!-- wait until the 3306 TCP port in the "mysql" host is open -->
    <waitfor>
        <socket server="mysql" port="3306"/>
    </waitfor>

    <exec executable="php">
        <arg value="${consoledir}/console"/>
        <arg value="doctrine:database:create"/>
        <arg value="--no-interaction"/>
    </exec>
</target>
@skorokithakis

@kulbida That did the trick, thanks. Something a bit faster:

while ! nc -w 1 -z db 5432; do sleep 0.1; done
@syamsathyan
syamsathyan commented May 5, 2016 edited

depends_on might solve the issue.
From docker-compose documentation.
Express dependency between services, which has two effects:

  1. docker-compose up will start services in dependency order. In the following example, db and redis will be started before web.
  2. docker-compose up SERVICE will automatically include SERVICE's dependencies. In the following example, docker-compose up web will also create and start db and redis.

version: '2'
services:
  web:
    build: .
    depends_on:
      - db
      - redis
  redis:
    image: redis
  db:
    image: postgres

@alexch: in a performance test at a customer site (a micro-service routed via nginx+), the dockerized nginx test showed a dip in load from very high to near zero, repeating every 1-2 minutes. We finally decided to go with non-dockerized nginx running as a VM (just because of the huge performance difference); maybe it's a network driver plugin / libNetwork issue.

@nottrobin

@syamsathyan depends_on doesn't appear to help.

@nottrobin

@skorokithakis, @kulbida this is a nice solution. Unfortunately, netcat isn't available by default in any of the services that I need to connect to my database (including postgres). Do you know of any alternative method?

@skorokithakis

@nottrobin I'm afraid not, I just installed it in my image :/

@syamsathyan

@nottrobin my team is working on this, will let you know in a day or two!

@typekpb
typekpb commented Jun 9, 2016 edited

For those with a recent bash, there is a netcat-free solution (inspired by http://stackoverflow.com/a/19866239/1581069):

while ! timeout 1 bash -c 'cat < /dev/null > /dev/tcp/db/5432'; do sleep 0.1; done

or less verbose version:

while ! timeout 1 bash -c 'cat < /dev/null > /dev/tcp/db/5432' >/dev/null 2>/dev/null; do sleep 0.1; done
@nottrobin

@typekpb that works perfectly. Thanks!

@CpuID
CpuID commented Jun 9, 2016

Now that HEALTHCHECK support is merged upstream as per docker/docker#23218 - this can be considered to determine when a container is healthy prior to starting the next in the order. Half of the puzzle solved :)

@Soullivaneuh

Now that HEALTHCHECK support is merged upstream as per docker/docker#23218 - this can be considered to determine when a container is healthy prior to starting the next in the order. Half of the puzzle solved :)

Looks good. How to implement it on docker-compose.yml?

@CpuID
CpuID commented Jun 10, 2016

Looks good. How to implement it on docker-compose.yml?

The other piece of the puzzle will be having docker-compose watch for healthy containers, and use something like the depends_on syntax mentioned further up in this issue. Will require patches to docker-compose to get things working.

Also note that the health check feature in Docker is currently unreleased, so will probably need to align with a Docker/Docker Compose release cycle.
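
Once 1.12 is out, the health-check options can also be attached to a single container from the CLI; a sketch (the check command and intervals are arbitrary examples):

docker run -d --name db \
    --health-cmd='mysqladmin ping --silent' \
    --health-interval=2s \
    --health-timeout=2s \
    --health-retries=30 \
    mysql:5.7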

@DmitryEfimenko

I wrote a js library that has a method .waitForPort(). Just as mentioned before, this might not work for all situations, but could do just fine for the majority of use cases.
See my blog.

@aanand
Contributor
aanand commented Jun 13, 2016

The HEALTHCHECK merge is great news.

In the meantime, this document describes the problem and some solutions.

@1ma
1ma commented Jun 28, 2016 edited

@pablofmorales Nope, because depends_on just checks that the container is up.

Some daemons need some extra time to bootstrap themselves and start listening on their assigned ports and addresses, most notably MySQL.

@konobi
konobi commented Jun 29, 2016

I still think a "READY_ON" declaration is the best option overall. It leaves the decision about when something is ready to the container itself, regardless of image; it's explicitly opted into, and the resource-path (within container) functionality in the Docker Remote API ensures minimal changes are needed.

The behaviour of when a container is "up" is the only effect this should have. It would only report as "up" when the READY_ON file exists.

I think this is 90% of the behaviour that everyone's been discussing. I think "healthcheck" here is conflating two different events and trying to cram them into one. One is "ready", for the chain of events when spinning up infrastructure; the other is "health", so that infrastructure can be kept up.

"ready" is totally an appropriate place for docker to be helping out. As for "health", it's so varied across systems that I think it's up to the container to deal with that.

For a better alternative to healthcheck, you might want to look at something like containerpilot, which covers not just health, but service discovery and monitoring too. https://github.com/joyent/containerpilot

@skorokithakis

Yes, this is an accurate and important distinction. However, how will containers write that file without images becoming significantly more complicated? It seems to me that it would require a wrapper script for every single container that wants to use this.

@konobi
konobi commented Jun 29, 2016

Well, you'd have to kick off a script to initialize the instance anyway... the last thing that script needs to do is touch a file. To me, that seems much easier than attempting to run an exec on a remote machine to do a health check. At least with a touch file, it can be watched, etc., entirely and passively via the API, without needing to enter the context of the container.

@skorokithakis

I agree, but many containers don't use a script, they just install a service like Postgres or Redis and let it start up without watching it.

@pablofmorales

In my case, I'm using the Kong API Gateway.

Before running the kong container, I just check whether Cassandra is working with this script:

while true; do
    CHECK=`kong-database/check`
    if [[ $CHECK =~ "system.dateof" ]]; then
        break
    fi
    sleep 1;
done

The check file contains this:

#!/bin/bash
docker cp cassandra-checker kong-database:/root/
docker exec -i kong-database cqlsh -f /root/cassandra-checker

cassandra-checker is just a simple query

SELECT dateof(now()) FROM system.local ;
@konobi
konobi commented Jun 29, 2016

Sure, but the alternative is a healthcheck, which requires a script that you'd have to write anyway, so there's no difference in overhead. It's also an explicit opt-in, which means you're stating that you want this behaviour. As for something that doesn't run a script, you could always have the ready_on path check for a pid file or a unix socket, which wouldn't require a script.

@skorokithakis

That's true, you're right.

@mglasgow42

Checking for the existence of a file may be fine for a lot of cases, but forcing containers to use a startup script when they wouldn't otherwise need one is a nuisance. Why can't there also be checks for other very simple conditions? Especially useful would be waiting until the process is listening on a particular TCP port.

@konobi
konobi commented Jun 30, 2016

This idea is opt-in, so there's no forcing of anything. In fact, you're being explicit in saying what should be expected.

A TCP port listening may not be sufficient to tell when a container has been initialized, as there may be a bunch of setup data that still needs to run. Hell, if you connect to a postgres container too quickly, even over TCP, you'll get an error stating that the db isn't ready yet.

@mglasgow42

If I understand you correctly, it's "opt-in, or else you can't use this feature". Ergo, if I need this feature and my app doesn't use a pid file, I'm forced to use a startup script.

For MySQL (the OP's case), once it's listening, it's ready. They go to a lot of trouble to ensure that's true, probably for cases much like this one. My take is that there is probably a short list of conditions that could be enumerated such that you could "opt-in" configuring a ready check against any of those conditions. I see no reason it has to be done one and only one way.

@konobi
konobi commented Jun 30, 2016

For mysql, once it's listening, it's not necessarily ready. In the simple one-node case it'll be ready, but if you have more than one node, then it certainly won't be ready yet. I understand what you mean by "one and only one way", but I think as a base abstraction it's just perfect. I see it more as a spot where you can apply whatever tooling you want. Heck, your script could even communicate with external services and have them verify the container, in which case your external services could signal your container agent to write the file. Flexibility ftw.

If you attempt anything in this list of "conditions", there will ALWAYS be a case where it doesn't work. However, touching a file will always work, since the image knows when it believes it's ready (oh, I have to wait on other hosts, I need files to be downloaded, I need to make sure $external_service is also available, I spun up properly but for some reason I don't have the correct permissions to the database, why is this image read-only... etc., etc.).

These sorts of scripts already exist all over the place... hell, it's already been necessary to write these scripts because we haven't had functionality like this before. So dropping in a script like this is a minimal change, since it's likely a script already exists.

@konobi
konobi commented Jun 30, 2016

Another likely case is that you'd have something like chef or ansible run against that host and then write the file.

@Joshfindit

If it's a question of a Docker-side check, then something like:

UPCHECK --port=7474 --interval=0.5s --response="Please log in"

For the record I think the file solution has a lot of merit, but it also introduces complexity.
80% of the time, verifying the tcp response would work just fine.

@konobi
konobi commented Jun 30, 2016

well... i suppose:

UPCHECK --file=/tmp/container_is_ready --interval=0.5s --timeout=2m

Is just the same.

@dansteen
dansteen commented Jun 30, 2016 edited

I'm actually working on a re-implementation of docker-compose that adds functionality to wait for specific conditions. It uses libcompose (so I don't have to rebuild the docker interaction) and adds a bunch of config commands for this. Check it out here: https://github.com/dansteen/controlled-compose

Note, that the code is finished, but I'm waiting on a couple of upstream issues to be resolved before this will be able to be really used.

@aelsabbahy

Goss can be used as a fairly flexible shim to delay container startup; I've written a blog post explaining how this can be accomplished with a minor change to your image here:

Kubernetes has the concept of init containers; I wonder if compose/swarm would benefit from a similar concept.

@piotr-s-brainhub

+1

@starx
starx commented Sep 20, 2016 edited

I think it's better to let the service you are exposing in a container decide whether or not it is ready or capable of exposing its service.

For example, a PHP application might depend on a MySQL connection. So in the ENTRYPOINT of the PHP container, I wrote something like this:

#!/bin/bash
cat << EOF > /tmp/wait_for_mysql.php
<?php
\$connected = false;
while(!\$connected) {
    try{
        \$dbh = new pdo( 
            'mysql:host=mysql:3306;dbname=db_name', 'db_user', 'db_pass',
            array(PDO::ATTR_ERRMODE => PDO::ERRMODE_EXCEPTION)
        );
        \$connected = true;
    }
    catch(PDOException \$ex){
        error_log("Could not connect to MySQL");
        error_log(\$ex->getMessage());
        error_log("Waiting for MySQL Connection.");
        sleep(5);
    }
}
EOF
php /tmp/wait_for_mysql.php
# Rest of entry point bootstrapping

This way, I can add any logic to ensure that the dependencies of the service I am exposing (i.e. PHP) have been resolved.

@realulim

Nabin Nepal wrote:

I think it's better to let the service you are exposing on a container decide whether or not it is ready or capable of exposing its service.

You can of course hardcode this behavior into every container that uses your
MySQL container. But if something in your MySQL service changes, then you are
changing all dependent containers, not to mention the repetitive coding
needed in each. This is not DRY; there is no stable contract, and thus it will
lead to brittle systems.

From a software craftsmanship standpoint there should be some kind of
"Container Readiness SPI", which the container developer can implement. On
the other side there should be a "Container Readiness API", which the
services can depend on.

Ulrich

@starx
starx commented Sep 20, 2016 edited

@realulim I agree that any change in the MySQL container has to be replicated or propagated to all affected or linked containers.

However, if the change is about parameters like DB_HOST, DB_NAME, DB_USER and DB_PASSWORD, these could be passed as an ARG (argument) and shared by all related containers. If you are using a docker-compose.yml file, then the change happens in one file.

And I totally agree that an API to check a container's readiness is the real way of solving this, but I still believe that the service being exposed is a better candidate to declare it.

@piotr-s-brainhub

A workaround: until nc -z localhost 27017; do echo Waiting for MongoDB; sleep 1; done
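
Wrapped into an entrypoint, that one-liner becomes something like the sketch below ("mongo" is a placeholder for whatever the service is called in your compose file):

#!/bin/sh
# Entrypoint wrapper: block until MongoDB accepts TCP connections, then start the real process.
until nc -z mongo 27017; do
    echo "Waiting for MongoDB"
    sleep 1
done
exec "$@"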

@starx
starx commented Sep 23, 2016

@piotr-s-brainhub As the comments above mention, having an open port does not mean that the service is ready.

@tartakynov
tartakynov commented Sep 29, 2016 edited

Can we have an optional readiness condition which can be triggered either by logs, port opening, or a time delay? Something like:

ready_when:
  in_logs: `MySQL init process done`
  ports_open:
  - 3306
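
No such key exists today, but the log-based half of that idea can be approximated from the outside with the docker-compose CLI; a rough sketch, assuming mysql and app are service names in the compose file:

# Start the database, wait for its "init done" log line, then start the app.
docker-compose up -d mysql
until docker-compose logs mysql 2>&1 | grep -q "MySQL init process done"; do
    echo "Waiting for MySQL to finish initialising"
    sleep 1
done
docker-compose up -d app
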
@korya
korya commented Nov 2, 2016 edited

I just realized that waiting for dependency containers to become ready can easily be implemented with tools like Ansible. Has anyone used that approach? Can you easily replace docker-compose with Ansible/Chef/Puppet? Is there any project on GitHub demonstrating this approach?

Note: I understand the importance of writing a robust service that can run even when its dependencies are unavailable at the moment. That's not the question.

@djui
djui commented Nov 2, 2016

I solved this nowadays with a tool I wrote: https://github.com/betalo-sweden/await

It can wait until a given list of resources is available and then continue with whatever you want to run next, either by going to the next command implicitly or by calling it explicitly.

@derekmahar

@djui, what does await do while it is waiting for a given resource?

@djui
djui commented Nov 2, 2016

@derekmahar It polls. It has a default timeout of 60 seconds. Every time it can't see the resource, it will just retry in 1s intervals. Currently it doesn't do concurrent resource detection, so it's sequential, but that turned out to be good enough and can be fixed.

I use it in the following scenario:

I spin up a docker-compose infrastructure and then run an integration test driver. The driver service gets started only after all components in the infrastructure are available, using await; so await eventually calls the driver's run command.

@mixja
mixja commented Nov 25, 2016 edited

Here's a way to do this with the new Docker HEALTHCHECK directive using make:

https://gist.github.com/mixja/1ed1314525ba4a04807303dad229f2e1

[UPDATE: updated the gist to deal with the case where the container exits with an error code, as Docker 1.12 somewhat stupidly reports the healthcheck status of a stopped container as "starting"]
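
One way to consume the HEALTHCHECK status from a script is simply to poll what Docker 1.12+ records; a stripped-down sketch (mydb is a placeholder container name, and the image is assumed to define a HEALTHCHECK):

# Wait until Docker reports the container healthy, bailing out if it stops first.
until [ "$(docker inspect -f '{{.State.Health.Status}}' mydb)" = "healthy" ]; do
    if [ "$(docker inspect -f '{{.State.Running}}' mydb)" != "true" ]; then
        echo "mydb exited before becoming healthy" >&2
        exit 1
    fi
    sleep 2
done
echo "mydb is healthy"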

@habitullence

Thanks @mixja, nice solution.

@korya
korya commented Nov 25, 2016

@mixja, nice solution! That's exactly the functionality I would expect to come out of the box. But now the question is: if you start your containers manually, why do you need docker-compose at all?

@sslavic
sslavic commented Nov 25, 2016

For testing I use https://github.com/avast/docker-compose-gradle-plugin and it uses Docker healthcheck as well - no more artificial pauses, faster builds.

@mixja
mixja commented Nov 29, 2016

@korya - Docker compose is not really an orchestration tool - it is more of an environment specification and management tool. I use Make to provide procedural style orchestration over Docker Compose and Docker (and other tools as required). The combination of Make, Docker and Docker Compose is very powerful and you can achieve a lot of different scenarios with these building blocks.

@korya
korya commented Nov 29, 2016

@mixja well, maybe you are right. But as many people have pointed out in this thread, orchestration functionality is badly needed in test environments, and when docker-compose is in your toolbox it is very tempting to expect this kind of functionality from it.

Indeed, according to the docs, "Compose is a tool for defining and running multi-container Docker applications". Although it does not say that compose is an orchestration tool, I think that from a user's perspective (e.g. mine) it is natural to expect "a tool for defining and running multi-container Docker applications" to support basic dependency management between the managed containers out of the box.

I am not saying that the tool has to support it. All I am saying is that it is very natural to expect it. Otherwise everyone has to come up with their super smart ways to do it. In fact, we use a bash script doing something similar to what your makefile does.

@djui
djui commented Nov 30, 2016 edited

@mixja @korya I would like to improve my tool await and would like to ask you for feedback: what do your Makefile versions provide that is missing from, more convenient than, or enabled beyond await?

It seems the healthcheck+make version takes a "global" view: no single container knows the global state (but the makefile does), whereas await takes a "local" view: each enabled container knows (only) what it needs to know, similar to depends_on or links. Furthermore, you prefer to ship the container with the tools required for the healthcheck (which sometimes is the default, e.g. mysqlshow) and otherwise leave the Dockerfile untouched. Additionally, you seem to use docker-compose not mainly for composition anymore but mainly for flexible configuration (e.g. docker-compose up -d mysql should be equivalent to docker run -d -e ... -v ... -p ... mysql).

@mixja
mixja commented Dec 1, 2016

Hi @djui - it's probably a philosophical point of view, but I think the whole premise of the HEALTHCHECK is promoting the right behaviour - i.e. a container can provide a means of establishing container health, without any external dependencies.

This by no means detracts from the value of having something external verify connectivity; however, I would typically run a suite of acceptance tests to cover this, as you want to verify connectivity and a whole lot more (i.e. application functionality). Of course you can't generally run that level of testing until a complete environment has been established. The scope of your await tool, and of other approaches I've used in the past (Ansible playbooks wrapped in an agent container), is really getting the environment setup orchestrated correctly (not the end goal of acceptance testing), and until now that was really the only approach available in a Docker world.

With Docker 1.12 we now have a means to introspect the Docker environment and the ability to use well-established constructs (i.e. bash/shell mechanisms) to "await" a certain state, of course as long as our containers have defined their own health checks. I see more value in leveraging the native capabilities of the platform and encouraging container owners to define their own health checks, rather than relying on the historical external (I've started my application process, it's no longer my problem) approach we have had to resort to.

As a related analogy, consider AWS CloudFormation and the concept of autoscaling groups and orchestrating rolling updates. How does CloudFormation know whether a new instance is "healthy" and ready to go, so that we can kill an old instance and roll in another new one? Do we write an external healthcheck, or do we rely on the instance itself to signal health? The answer is the latter: it means the instance owner can set whatever success criteria are required for his/her instance, and then signal to the overarching orchestration system (i.e. CloudFormation) that the instance is "healthy".

With regards to your comments about Docker Compose - it is a tool that can provide both aspects you mention. The docker-compose.yml part is the desired-state compositional environment specification, whilst the various docker-compose commands provide the ability to interact with the environment in a number of ways. For now we need external orchestration tools because, fundamentally, docker-compose does not perform dependency management between services well enough. As docker-compose gets features like native health check support, the goal of a single docker-compose up command will be more realistic, assuming we'll be able to specify, for example, that a service must be marked healthy before it is considered "up", which then means our dependent services effectively wait until the dependency is healthy.
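
For images that don't ship a HEALTHCHECK of their own, Docker 1.12 also allows attaching one at run time; a sketch (the check command and thresholds are just illustrative):

# Attach a health check to a MySQL container at run time rather than in the Dockerfile.
docker run -d --name mydb \
    --health-cmd='mysqladmin ping -h localhost || exit 1' \
    --health-interval=5s \
    --health-timeout=3s \
    --health-retries=5 \
    mysql:5.7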

@djui
djui commented Dec 1, 2016

@mixja Thanks for the detailed explanation. I think

I see more value in leveraging the native capabilities of the platform

is a good/the main point. I'm just waiting for Docker Compose to leverage the healthchecks natively, either in depends_on or a new key such as await. I just wonder if it should/will go even a step further than that and basically bring down linked containers if e.g. --abort-on-container-exit is set and a health check at runtime sets the healthcheck label to unhealthy.

@desprit
desprit commented Dec 9, 2016

Possible temporary workaround for those of you who are looking for delay functionality to run tests:

I have two docker-compose yml files: one for testing and another for development. The only difference is that docker-compose.test.yml has a sut container, which runs pytest. My goal was to run the test docker-compose file and, if the pytest command in the sut container fails, not run the development one. Here is what I came up with:

# launch test docker-compose; note: I'm starting it with -p argument
docker-compose -f docker-compose.test.yml -p ci up --build -d
# simply get ID of sut container
tests_container_id=$(docker-compose -f docker-compose.test.yml -p ci ps -q sut)
# wait for sut container to finish (pytest will return 0 if all tests passed)
docker wait $tests_container_id
# count sut containers whose exit code is non-zero (0 means all tests passed)
tests_status=$(docker-compose -f docker-compose.test.yml -p ci ps -q sut | xargs docker inspect -f '{{ .State.ExitCode  }}' | grep -v '^0$' | wc -l | tr -d ' ')
# print logs if tests didn't pass and return exit code
if [ $tests_status = "1" ] ; then
    docker-compose -f docker-compose.test.yml -p ci logs sut
    return 1
else
    return 0
fi

Now you can use the code above in any function of your choice (mine is called test) and do something like this:

test
test_result=$?
if [[ $test_result -eq 0 ]] ; then
    docker-compose -f docker-compose.yml up --build -d
fi

Works well for me but I'm still looking forward to see docker-compose support that kind of stuff natively :)

@blockjon

+1

@electrofelix

Perhaps things that are considered outside the core of docker-compose could be supported by allowing plugins? Similar to request #1341, it seems there is additional functionality that some would find useful but that doesn't necessarily align fully with the current vision. Supporting a plugin system such as the one proposed in #3905 would let compose focus on a core set of capabilities, and those who want this for their particular use case could write a plugin that handles up differently.

It would be nice to have docker-compose act as the entry point to all of our local projects' Docker environment setup, rather than having to add a script in front of each one as the default entry point, or relying on people to remember to run the script for the odd cases.
