Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

remote worker missing #262

Closed
gaos1 opened this issue Jan 19, 2016 · 5 comments
Closed

remote worker missing #262

gaos1 opened this issue Jan 19, 2016 · 5 comments

Comments

@gaos1
Copy link

gaos1 commented Jan 19, 2016

We have a concourse TSA in AWS US and a remote worker in AWS China.
We often see the following errors in this worker's logs:
worker for container is missing
Our worker is indeed missing when we executed below commands to check:
fly ws

Here are some related logs in worker job groundcrew:
-- beacon.stderr.log

Write failed: Broken pipe
Write failed: Broken pipe
Write failed: Broken pipe
Connection to concourse.xxx.xxx closed by remote host.

-- beacon.stdout.log

2016/01/07 06:46:00 heartbeat took 2.574034443s
2016/01/07 06:46:31 heartbeat took 1.242054249s
2016/01/07 06:47:03 heartbeat took 1.338272014s
2016/01/07 06:47:34 heartbeat took 1.306922357s
2016/01/07 06:48:05 heartbeat took 756.882852ms
2016/01/07 06:48:36 heartbeat took 791.133605ms
2016/01/07 06:49:06 heartbeat took 847.64856ms
2016/01/07 06:49:37 heartbeat took 786.748922ms
2016/01/07 06:50:13 heartbeat took 5.626379388s
@concourse-bot
Copy link
Collaborator

Hi there!

We use Pivotal Tracker to provide visibility into what our team is working on. A story for this issue has been automatically created.

The current status is as follows:

This comment, as well as the labels on the issue, will be automatically updated as the status in Tracker changes.

@gaos1
Copy link
Author

gaos1 commented Jan 19, 2016

per your suggestion, we set tsa.heartbeat_interval to 60s and the problem still keeps happening

@xoebus
Copy link
Contributor

xoebus commented Mar 3, 2016

Does the connection between the two machines drop for more than 60 seconds at a time?

@vito
Copy link
Member

vito commented Apr 10, 2016

There's honestly not a lot we can do about this. If there's a bad enough connection between your workers and the rest of Concourse they'll occasionally go away and come back. The TTLs are fairly generous out of the box; if the worker has become unavailable, it's likely that even if it were still registered you'd just see network failures trying to reach it.

@concourse-bot
Copy link
Collaborator

Hello again!

All stories related to this issue have been accepted, so I'm going to automatically close this issue.

At the time of writing, the following stories have been accepted:

If you feel there is still more to be done, or if you have any questions, leave a comment and we'll reopen if necessary!

cdutra pushed a commit that referenced this issue Aug 4, 2016
[finishes #127219723]

Submodule src/github.com/concourse/atc 1ee1c48..a3023dd:
  > ginkgo blur reoreded everything
  > return all public pipelines on GetAllPipelines endpoint
  > only check for basic auth on get token endpoint
  > do not default team name to main if not provided
  > do not default team name to main in api
Submodule src/github.com/concourse/fly cb98e4c..3269297:
  > make --team-name required
  > retrieves token if there is no auth method set
Submodule src/github.com/concourse/testflight fdd0c3f..abd306e:
  > do not default team name to 'main'
Submodule src/github.com/onsi/ginkgo e43390e..74c678d:
  > Make JUnit reporter include failure location in message. (#262)
  > remove 1.4 from travis.yml
  > Add gcflags option (#276)
  > Revert "Use the go1.5 build tag to handle vendor exceptions" (#274)
  > Merge pull request #272 from fsouza/fix-vendor
  > Add flaky test mitigation (#261)
  > Allow units and precision in benchmark (#266)
  > Add Solaris support (#264)
  > Merge pull request #259 from kwadrat/master
  > Merge branch 'apvail-spell-fix'
  > Fix go16 vendor
  > Merge pull request #250 from james-lawrence/master
  > Merge pull request #228 from jayunit100/RegexFileNameFiltering
  > Fix test flakiness
  > Merge pull request #235 from mboersma/fix-travis
  > fix compilation on older versions of go
  > fix issue where packages that reference vendored dependencies weren't compiling
  > Merge pull request #216 from sha1sum/master
  > Merge pull request #209 from luxas/build_on_arm64
  > Merge pull request #212 from cfmobile/master
  > Merge pull request #210 from cfmobile/master
Submodule src/github.com/onsi/gomega 2152b45..9ed8da1:
  > Merge pull request #166 from trayo/patch-2
  > Merge pull request #164 from wendorf/assert_typo
  > Merge remote-tracking branch 'origin/pr/163'
  > Merge pull request #160 from tinygrasshopper/fix_failing_close_ghttp
  > Merge pull request #150 from tinygrasshopper/build-fix
  > Merge pull request #159 from WesleyJeanette/patch-1
  > Merge pull request #157 from kwadrat/master
  > Merge pull request #141 from mariantalla/gomega-yaml-matcher
  > Reset tmpDir in gexec.CleanupBuildArtifacts
  > Update test description for match json tests.
  > Make the error message for expected JSON values having the wrong type accurate
  > Merge pull request #133 from tjarratt/be-identical-to-matcher
  > Merge pull request #132 from tjarratt/improve-match-json-error-message
  > Merge pull request #128 from tinygrasshopper/have-cap
  > drop 1.4 from travis
  > ghttp tests should now pass in 1.6
  > CloseClientConnections test uses http.Post instead of http.Get to avoid retries
  > add tip to .travis.yml
  > Merge pull request #125 from cfmobile/master
  > Merge pull request #122 from cfmobile/master
  > Merge pull request #119 from jim-slattery-rs/gitignore_idea
  > Merge pull request #118 from jim-slattery-rs/fix_up_succeed

Signed-off-by: Yucheng Tu <ytu@pivotal.io>
vito added a commit that referenced this issue Apr 11, 2018
vmware-archive/fly#201

Submodule src/github.com/concourse/fly f9d92f5..2be951a:
  > Merge branch 'govau-unmarshalstrict'
  > Merge pull request #214 from alepee/patch-1
Submodule src/gopkg.in/yaml.v2 e4d366fc..5420a8b6:
  > Use underlying float precision when formatting floats (#353)
  > Fix typo in tab error message (#208).
  > Fix misspell of precede in ported code (#216)
  > Fix type on Marshal docs (#206).
  > Fixed typos in docstrings (#179).
  > Drop unnecessary explicit timestamp tags.
  > Fix broken test from last merge.
  > increment non-zero scanner error lines (#319)
  > Convert int to float when explicitly tagged.
  > Encode and decode arrays.
  > Fix edge case when decoding MinInt as -0b.
  > Remove mention of non-existent examples folder.
  > Fix curious assumption from the original C reader.
  > Ensure scanner has data before checking for blanks.
  > Drop invalid simple key assertion.
  > Improve map stabilization logic.
  > Fix unstable map key ordering (#195).
  > Merge pull request #336 from rogpeppe/025-go.mod
  > Merge pull request #335 from rogpeppe/024-merge-devel
  > Merge pull request #253 from heldtogether/patch-1
  > Merge pull request #308 from rogpeppe/016-revert-v2-PR273
  > Merge pull request #273 from rogpeppe/006-timestamps
  > Merge pull request #299 from rogpeppe/009-gofmt
  > Merge pull request #281 from houshengbo/fix-incorrect-line-number
  > Correct documentation for Marshal (#287)
  > Merge pull request #289 from rliebz/null-fix
  > Replace LICENSE text with actual license (#274)
  > Merge pull request #272 from rogpeppe/005-cleaner-tag-scan
  > Merge pull request #103 from andreychernih/bugfix/non-specific-tags
  > Merge pull request #271 from rogpeppe/004-embedded-example
  > Merge pull request #94 from mvo5/feature/embeded-structs-example
  > Merge pull request #264 from hiveminded/v2
  > Merge pull request #262 from wupeka/v2
  > Remove unreachable code to fix go vet (#249)
  > Fix dead URL for yaml specification (#240)
  > Tighten restrictions on float decoding (#171)
  > Fix decode test for Go 1.8 (#217)
  > Fix unmarshaler handling of empty strings.
  > new license in the README file (#189)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants