Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The ping with sn is stopped? #250

Open
lurenpluto opened this issue May 4, 2023 Discussed in #249 · 2 comments
Open

The ping with sn is stopped? #250

lurenpluto opened this issue May 4, 2023 Discussed in #249 · 2 comments
Assignees
Labels
BDT bucky data transfer protocol bug Something isn't working SN SN Server

Comments

@lurenpluto
Copy link
Member

Discussed in #249

Originally posted by streetycat May 4, 2023
I found that my OOD has been offline.

After diagnosis, I found that there is no Ping package to sn, I don't known why the ping is abort.

The gateway has running 6 days, the following log is the earliest that I can find:
gateway_bdt_1727_r00023.log

The following log is the latest:
gateway_bdt_1727_rCURRENT.log

@lurenpluto lurenpluto added the bug Something isn't working label May 4, 2023
@lurenpluto
Copy link
Member Author

lurenpluto commented May 4, 2023

There are several suggestions on how to post logs

  1. Since the log itself is relatively large, we suggest using zip compression to upload it, which can also save upload time
  2. If it is the log of gateway process, because the log is divided into the main log and bdt log, so even when the bdt module has problems, it should be accompanied by a main log, at least it can see some version-related information, easy to diagnose

Thanks for providing further log to help diagnose @streetycat

@lurenpluto lurenpluto added the BDT bucky data transfer protocol label May 4, 2023
@jing-git jing-git pinned this issue May 6, 2023
@jing-git jing-git unpinned this issue May 6, 2023
@lurenpluto lurenpluto added the SN SN Server label May 8, 2023
@lurenpluto lurenpluto added this to the GC-supported Release milestone May 9, 2023
@lurenpluto
Copy link
Member Author

Considering the complexity of the SN ping mechanism and network conditions, consider add SN ping alive detection mechanism, similar to the current process stuck detection and task deadlock detection, if the SN ping is not updated for a period of time, then the ping is considered stuck, you can try to restart the gateway to avoid the entire gateway process in a "fake dead" state

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BDT bucky data transfer protocol bug Something isn't working SN SN Server
Projects
Status: 🐞Discovered Bugs
Development

No branches or pull requests

2 participants