/cluster/projects/p697 mount hangs on p697-appn-norment01 #64

Closed
ofrei opened this issue Feb 1, 2021 · 22 comments

Comments

ofrei commented Feb 1, 2021

It seems that the /cluster/projects/p697 mount hangs on p697-appn-norment01.
I can use p697-submit to access the data on the cluster.
I can ssh to p697-appn-norment01, and I can access tsd/p697/data/durable/ from there.
However, I cannot access /cluster/projects/p697 from p697-appn-norment01.

https://rt.uio.no/SelfService/Display.html?id=4249055

Sandeek commented Feb 1, 2021

@Sandeek Sandeek closed this as completed Feb 1, 2021
@ofrei ofrei reopened this Feb 2, 2021

ofrei commented Feb 2, 2021

@Sandeek It seems we have the same issue on p697-appn-norment01 again. Could you double-check and fix it as before? If it happens again, is it possible to investigate further? I think Sabry had some insights: this was related either to a lack of space in the /tmp folder or to something else...

idaElken commented Feb 2, 2021

I have the same issue on p697-appn-norment01.

ofrei commented Feb 2, 2021

@idaElken As a workaround, you could use the p697-submit or p697-submit2 machines; they work fine for me as of now.

idaElken commented Feb 2, 2021

@ofrei Thanks - I'll try that!

Sandeek commented Feb 2, 2021 via email

Sandeek commented Feb 2, 2021 via email

E-Claire commented Feb 8, 2021

This issue is happening for me again.

Sandeek commented Feb 8, 2021 via email

Sandeek commented Feb 8, 2021

Dear Claire,

Can you specify your p697 username, and since when you have been facing this issue?

E-Claire commented Feb 8, 2021

My username is p697-elizabethc.
I had been working on the appn node and then it suddenly just stopped working. So I tried reconnecting and could access tsd but not the cluster, which is when I replied here.

E-Claire commented Feb 8, 2021

I just tried logging into appn and accessing the cluster (from the appn node), and now I am able to. Super weird; the issue seems really intermittent.

Sandeek commented Feb 8, 2021 via email

E-Claire commented Feb 8, 2021

Okay, thanks for looking into this Sundeep!!

In case this happens again, is there a certain amount of time that you would recommend waiting before reporting, to see if the outage will just fix itself?

Sandeek commented Feb 8, 2021

This issue has been persisting for some time now, and TSD doesn't have much of a clue about why it occurs so frequently. You can let me know if and when it occurs.

E-Claire commented Feb 8, 2021

Okay, thanks

idaElken commented Feb 9, 2021

Unsure whether this is related, but now I do not get past login for either:
p697-appn-norment01.tsd.usit.no; nor
p697-submit.tsd.usit.no

After apparently logging in successfully, it hangs:
[Screenshot: Screen Shot 2021-02-09 at 09 38 29]

Any advice welcome :-)

idaElken commented Feb 9, 2021

Also hangs when trying to access
p697-appn-norment01.tsd.usit.no
from VMware and putty.

But apparently this is a known issue (sorry for posting):
https://www.uio.no/english/services/it/research/sensitive-data/log/nfs-hangs-on-submit-hosts.html

/Ida

Sandeek commented Feb 9, 2021 via email

ofrei commented Feb 10, 2021

This is resolved (works for me, also reported in operation log).
@Sandeek please add to https://docs.google.com/forms/d/e/1FAIpQLSfyQtSd3intuKkb5O4hmmPq5UzX6EhuCk95ovNfHULc7DIBKg/viewform and close this ticket

@Sandeek Sandeek closed this as completed Feb 10, 2021
@E-Claire

I have the same problem again: p697-appn hangs when I try to access the cluster. However, I am able to access the cluster through p697-submit.

@idaElken

Same for me - p697-appn got slower and slower throughout the morning, until it crashed. Now using p697-submit.
