[Regression] Subscriber-db is getting locked very frequently and AMF is not sending any response to UE #11088
Comments
In the API handle_subs_authentication_info_ans, if the input parameter status is not OK, the IMSI value is not getting populated in the authentication response towards AMF.
275638 Sun Jan 09 12:46:42 2022 7FC0660ED700 INFO AMF-AP tasks/amf/amf_app_main.cpp :0062 Received msg from :[TASK_GRPC_SERVICE] id:[137] name:[AMF_APP_SUBS_AUTH_INFO_RESP]
PR #11109 raised.
The issue seems to be caused by SQLite operations performed by 2 processes:
So the most probable cause seems to be simultaneous access to the DB by both processes. The periodic operation happens roughly every 15 minutes. Additional debug prints, for example setting print_grpc_payload to true in subscriberdb.yml, can help in getting more information for debugging.
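The suspected contention can be illustrated with a minimal sketch using Python's standard sqlite3 module. It is not the Magma code: the table name `subscribers`, its columns, and the function names are hypothetical, and both "processes" are modelled as two connections to the same database file. One connection holds an exclusive transaction (standing in for the periodic update), and a second connection with a short busy timeout tries to start a write.

```python
import os
import sqlite3
import tempfile


def demonstrate_lock(db_path, wait_seconds=0.1):
    """Return the error text seen by a second writer while a first holds the DB."""
    # isolation_level=None puts the connection in autocommit mode, so the
    # BEGIN/COMMIT statements below are issued explicitly.
    holder = sqlite3.connect(db_path, isolation_level=None)
    # Hypothetical schema, used only for this illustration.
    holder.execute(
        "CREATE TABLE IF NOT EXISTS subscribers "
        "(subscriber_id TEXT PRIMARY KEY, data TEXT)"
    )
    holder.execute("BEGIN EXCLUSIVE")  # models the periodic update holding the DB
    holder.execute(
        "INSERT OR REPLACE INTO subscribers VALUES ('IMSI001010000000001', 'x')"
    )

    # Second connection models the other accessor; it waits at most
    # wait_seconds for the lock before giving up.
    contender = sqlite3.connect(db_path, timeout=wait_seconds, isolation_level=None)
    try:
        contender.execute("BEGIN IMMEDIATE")
        contender.execute("COMMIT")
        return None  # no contention observed
    except sqlite3.OperationalError as exc:
        return str(exc)
    finally:
        contender.close()
        holder.execute("COMMIT")
        holder.close()


def run_lock_demo():
    """Run the demonstration against a throwaway database file."""
    with tempfile.TemporaryDirectory() as tmp:
        return demonstrate_lock(os.path.join(tmp, "subscriber.db"))
```

Calling `run_lock_demo()` returns the text of the `sqlite3.OperationalError` raised for the second connection. Raising the `timeout` value makes the second accessor wait longer before failing, which only helps if the holder releases the lock within that window.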
I understand that there is a possible clash between the AMF fetch and the subscriberdb periodic updates with Orc8r.
Tried the following steps to narrow down the failure cases where subscriber-db is getting locked.
In all of the above cases, I could not see any subscriber-db lock of the kind that is causing the regression TC failure.
Hi @ulaskozat @panyogesh @daljit-mahi: [syslog, mmelog and 3gpp pcaps enclosed below] [Snap of syslog] - enclosing full syslog also
@arun-magma: We are able to emulate the issue in our setup with some Python sqlite commands. It is not a fix yet, but it handles the exception and also dumps the process (name and PID) that is holding the lock. The PR is #11438. It is under review; once it is merged I will update.
Hi @panyogesh, I hope this issue is reproducible during the SUCI Reg related loop test (it needs to run for 100 loops), since it appears to occur roughly every 5 minutes. I will test and let you know.
Is this the fix for this issue?
#11438 is the first part of the fix, which will handle the exception and send the error code to AMF.
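The idea of handling the exception and returning an error code can be sketched as follows. This is an illustration only, not the code in #11438: `AuthInfoAnswer`, `lookup_auth_info`, the status strings, and the `subscribers` table are all hypothetical names. The sketch also keeps the IMSI populated in the answer on the failure paths, so the caller can match an error answer to the request that triggered it.

```python
import os
import sqlite3
import tempfile
from dataclasses import dataclass
from typing import Optional


@dataclass
class AuthInfoAnswer:
    """Hypothetical answer type; the IMSI is set on success and on failure."""
    imsi: str
    status: str
    data: Optional[str] = None


def lookup_auth_info(conn, imsi):
    """Look up a subscriber and map a locked database to an error status."""
    try:
        # fetchall() runs the statement to completion, so no read lock lingers.
        rows = conn.execute(
            "SELECT data FROM subscribers WHERE subscriber_id = ?", (imsi,)
        ).fetchall()
    except sqlite3.OperationalError as exc:
        if "locked" in str(exc):
            return AuthInfoAnswer(imsi=imsi, status="DB_LOCKED")
        raise  # any other operational error is not handled here
    if not rows:
        return AuthInfoAnswer(imsi=imsi, status="NOT_FOUND")
    return AuthInfoAnswer(imsi=imsi, status="OK", data=rows[0][0])


def run_lookup_demo():
    """Exercise the success, not-found, and locked paths on a throwaway DB."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "subscriber.db")
        writer = sqlite3.connect(path, isolation_level=None)
        writer.execute(
            "CREATE TABLE subscribers (subscriber_id TEXT PRIMARY KEY, data TEXT)"
        )
        writer.execute(
            "INSERT INTO subscribers VALUES ('IMSI001010000000001', 'auth-vector')"
        )
        reader = sqlite3.connect(path, timeout=0.1)
        ok = lookup_auth_info(reader, "IMSI001010000000001")
        missing = lookup_auth_info(reader, "IMSI999")
        writer.execute("BEGIN EXCLUSIVE")  # simulate the other process holding the DB
        locked = lookup_auth_info(reader, "IMSI001010000000001")
        writer.execute("COMMIT")
        writer.close()
        reader.close()
        return ok, missing, locked
```

With an explicit status on every path, the caller gets an answer it can act on even when the database is locked, instead of waiting indefinitely.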
PR #11454 raised for this issue.
Issue is not observed in the latest runs. So closing with appropriate logs. |
Your Environment
Describe the Issue
Subscriber-db is getting locked very frequently and AMF is not sending any response to UE
To Reproduce
Execute the entire regression suite. The subscriber-db gets locked, and when a Reg-Request is received from the UE, AMF does not send any response back because it is not getting any answer from subscriberdb.
Expected behavior
AMF should wait for the response from subscriberdb for a certain time duration and if there is no response it should send a Reg-Reject.
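The expected behavior can be modelled with a bounded wait. The sketch below is in Python for illustration only (the AMF itself is not written in Python), and `await_auth_answer`, the queue-based hand-off, and the outcome strings are assumptions rather than anything taken from the AMF code.

```python
import queue


def await_auth_answer(answers, timeout_s):
    """Wait up to timeout_s for a subscriberdb answer, else decide to reject."""
    try:
        answer = answers.get(timeout=timeout_s)
    except queue.Empty:
        # No answer arrived in time: reject instead of leaving the UE waiting.
        return ("REGISTRATION_REJECT", None)
    return ("PROCEED", answer)


def run_timeout_demo():
    """Show both outcomes: an answer that arrives, and one that never does."""
    answered = queue.Queue()
    answered.put({"imsi": "IMSI001010000000001"})
    silent = queue.Queue()  # models subscriberdb never answering
    return await_auth_answer(answered, 0.05), await_auth_answer(silent, 0.05)
```

The timeout value is a design choice: it should be long enough to ride out a short-lived lock but shorter than the UE's own registration timer, so that the reject reaches the UE before it gives up.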
The issue was noticed in the following TCs in today's (6-1-2022) regression run:
TC1a Hyper_SUCI_No_PDU_LoopTest
TC6c Concurrent GUTI Registration with 2 UE (Mix of Valid + Invalid UE Reg - no subdb)
TC1d Concurrent SUCI Registration with 2 UE (Mix of Valid + Invalid UE Reg) P0 (subscriber Lock)
TC1h Verify unsupported DNN config P1.7 (subscriber Lock)
Screenshots:
syslogs:
mme.log: No Answer received from subscriberdb.
Fail_Case:
General_Flow: https://app.zenhub.com/files/170803235/950d955c-39c8-4e1b-9c6c-4e541dd2cc93/download
The above logs were hit at different points, which caused the TC failure.