Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ARP entry is out of sync between kernel and APPL_DB after warm reboot if the ARP entry is updated more than once during warm reboot in PFC watchdog warm reboot test #13341

Closed
stephenxs opened this issue Jan 12, 2023 · 1 comment
Assignees
Labels
NVIDIA Triaged this issue has been triaged

Comments

@stephenxs
Copy link
Collaborator

stephenxs commented Jan 12, 2023

Description

There is a logic to send lossless traffic to the DUT to verify whether the traffic is discarded while the egress queue is under a PFC storm in the PFCWD warm reboot test.
The flow is like this:

  1. It randomly chooses a port set to use in the test.
  2. It chooses the first port in the port set to test and configures an IP address, eg. 192.168.0.2 on the corresponding port in PTF docker to send the traffic. The DUT learns the first port's MAC address as the MAC of 192.168.0.2 in ARP entry.
  3. It warm-reboots the DUT and chooses the second port in the port set to test and configures the same IP address on the corresponding port in PTF docker (which is a different PTF port from the one chozen in step 2).
  4. DUT's kernel learns the second port's MAC address as the MAC of 192.168.0.2 in the ARP entry after warm-reboot, which means 192.168.0.2's ARP entry is updated.

But after warm reboot, what I observed is that

  • the output of ip neigh indicates the MAC in 192.168.0.2 is the second port's MAC address, which is correct
  • in APPL_DB (and below, ASIC_DB, SAI, SDK, and ASIC), the first port's MAC address is still used as the MAC of 192.168.0.2, which is wrong.
  • the MAC of the first port isn't learned since it doesn't send any traffic anymore
  • as a result, the traffic sent from the second port gets flooded

The log when the issue is observed

Jan 11 01:30:40.206914 mtbc-sonic-01-2410 INFO swss#supervisord 2023-01-11 01:30:40,206 INFO exited: restore_neighbors (exit status 0; expected)
Jan 11 01:30:40.213342 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- isNeighRestoreDone: neighbor table restore to kernel is done
Jan 11 01:30:40.213342 mtbc-sonic-01-2410 INFO swss#supervisord: neighsyncd Listens to neigh messages...
Jan 11 01:30:40.216894 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: Vlan1000:192.168.0.2, new value 
Jan 11 01:30:45.212413 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- setWarmStartState: neighsyncd warm start state changed to reconciled

An example log when the issue is NOT observed

Jan 12 03:42:33.517318 mtbc-sonic-01-2410 INFO swss#restore_neighbor: restore_neighbor service is done for system warmreboot
Jan 12 03:42:33.689348 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- isNeighRestoreDone: neighbor table restore to kernel is done
Jan 12 03:42:33.689842 mtbc-sonic-01-2410 INFO swss#supervisord: neighsyncd Listens to neigh messages...
Jan 12 03:42:33.762000 mtbc-sonic-01-2410 INFO swss#supervisord 2023-01-12 03:42:33,761 INFO exited: restore_neighbors (exit status 0; expected)
Jan 12 03:42:34.610528 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: Vlan1000:192.168.0.2, new value 
Jan 12 03:42:38.689387 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- reconcile: NEIGH_TABLE NEW, key: Vlan1000:192.168.0.2, neigh:24:8a:07:ab:a1:12, family:IPv4, cache-state:NEW, 
Jan 12 03:42:38.690414 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- setWarmStartState: neighsyncd warm start state changed to reconciled

There is a logic to hold all the ARP entries learned during warm reboot and apply them to APPL_DB only after it is reconciled.
I do see the following message in the log insertToMap: NEIGH_TABLE, found key: Vlan1000:192.168.0.2, new value per my understanding, it will push any keys whose state is NEW into APPL_DB in AppRestartAssist::reconcile() but I don't see messages like %s NEW, key: %s, %s in the else if (state == NEW) branch of the function, which is probably the cause.

Steps to reproduce the issue:

  1. run test script pfcwd/test_pfcwd_warm_reboot.py

the issue won't be caught by the script by default. but if we capture traffic on other ports within the same VLAN, the flooded traffic can benn seen.

Describe the results you received:

ARP entry is out of sync.

Describe the results you expected:

ARP should always be aligned between kernel and SONiC/SDK/ASIC.

Output of show version:

(paste your output here)
admin@mtbc-sonic-01-2410:~$ show version

SONiC Software Version: SONiC.202205_1_rc.12-d60ed3f21_Internal
Distribution: Debian 11.6
Kernel: 5.10.0-18-2-amd64
Build commit: d60ed3f21
Build date: Sat Dec 31 19:22:49 UTC 2022
Built by: sw-r2d2-bot@r-build-sonic-ci03-242

Platform: x86_64-mlnx_msn2410-r0
HwSKU: ACS-MSN2410
ASIC: mellanox
ASIC Count: 1
Serial Number: MT1639X02523
Model Number: MSN2410-CB2F
Hardware Revision: "A1"
Uptime: 01:37:11 up 8 min,  1 user,  load average: 1.57, 2.15, 1.31
Date: Thu 12 Jan 2023 01:37:11

Docker images:
REPOSITORY                                         TAG                                 IMAGE ID       SIZE
docker-syncd-mlnx                                  202205_1_rc.12-d60ed3f21_Internal   f818576f9b9d   869MB
docker-syncd-mlnx                                  latest                              f818576f9b9d   869MB
docker-fpm-frr                                     202205_1_rc.12-d60ed3f21_Internal   ee75a1c85813   496MB
docker-fpm-frr                                     latest                              ee75a1c85813   496MB
docker-orchagent                                   202205_1_rc.12-d60ed3f21_Internal   2314af1587bb   479MB
docker-orchagent                                   latest                              2314af1587bb   479MB
docker-teamd                                       202205_1_rc.12-d60ed3f21_Internal   6b04a311afb0   460MB
docker-teamd                                       latest                              6b04a311afb0   460MB
docker-macsec                                      202205_1_rc.11-d60ed3f21_Internal   4f7b3fb10f13   462MB
docker-platform-monitor                            202205_1_rc.12-d60ed3f21_Internal   478948a84604   868MB
docker-platform-monitor                            latest                              478948a84604   868MB
docker-snmp                                        202205_1_rc.12-d60ed3f21_Internal   033b50b59f59   489MB
docker-snmp                                        latest                              033b50b59f59   489MB
docker-dhcp-relay                                  202205_1_rc.11-d60ed3f21_Internal   d5f88b8a24d4   454MB
docker-sonic-telemetry                             202205_1_rc.12-d60ed3f21_Internal   e54856ce90c3   525MB
docker-sonic-telemetry                             latest                              e54856ce90c3   525MB
docker-lldp                                        202205_1_rc.12-d60ed3f21_Internal   68b2c3afdfa7   487MB
docker-lldp                                        latest                              68b2c3afdfa7   487MB
docker-database                                    202205_1_rc.12-d60ed3f21_Internal   c56dcb2f8f78   444MB
docker-database                                    latest                              c56dcb2f8f78   444MB
docker-mux                                         202205_1_rc.12-d60ed3f21_Internal   6ed113eec122   493MB
docker-mux                                         latest                              6ed113eec122   493MB
docker-router-advertiser                           202205_1_rc.12-d60ed3f21_Internal   e05b851d83e3   444MB
docker-router-advertiser                           latest                              e05b851d83e3   444MB
docker-sflow                                       202205_1_rc.12-d60ed3f21_Internal   da1869e43e5d   429MB
docker-sflow                                       latest                              da1869e43e5d   429MB
docker-nat                                         202205_1_rc.12-d60ed3f21_Internal   39694c3ecbf6   431MB
docker-nat                                         latest                              39694c3ecbf6   431MB
docker-sonic-mgmt-framework                        202205_1_rc.12-d60ed3f21_Internal   d57b752bacb5   557MB
docker-sonic-mgmt-framework                        latest                              d57b752bacb5   557MB
urm.nvidia.com/sw-nbu-sws-sonic-docker/sonic-wjh   1.3.1                               410975934531   310MB
urm.nvidia.com/sw-nbu-sws-sonic-docker/doroce      1.0.0-202205-internal-7             2aa171345b8f   200MB

Output of show techsupport:

(paste your output here or download and attach the file here )

Additional information you deem important (e.g. issue happens only occasionally):

Root cause analysis

The STATE of an entry in AppRestartAssist is maintained on a per-entry basis.
What happens if a NEW entry is updated with a value that existed in the cache? in that case, NEW state will be overridden to SAME and the new information will be lost.

  1. first port's MAC is in the ARP entry.
  2. kernel notifies ARP entry updated with the second's MAC. cache state => NEW
  3. kernel notifies ARP entry updated with whatever the first or second's MAC. cache state => SAME. the information updated in step 2 is lost.
  4. reconcile. SAME entry won't be programmed to the APPL_DB. the new information becomes SAME and won't be updated to APPL_DB, which means the new information is lost.

Details can be seen if set the severity to INFO

Jan 12 10:50:58.801935 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: Vlan1000:192.168.0.2, new value 
                                                                                                                                ^<<<<<<<============ first updated to new
Jan 12 10:50:58.807019 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: Received message NEIGH_TABLE, key: PortChannel104:fc00::7e, neigh:12:03:fe:c8:9a:c7, family:IPv6, , delete = 0
Jan 12 10:50:58.807019 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: PortChannel104:fc00::7e, same value
Jan 12 10:50:58.808143 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: Received message NEIGH_TABLE, key: PortChannel102:fc00::76, neigh:1e:20:2b:4d:e7:79, family:IPv6, , delete = 0
Jan 12 10:50:58.808143 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: PortChannel102:fc00::76, same value
Jan 12 10:50:58.808936 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: Received message NEIGH_TABLE, key: PortChannel103:fc00::7a, neigh:62:34:26:6d:0f:d8, family:IPv6, , delete = 0
Jan 12 10:50:58.808936 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: PortChannel103:fc00::7a, same value
Jan 12 10:50:58.809360 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- isLinkLocalEnabled: IPv6 Link local is not supported for lo 
Jan 12 10:50:58.809776 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: Received message NEIGH_TABLE, key: PortChannel101:fc00::72, neigh:1a:d2:50:3e:9e:4e, family:IPv6, , delete = 0
Jan 12 10:50:58.809776 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: PortChannel101:fc00::72, same value
Jan 12 10:50:58.811172 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- isLinkLocalEnabled: IPv6 Link local is not enabled on Vlan1000
Jan 12 10:51:01.432447 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: Received message NEIGH_TABLE, key: Vlan1000:192.168.0.2, neigh:24:8a:07:ab:a1:12, family:IPv4, , delete = 0
Jan 12 10:51:01.432447 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: Vlan1000:192.168.0.2, same value
                                                                                                        ^<<<<<<<<<<<========= second updated to the same (new) value but state is set to SAME
Jan 12 10:51:01.434169 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: Received message NEIGH_TABLE, key: Vlan1000:192.168.0.2, neigh:24:8a:07:ab:a1:12, family:IPv4, , delete = 0
Jan 12 10:51:01.434169 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- insertToMap: NEIGH_TABLE, found key: Vlan1000:192.168.0.2, same value
                                                                                                        ^<<<<<<<<<<<========= third updated to the same (new) value but state is set to SAME
Jan 12 10:51:03.796715 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- checkReconcileTimer: warmstart timer expired
Jan 12 10:51:03.796991 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: PortChannel102:fc00::76, neigh:1e:20:2b:4d:e7:79, family:IPv6, cache-state:SAME, 
Jan 12 10:51:03.797170 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: PortChannel102:10.0.0.59, neigh:1e:20:2b:4d:e7:79, family:IPv4, cache-state:SAME, 
Jan 12 10:51:03.797373 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: PortChannel104:fc00::7e, neigh:12:03:fe:c8:9a:c7, family:IPv6, cache-state:SAME, 
Jan 12 10:51:03.797525 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: eth0:10.75.71.1, neigh:00:00:5e:00:01:04, family:IPv4, cache-state:SAME, 
Jan 12 10:51:03.797670 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: PortChannel103:10.0.0.61, neigh:62:34:26:6d:0f:d8, family:IPv4, cache-state:SAME, 
Jan 12 10:51:03.797817 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: PortChannel104:10.0.0.63, neigh:12:03:fe:c8:9a:c7, family:IPv4, cache-state:SAME, 
Jan 12 10:51:03.797952 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: Vlan1000:192.168.0.2, neigh:24:8a:07:ab:a1:12, family:IPv4, cache-state:SAME,
                                                                                                     ^<<<<<<<<<<<=========information that the entry has been updated is lost
Jan 12 10:51:03.798101 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: eth0:10.75.71.2, neigh:04:3f:72:64:36:48, family:IPv4, cache-state:SAME, 
Jan 12 10:51:03.798233 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: PortChannel103:fc00::7a, neigh:62:34:26:6d:0f:d8, family:IPv6, cache-state:SAME, 
Jan 12 10:51:03.798379 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: PortChannel101:fc00::72, neigh:1a:d2:50:3e:9e:4e, family:IPv6, cache-state:SAME, 
Jan 12 10:51:03.798511 mtbc-sonic-01-2410 INFO swss#neighsyncd: :- reconcile: NEIGH_TABLE SAME, key: PortChannel101:10.0.0.57, neigh:1a:d2:50:3e:9e:4e, family:IPv4, cache-state:SAME, 
Jan 12 10:51:03.805636 mtbc-sonic-01-2410 NOTICE swss#neighsyncd: :- setWarmStartState: neighsyncd warm start state changed to reconciled
Possible fix

Do not change an entry's state to SAME if it has been NEW.

@stephenxs stephenxs changed the title ARP entry is out of sync between kernel and ASIC_DB after warm reboot if the ARP entry is updated during warm reboot in PFC watchdog warm reboot test ARP entry is out of sync between kernel and APPL_DB after warm reboot if the ARP entry is updated more than once during warm reboot in PFC watchdog warm reboot test Jan 16, 2023
@neethajohn neethajohn added the Triaged this issue has been triaged label Jan 18, 2023
liat-grozovik pushed a commit to sonic-net/sonic-swss that referenced this issue Jan 19, 2023
…_DB if multiple updates are received from RTNL (#2619)

- What I did
Fix issue sonic-net/sonic-buildimage#13341 the issue that ARP entry is out of sync between kernel and APPL_DB
In AppRestartAssist::insertToMap, in case an entry has been updated more than once with the same value but different from the stored one, keep the state as NEW.

Eg.
Assume the entry's value that is restored from the warm reboot is V0, the following events received.

The first update with value V1 is received and handled by the if (found != appTableCacheMap[tableName].end()) branch,
* the state is set to NEW
* value is updated to V1
The second update with the same value V1 is received and handled by this branch
Originally, the state was set to SAME, which is wrong because V1 is different from the stored value V0
The correct logic should be: set the state to SAME only if the state is not NEW
This is a very rare case because most of times, the entry won't be updated multiple times

- Why I did it
To fix the issue.

- How I verified it
Mock test is added to cover the case.

Signed-off-by: Stephen Sun <stephens@nvidia.com>
@stephenxs
Copy link
Collaborator Author

Fixed by sonic-net/sonic-swss#2619

StormLiangMS pushed a commit to sonic-net/sonic-swss that referenced this issue Feb 10, 2023
…_DB if multiple updates are received from RTNL (#2619)

- What I did
Fix issue sonic-net/sonic-buildimage#13341 the issue that ARP entry is out of sync between kernel and APPL_DB
In AppRestartAssist::insertToMap, in case an entry has been updated more than once with the same value but different from the stored one, keep the state as NEW.

Eg.
Assume the entry's value that is restored from the warm reboot is V0, the following events received.

The first update with value V1 is received and handled by the if (found != appTableCacheMap[tableName].end()) branch,
* the state is set to NEW
* value is updated to V1
The second update with the same value V1 is received and handled by this branch
Originally, the state was set to SAME, which is wrong because V1 is different from the stored value V0
The correct logic should be: set the state to SAME only if the state is not NEW
This is a very rare case because most of times, the entry won't be updated multiple times

- Why I did it
To fix the issue.

- How I verified it
Mock test is added to cover the case.

Signed-off-by: Stephen Sun <stephens@nvidia.com>
StormLiangMS added a commit that referenced this issue Feb 17, 2023
Why I did it
Submodule advances:
sonic-utilities

8e8e6088 - [202211][dhcp_relay] Remove add field of vlanid to DHCP_RELAY table while adding vlan ([201811 sub-module] advance sub-modules: utilities, swss, swss-common #2679) (16 hours ago) [Yaqiang Zhu]
1400fb94 - [GCU] Ignore bgpraw in GCU applier (Fix sfputil indexing for 7170-Q59S20 #2623) (15 hours ago) [jingwenxie]
f76a6364 - [vlan] Refresh dhcpv6_relay config while adding/deleting a vlan ([sonic-py-swsssdk] Update submodule #2660) (15 hours ago) [Yaqiang Zhu]
7849e18d - [db_migrator] make LOG_LEVEL_DB migration more robust (Mellanox platform: attach queues 2 and 6 to lossy profile using generic buffer template #2651) (16 hours ago) [Stepan Blyshchak]
c7df6dfa - Fixed a bug in "show vnet routes all" causing screen overrun. (Add hook to allow customizing link cable lengths #2644) (16 hours ago) [siqbal1986]
a5505f02 - show logging CLI support for logs stored in tmpfs (Traceback error seen while issuing show interface commands with if_names #2641) (16 hours ago) [mihirpat1]
bbacb91a - [system-health] Fix issue: show system-health CLI crashes (Updating deb package for platform and sai #2635) (16 hours ago) [Junchao-Mellanox]
8d724024 - [sai_failure_dump]Invoking dump during SAI failure ([dockers]: Upgrade LLDP docker to stretch build #2633) (16 hours ago) [Sudharsan Dhamal Gopalarathnam]
3c3be526 - Add transceiver info CLI support to show output from TRANSCEIVER_INFO for ZR ([submodule]: Update sonic-sairedis pointer #2630) (16 hours ago) [mihirpat1]
37f41666 - [show] add support for gRPC show commands for active-active ([bitmap-vnet]: Bitmap vnet test image [DO NOT MERGE] #2629) (16 hours ago) [vdahiya12]
b06d7fe4 - [show_bfd] add local discriminator in show bfd command ([Pmon] Selectively load pmon container daemons #2625) (16 hours ago) [Baorong Liu]
6adcd3e8 - [GCU] Ignore bgpraw table in GCU operation ([Mellanox] Fix SAI version #2628) (16 hours ago) [jingwenxie]
c65bdc35 - [muxcable][config] Add support to enable/disable ceasing to be an advertisement interface when radv service is stopped (Add knob in ConfigDB to enable/disable telemetry container #2622) (16 hours ago) [Jing Zhang]
91e9457f - Add Transceiver PM basic CLI support to show output from TRANSCEIVER_PM table for ZR ([201803] Restart SwSS, syncd and dependent services if a critical process in syncd container exits #2615) (16 hours ago) [longhuan-cisco]
54cc8c5a - Remove TODO comment which is no longer relevant (Warm-reboot: teamd warm restart caused neighbor deleted and learned again.  #2600) (16 hours ago) [Lior Avramov]
6891b4fb - Making 'show feature autorestart' more resilient to missing auto_restart config in CONFIG_DB ([submodule] update mellanox hw-mgmgt pointer (V.2.0.0061) #2592) (16 hours ago) [kartik-arista]
1e8bea37 - [storyteller] add link prober state change to story teller ([sonic-buildimage] New feature managementVRF(L3mdev) #2585) (16 hours ago) [Jing Zhang]
7481a20f - Extend fast-reboot STATE_DB entry timer ([submodule]: update sonic-swss-common, sonic-py-swsssdk, sonic-snmpagent #2577) (16 hours ago) [Aryeh Feigin]
0e08701c - [sonic_installer] use /etc/resolv.conf from the host when migrating packages (Set a rate limit on syslog messages from all Docker containers #2573) (16 hours ago) [Stepan Blyshchak]
06096780 - Fixed admin state config CLI for Backport interfaces (Prior to install a new ONIE SONiC image, delete all partitions except EFI/ONIE #2557) (16 hours ago) [anamehra]
9f1f13e4 - [show] Add bgpraw to show run all (Fixed typo on paragraph #40 #2537) (16 hours ago) [jingwenxie]
98bc8bd2 - [chassis][voq] Add "show fabric reachability" command. ([ntp]: Build 4.2.6 locally. #2528) (16 hours ago) [jfeng-arista]
3a50b63f - Preserve copp tables through DB migration ([docker-radvd]: upgrade docker radvd to stretch based #2524) (16 hours ago) [Aryeh Feigin]
28f6b127 - [masic] 'show interfaces counters' reminds to use '-d all' option to check for internal links (solve dependency issue #2466) (16 hours ago) [wenyiz2021]
15026e14 - suppport multi asic for show queue counter ([dockers] Prevent old supervisord messages from gettting re-logged to syslog #2439) (16 hours ago) [zhixzhu]
2d773e17 - [masic support] 'show run bgp' support for multi-asic (lo address not synced to the asic #2427) (16 hours ago) [wenyiz2021]
sonic-swss

4f304bc - [EVPN]Handling race condition when remote VNI arrives before tunnel map entry ([sonic-quagga] Function defect, do NOT cancel route while connect IP down #2642) (15 hours ago) [Sudharsan Dhamal Gopalarathnam]
34fc615 - [sai_failure_dump]Invoking dump during SAI failure (Add hook to allow customizing link cable lengths #2644) (15 hours ago) [Sudharsan Dhamal Gopalarathnam]
b817695 - [autoneg]Fixing adv interface types to be set when AN is disabled (Fix issue with platform file path name #2638) (15 hours ago) [Sudharsan Dhamal Gopalarathnam]
ab36bd4 - [bfdorch] add local discriminator to state DB ([bitmap-vnet]: Bitmap vnet test image [DO NOT MERGE] #2629) (15 hours ago) [Baorong Liu]
6343471 - Remove TODO comments that are no longer relevant (Add knob in ConfigDB to enable/disable telemetry container #2622) (15 hours ago) [Lior Avramov]
2b1869c - [refactor]Refactoring sai handle status (Rollback kernel submodule update. #2621) (15 hours ago) [Sudharsan Dhamal Gopalarathnam]
c41a1b7 - Fix issue ARP entry is out of sync between kernel and APPL_DB after warm reboot if the ARP entry is updated more than once during warm reboot in PFC watchdog warm reboot test #13341 ARP entry can be out of sync between kernel and APPL_DB if multiple updates are received from RTNL ([sub module] advance sonic-utilities sub module for 201811 branch #2619) (15 hours ago) [Stephen Sun]
da0cf7a - Changed the BFD default detect multiplier to 10x ("failed to load plugin io.containerd.snapshotter..." seen during linux boot up #2614) (15 hours ago) [siqbal1986]
13b5adf - [vstest] Only collect stdout of orchagent_restart_check in vstest ([submodules] update swss and utilities pointers #2597) (15 hours ago) [bingwang-ms]
2b9d94d - Avoid aborting orchagent when setting TUNNEL attributes (build failing for PLATFORM=p4 #2591) (15 hours ago) [Stephen Sun]
99b7d3b - Only collect stdout of orchagent_restart_check in vstest ( [saibcm-modules]: import new bcm modules #2578) (15 hours ago) [bingwang-ms]
5209c42 - dereg acl-rule counters during acl-table del ([201803] Set a rate limit on syslog messages from all Docker containers #2574) (15 hours ago) [Vivek]
ae68054 - Fixed set mtu for deleted subintf due to late notification ([vs]: Add option to specify platform name for DVS orchagent #2571) (15 hours ago) [EdenGri]
ab13dfa - Remove TODO comments which are no longer needed (support set timezone in ConfigDB #2568) (15 hours ago) [Junchao-Mellanox]
a3545cf - Modify coppmgr mergeConfig to support preserving copp tables through reboot. (Added new SN3700/SN3700C Mellanox platforms #2548) (15 hours ago) [Aryeh Feigin]
be16e79 - Use github code scanning instead of LGTM ([201803] [services] Restart SwSS service upon unexpected critical process exit #2546) (15 hours ago) [Liu Shilong]
63c0234 - Updated handling of VRF_VNI mapping and VLAN_VNI mapping for same VNI ID (Move warm_restart enable/disable config to stateDB WARM_RESTART_ENABL… #2538) (15 hours ago) [Tapash Das]
4844111 - Fix potential risks ([mlnx] Fix sai xml path for boxer platform #2516) (15 hours ago) [Liran-Ar]
6420808 - [p4orch]: PINS Extension tables support ([build] When generating image version, handle case where current commit has no reachable tags #2506) (15 hours ago) [svshah-intel]
sonic-swss-common

1badd46 - Increase the netlink buffer size from 3MB to 16MB. (arp_update doesn't sleep 300 between each execution #739) (14 hours ago) [KISHORE KUNAL]
6555057 - Refactor eventpublisher deinit ([acl] Add default deny rule for l3 table #734) (14 hours ago) [Zain Budhwani]
f4d6de7 - Use github code scanning instead of LGTM ([sonic-quagga]:update submodule #718) (14 hours ago) [Liu Shilong]
sonic-linux-kernel

74f9a8f - Update linux kernel for hw-mgmt V.7.0020.4104 (Move template files to /usr/share/sonic/templates #305) (14 hours ago) [Stephen Sun]
6365701 - Fixes for emmc unreliability ([build_debian.sh]: Integrate system dump script #270) (14 hours ago) [Samuel Angebault]
How I did it
How to verify it
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
NVIDIA Triaged this issue has been triaged
Projects
None yet
Development

No branches or pull requests

2 participants