New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EOS Configs Truncated #2038
Comments
Output of oxidized --show-exhaustive-config is below:
|
I am seeing this as well with EOS devices. It truncates the configuration one minute and then downloads it again the next hour. Our network backup repository is now at 1500 commits. 70% of that is from arista devices just truncating and updating over and over again. |
Unfortunately, we are still having this issue and have been unable to fix
it. Saku was working with us on this, but was unable to reproduce the
issue when he tried from his server to a test device on our side. If you
find a fix, please let me know.
…On Thu, Jun 3, 2021 at 10:26 AM Nick ***@***.***> wrote:
I am seeing this as well with EOS devices. It truncates the configuration
one minute and then downloads it again the next hour. Our network backup
repository is now at 1500 commits. 70% of that is from arista devices just
truncating and updating over and over again.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#2038 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AOHJ7HJPI7JRBOWC2KVKC6LTQ6GJNANCNFSM4K4N4C3A>
.
|
Unfortunately, no love here... Anyone able to find a fix for this? @vincent516 perchance are you using 7050's? |
We are not using 7050. We have a mix of Arista models and it happens across all of them unfortunately. |
Hello! We're experiencing the same issue. Our environment contains dozens of switches, about half being Arista and half being Juniper. Out of all these hosts, only 3 or so Arista switches experience this issue. Like @edchiodo saw, our SSH debug output shows the full configuration file successfully being retrieve over SSH, ruling out an SSH-specific issue. We see several hundred lines missing between the raw SSH data and what oxidized outputs:
We're unable to find the root cause of this, my colleagues and I thought perhaps some rogue escape character or malformed character was located in the file causing Oxidized to cut the config short, but looking at where the Oxidized cutoff occurs, there are no non-standard characters, and the location of the cutoff appears completely random. Has anyone been able to find a resolution at this time? @f0rkz If it matters at all, we're seeing this issue exclusively on 7050s at this time. |
We still haven't been able to find a resolution.
…On Thu, Dec 9, 2021 at 4:45 PM Justin Goetz ***@***.***> wrote:
Hello!
We're experiencing the same issue. Our environment contains dozens of
switches, about half being Arista and half being Juniper. Out of all these
hosts, only 3 or so Arista switches experience this issue.
Like @edchiodo <https://github.com/edchiodo> saw, our SSH debug output
shows the full configuration file successfully being retrieve over SSH,
ruling out an SSH-specific issue. We see several hundred lines missing
between the raw SSH data and what oxidized outputs:
# git show :lf220a.pit1.teraswitch.com | wc -l
1720
# cat 10.100.10.121-ssh | wc -l
2856
We're unable to find the root cause of this, my colleagues and I thought
perhaps some rogue escape character or malformed character was located in
the file causing Oxidized to cut the config short, but looking at where the
Oxidized cutoff occurs, there are no non-standard characters, and the
location of the cutoff appears completely random.
Has anyone been able to find a resolution at this time?
@f0rkz <https://github.com/f0rkz> If it matters at all, we're seeing this
issue exclusively on 7050s at this time.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#2038 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AOHJ7HKLHEOHWZBGONPENUTUQEPOXANCNFSM4K4N4C3A>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
|
I've been having this issue, and then after digging into it I think I've fixed it by tweaking the prompt in lib/oxidized/model/eos.rb. Current:
Fixes the issue for me (On DCS-7050):
Configs are now being pulled consistently in full. I presume the \s? is there for a reason for other Arista models? I assume a new/extended model is needed rather than editing the plain eos.rb one? |
Huge thanks to @jake2184 for the find! I can confirm, we applied the fix to our environment and have not seen the issue re-occur for several days now on any DCS-7050 in our inventory at this time. |
Fix verified on EOS 4.27.2F. This is device independent.
Fixed in #2489 |
Running Oxidized with Docker:
Longer configs for Arista EOS devices are being truncated when writing to disk after a seemingly random number of lines. For instance, when connecting to switch01.domain.net (10.0.0.1) the debug log "10.0.0.1-ssh" shows the complete config with the following commands executed:
However, when comparing the git output to the debug file, we see the output is missing ~450 lines:
I was originally pointed to a possible prompt desync issue, but after reviewing the configs there is nothing that matches the regex expression
/^.+[#>]\s?$/
.Even more odd, where the truncation begins seems to change between Oxidized runs. For instance, switch01.domain.net may have 987 lines backed up on run 1, but on run 2 (4 hours later) only 833 lines are written to output, and on run 3, 1083 lines are written to output...despite no configuration changes being made on the switch.
Currently seeing this behavior on approximately 100 Arista EOS switches running multiple versions of EOS (4.19, 4.20, 4.21). Not seeing similar behavior with IOS, NXOS, MLNXOS, or PANOS devices, which all seem to function correctly.
The text was updated successfully, but these errors were encountered: