New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
File states seem slower in 2016.3, especially on first cache retrieval #33575
Comments
Similar experience after upgrade to 2016.3.0. First run of highstate on clean install is significantly slower. Subsequent runs are also slower vs 2015.8.10 Salt Version: Dependency Versions: System Versions: |
@anlutro I am able to replicate this and am seeing a performance difference with a small test case. Here's the test case:
When running the following: on 2015.8.10:
on 2016.3.0
|
@cachedout do you think that this might be part of the multimaster cacheing file client fix? |
Hmmm, that's a good thought. Could be. Trying to find the culprit with a |
Looks like I have the same problem.
It took ~4 seconds to copy a file with 2891 bytes!! |
Yeah we're experiencing this issue too... a job that used to take like 10 minutes now takes 3 hours. What takes time is copying about 8000 images from the master. |
we also have this problem, a highstate takes now about 15m on each server, so for now i reverted to back to 2015.8. |
Yes, we are still hunting this one down. |
This should be resolved here and available in 2016.3.1: |
Phew! Thanks @cachedout ! I thought this had been missed for a minute! |
so can this be closed? |
Fixed by #33896 |
Just upgraded again to 2016.3.1 and still takes a long time to process... the salt-minion keeps using 100% of the CPU until it finishes the highstate. salt-minion -l garbage i get this log showing always a AsyncZeroMQReqChannel and each 1~2 seconds i finally get a normal state being applied:
Doing a strace -enetwork -f shows in the loop this:
So this query is being done several times per second and with the encryption, it eats all the CPU of the machine |
reverting to the 2015.8, the |
I can't reproduce this problem in 2016.3.1, I'll leave another comment if it should pop up again. |
@markahopper this (specifically what @danielmotaleite pointed out) is huge for us. |
Oh, odd. I was sure it was merged before 2016.3.1. I guess it's just a coincidence that I can't reproduce it. |
oohh, i see! as this ticket was marked as fixed, closed and then the 2016.3.1 was released, we all assume this version had the fix. I tested with 2016.3.1 and applied the fix patch...and it worked! Thanks, i will wait for the 2016.3.2 release |
I upgraded to 2016.3 in my VMs and file states seem significantly slower on the first pass (10-20 times slower). However, on subsequent
state.apply
calls, things seem fine again.For every file that the minion has to fetch from the master, I see about 30-40 occurances of these log lines in the minion log:
Followed by:
Then some more of the zeromq-related log lines, then this:
I had a look at logs from before the upgrade and there were nowhere near the same amounts of log lines regarding re-auth.
The text was updated successfully, but these errors were encountered: