Commit
vcache: Replace score table with results of vcachetuner
The new tables were generated by a ~3-day, 64-core run of vcachetuner optimizing for AMD GCN and NVidia Pascal profiles.

In general, the new tables show small improvements on many meshes (in the 1-2% range), are mostly flat on other cache profiles (e.g. Intel), and sometimes incur a <1% penalty. The average improvement on a test mesh set is around 1%.

One distinguishing mesh, however, is a regular grid, where the new tables are ~6% better on NVidia, ~9% better on AMD, and ~10% better on Intel. With the new tables the results are much closer to the FIFO algorithm, so unless optimization time is an issue this algorithm is finally on par with or better than FIFO (on regular grids it is still slightly worse on AMD, with ~2% worse ACMR, but slightly better on NVidia).

Finally, it's worth noting that since the new tables were learned tabula rasa, the insight-driven scoring formulas from before need not apply. It's also possible to retrain the algorithm targeting particular hardware to get better results (tuning *just* for AMD ends up producing even better results).
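
For readers unfamiliar with the metric: ACMR (average cache miss ratio) is the number of vertex cache misses divided by the number of triangles, so lower is better. The sketch below is illustrative only and is not the code used by vcachetuner. It assumes a plain FIFO cache of a fixed size, which is much simpler than the AMD GCN and NVidia Pascal profiles mentioned above, and the helper names `fifoAcmr` and `makeGrid` are hypothetical.

```cpp
#include <algorithm>
#include <cstddef>
#include <deque>
#include <vector>

// Hypothetical helper: ACMR of an index buffer under a plain FIFO vertex cache.
// Assumption: the cache holds `cache_size` vertices and a hit does not reorder it.
double fifoAcmr(const std::vector<unsigned int>& indices, std::size_t cache_size)
{
    std::deque<unsigned int> cache;
    std::size_t misses = 0;

    for (unsigned int v : indices)
    {
        if (std::find(cache.begin(), cache.end(), v) != cache.end())
            continue; // cache hit: nothing to do for FIFO

        ++misses;
        cache.push_back(v);

        if (cache.size() > cache_size)
            cache.pop_front(); // evict the oldest entry
    }

    std::size_t triangles = indices.size() / 3;
    return triangles == 0 ? 0.0 : double(misses) / double(triangles);
}

// Hypothetical helper: index buffer for a regular grid of w x h quads,
// emitted row by row with two triangles per quad.
std::vector<unsigned int> makeGrid(unsigned int w, unsigned int h)
{
    std::vector<unsigned int> indices;
    indices.reserve(std::size_t(w) * h * 6);

    for (unsigned int y = 0; y < h; ++y)
    {
        for (unsigned int x = 0; x < w; ++x)
        {
            unsigned int i0 = y * (w + 1) + x; // top-left vertex of the quad
            unsigned int i1 = i0 + 1;          // top-right
            unsigned int i2 = i0 + (w + 1);    // bottom-left
            unsigned int i3 = i2 + 1;          // bottom-right

            indices.insert(indices.end(), {i0, i2, i1, i1, i2, i3});
        }
    }

    return indices;
}
```

Calling `fifoAcmr(makeGrid(w, h), cache_size)` before and after reordering the index buffer gives a rough way to compare orderings on a regular grid. With a cache large enough to hold every vertex, the result reduces to vertex count divided by triangle count, which is the best any ordering can achieve.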