You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The documentation and paper make no claims on the comparison of performance between Quantized Neural Networks and their high-precision weight and bias counterparts. At the time of writing, there is clearly a trade-off (although the authors are working hard to close the gap). I suggest being transparent in this difference and possibly opening the option of educating users on when a few percents of accuracy on a specific dataset are essential and when not. This might be especially relevant for people working in the industry.
I think it would be good to add a small paragraph to our FAQ or the guides that briefly explains the story about efficiency and the challenges and of quantized networks as well as what theoretical speedups someone can expect. In general this always depends on the choice of networks and hardware used to run the models, but I agree that we should be transparent about this.
@jamescook106@koenhelwegen Do we have some resources we could use to elaborate on our claims on the home page?
The text was updated successfully, but these errors were encountered:
In openjournals/joss-reviews#1746 @sbrugman raised a very valid point that is currently missing in our guides and documentation:
I think it would be good to add a small paragraph to our FAQ or the guides that briefly explains the story about efficiency and the challenges and of quantized networks as well as what theoretical speedups someone can expect. In general this always depends on the choice of networks and hardware used to run the models, but I agree that we should be transparent about this.
@jamescook106 @koenhelwegen Do we have some resources we could use to elaborate on our claims on the home page?
The text was updated successfully, but these errors were encountered: