
How can I create a smaller file for inference of the 1.5B model? #22

Closed
pragnakalpdev6 opened this issue Dec 11, 2019 · 1 comment

Comments

@pragnakalpdev6

I am working with the GPT-2 1.5B model.
It is taking too much time for inference. How can I decrease the time taken by the model?
How can I optimize my model?

@ConnorJL
Owner

Unfortunately, the 1.5B model is just really, really big. You can batch your predictions or run on faster hardware, but there isn't much more you can do. Maybe there is some way to use distillation methods or the like to reduce the model size, but I'm not familiar with any research doing so with the GPT-2 model specifically.
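For anyone landing here later, a minimal sketch of the batching and faster-hardware suggestion, assuming you can use the Hugging Face `transformers` port of the 1.5B weights (their `gpt2-xl` checkpoint) rather than this repo's TensorFlow code; the model name, prompts, and generation parameters below are illustrative, not part of this project:

```python
# Hypothetical sketch: batched fp16 inference with the Hugging Face GPT-2 port.
# gpt2-xl is their name for the 1.5B-parameter checkpoint.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = GPT2Tokenizer.from_pretrained("gpt2-xl")
tokenizer.pad_token = tokenizer.eos_token   # GPT-2 has no pad token by default
tokenizer.padding_side = "left"             # left-pad so generation continues the real text

model = GPT2LMHeadModel.from_pretrained("gpt2-xl").to(device)
if device == "cuda":
    model = model.half()                    # fp16 roughly halves memory and speeds up GPU inference
model.eval()

# Batching several prompts into one forward pass amortizes the per-call overhead.
prompts = ["The meaning of life is", "In a shocking finding,"]
batch = tokenizer(prompts, return_tensors="pt", padding=True).to(device)

with torch.no_grad():
    out = model.generate(
        **batch,
        max_length=60,                      # total length including the prompt
        do_sample=True,
        top_k=40,
        pad_token_id=tokenizer.eos_token_id,
    )

for text in tokenizer.batch_decode(out, skip_special_tokens=True):
    print(text)
```

Batching plus fp16 on a GPU is usually where most of the easy speedup comes from; anything beyond that (distillation, quantization) means producing a genuinely smaller model, which, as noted above, hadn't been published for GPT-2 specifically at the time of this thread.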
