This repository has been archived by the owner on Nov 16, 2023. It is now read-only.

[ASK] Improve user experience for long running notebooks #213

Closed
yijingchen opened this issue Jul 30, 2019 · 7 comments
Labels
enhancement New feature or request

Comments

@yijingchen
Contributor

yijingchen commented Jul 30, 2019

Description

Some notebooks take a long time to run. For an external data scientist who wants to try them out quickly and see how things work, this is not a pleasant experience. Here are some ideas for improvement:

  • Add a note section to each notebook describing the machine configuration (e.g., number of GPUs) and the estimated time to finish running the notebook, so that users won't be surprised.
  • Alternatively, set the notebook defaults to run on smaller data with smaller parameters, and then add another section guiding users on changing to the larger experiment, so they know they'll face a long running time.
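The second idea could be sketched as a small parameters cell near the top of each notebook. This is only an illustration of the pattern; the names (`QUICK_RUN`, `TRAIN_SAMPLE_SIZE`, `NUM_EPOCHS`) are hypothetical and not taken from the repo:

```python
# Hypothetical parameters cell: defaults to a fast, small-scale run.
QUICK_RUN = True  # default: quick configuration for a first try

if QUICK_RUN:
    # small subsample and few epochs: finishes in minutes
    TRAIN_SAMPLE_SIZE = 1000
    NUM_EPOCHS = 1
else:
    # full dataset and full training: expect hours on the machines listed below
    TRAIN_SAMPLE_SIZE = None  # None = use all training examples
    NUM_EPOCHS = 3

print(TRAIN_SAMPLE_SIZE, NUM_EPOCHS)
```

A user who wants the reported model performance would flip the flag and re-run; everyone else gets a fast first experience by default.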

Notebook running time (Last update: 8/1/2019)

Machine: Azure DLVM Standard_NC12 with 2 GPUs

| Scenario | Notebook Name | CPU | GPU |
| --- | --- | --- | --- |
| entailment | entailment_xnli_multilingual | NA | ~20 hrs |
| name_entity_recognition | ner_wikigold_bert | ~37 mins | ~6 mins |
| embeddings | embedding_trainer | ~5 mins | ~5 mins |
| interpret_NLP_models | understand_models | ~4 mins | ~2 mins |
| text_classification | tc_mnil_bert | ~8.2 hrs | ~1.2 hrs |
@yijingchen yijingchen added the enhancement New feature or request label Jul 30, 2019
@daden-ms
Contributor

daden-ms commented Jul 30, 2019

I second the idea of making the notebooks run on smaller data with smaller parameters, or providing an option to do so. I have a similar issue for the entailment notebook (#215).

@yijingchen
Contributor Author

> I second the idea of making the notebooks run on smaller data with smaller parameters, or providing an option to do so. I have a similar issue for the entailment notebook (#215).

I believe @hlums is looking into this. I personally also prefer having the notebook run on smaller data by default, because it is very likely people won't read the instructions and will just click run.
Hong, I'm thinking maybe we can both add the note section and update the defaults. For those who want to know the true model performance, we can put the numbers in the note section and guide them on how to change the dataset/parameters to achieve the same performance.

@yijingchen
Contributor Author

@miguelgfierro FYI, I will keep updating the notebook end-to-end running times in this issue. I thought it could be useful for your notebook testing pipeline as well.

@hlums hlums self-assigned this Aug 1, 2019
@hlums
Collaborator

hlums commented Aug 1, 2019

@yijingchen @daden-ms I had a discussion with @saidbleik , and both of us think it's not ideal to run the notebooks on a smaller dataset by default, because the model performance in the notebook will look bad. Can you take a look at https://github.com/microsoft/nlp/blob/hlu/update_entailment_notebook_running_time/scenarios/entailment/entailment_multinli_bert.ipynb? Does this help improve the user experience?
If it helps, I plan to test all notebooks on a CPU machine and a machine with a single GPU, for both QUICK_RUN=True and QUICK_RUN=False, and provide the running time information.

@yijingchen
Contributor Author

@hlums This notebook looks great. You could also add a comment on the 'quick_run' line telling users they need to make changes on that line.
It would be great if we could have this consistent format across all the notebooks.
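The suggested inline comment might look like the following; the exact wording and the `QUICK_RUN` variable placement are illustrative, not copied from the notebook:

```python
# Hypothetical annotation on the flag line so users notice it.
# Set QUICK_RUN = False to reproduce the full results; this runs much longer
# (see the running-time table in this issue).
QUICK_RUN = True

print(QUICK_RUN)
```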

@hlums
Collaborator

hlums commented Aug 2, 2019

> @hlums This notebook looks great. You could also add a comment on the 'quick_run' line telling users they need to make changes on that line.
> It would be great if we could have this consistent format across all the notebooks.

Sounds great! I will work on that.

@daden-ms
Contributor

daden-ms commented Aug 6, 2019

This is awesome. Thanks!


3 participants