
[ALBERT] Has anyone reproduced ALBERT's scores on the GLUE dataset? #99

Closed
lonePatient opened this issue Oct 30, 2019 · 8 comments

Comments

@lonePatient

I converted the TF weights to PyTorch weights, and on the QQP dataset I only get 87% accuracy with the settings below (a code sketch of this setup follows the list):

model: albert-base
epochs: 3
learning_rate: 2e-5
batch size: 24
max sequence length: 128
warmup_proportion: 0.1
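
A minimal sketch of that setup, assuming the HuggingFace transformers API for ALBERT; the dataloader and training loop are omitted, and the step count is a placeholder:

```python
# Rough fine-tuning setup matching the hyperparameters above.
# Assumes HuggingFace transformers; dataset loading is omitted.
from transformers import (AdamW, AlbertForSequenceClassification,
                          AlbertTokenizer, get_linear_schedule_with_warmup)

model = AlbertForSequenceClassification.from_pretrained(
    "albert-base-v1", num_labels=2)  # QQP is a binary task
tokenizer = AlbertTokenizer.from_pretrained("albert-base-v1")

epochs = 3
batch_size = 24
max_seq_length = 128
warmup_proportion = 0.1

# In a real run this would be len(train_dataloader) * epochs.
num_train_steps = 45_000  # placeholder
optimizer = AdamW(model.parameters(), lr=2e-5)
scheduler = get_linear_schedule_with_warmup(
    optimizer,
    num_warmup_steps=int(warmup_proportion * num_train_steps),
    num_training_steps=num_train_steps)
```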

@kamalkraj

https://github.com/kamalkraj/ALBERT-TF2.0 [WIP]
I got better accuracy on the CoLA dev set.

@wxp16

wxp16 commented Nov 1, 2019

> I converted the TF weights to PyTorch weights, and on the QQP dataset I only get 87% accuracy.
>
> model: albert-base
> epochs: 3
> learning_rate: 2e-5
> batch size: 24
> max sequence length: 128
> warmup_proportion: 0.1

On the MNLI dataset, using ALBERT base v1, I got the following results. Clearly, the accuracy is quite low.

eval_accuracy = 0.77962303
eval_loss = 0.5517804
global_step = 24543
loss = 0.5517709

@kamalkraj

kamalkraj commented Nov 1, 2019

> https://github.com/kamalkraj/ALBERT-TF2.0 [WIP]
> I got better accuracy on the CoLA dev set.

Dataset: MNLI
Model: ALBERT large v1
Dev accuracy: 0.8089
epochs: 3
max_seq_length: 128
batch_size: 128
learning_rate: 3e-5
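
A rough TF2 equivalent of the configuration above, sketched with HuggingFace's TF classes rather than the ALBERT-TF2.0 repo's own entry points (which may differ):

```python
# Sketch only: maps the hyperparameters above onto transformers' TF API.
import tensorflow as tf
from transformers import TFAlbertForSequenceClassification

model = TFAlbertForSequenceClassification.from_pretrained(
    "albert-large-v1", num_labels=3)  # MNLI has 3 labels
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=3e-5),
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=["accuracy"])
# model.fit(train_dataset.batch(128), epochs=3)  # inputs tokenized to length 128
```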

@lonePatient
Author

https://github.com/lonePatient/albert_pytorch

Dataset: MNLI
Model: ALBERT_BASE_V2
Dev accuracy: 0.8418

@kamalkraj

@lonePatient
Could you share the hyperparameters?
Max seq length?

@lonePatient
Author

@kamalkraj
--max_seq_length=128
--per_gpu_train_batch_size=16
--per_gpu_eval_batch_size=16
--spm_model_file=${BERT_BASE_DIR}/30k-clean.model
--learning_rate=1e-5
--num_train_epochs=3.0
--logging_steps=24544
--save_steps=24544
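
For what it's worth, those logging_steps/save_steps values look like exactly one MNLI epoch: with 392,702 training examples and a batch size of 16 (assuming a single GPU), one epoch is 24,544 optimizer steps, so the run logs and checkpoints once per epoch:

```python
# logging_steps = save_steps = steps per MNLI epoch (single-GPU assumption).
import math

mnli_train_examples = 392_702
per_gpu_train_batch_size = 16
steps_per_epoch = math.ceil(mnli_train_examples / per_gpu_train_batch_size)
print(steps_per_epoch)  # 24544
```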

@kamalkraj

@lonePatient
Dropouts? All 0?

@lonePatient
Author

@kamalkraj For fine-tuning, dropout rate = 0.1.
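
A minimal sketch of re-enabling that dropout for fine-tuning, assuming the HuggingFace AlbertConfig fields (the v2 checkpoints ship with dropout disabled):

```python
# Override the pretrained config's dropout settings for fine-tuning.
from transformers import AlbertConfig, AlbertForSequenceClassification

config = AlbertConfig.from_pretrained(
    "albert-base-v2",
    hidden_dropout_prob=0.1,           # re-enable dropout for fine-tuning
    attention_probs_dropout_prob=0.1,
    num_labels=3)                      # MNLI
model = AlbertForSequenceClassification.from_pretrained(
    "albert-base-v2", config=config)
```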

andrewluchen transferred this issue from google-research/google-research Jan 6, 2020