
Report your verification accuracy of new training dataset 'insightv2_emore'. #215

nttstar opened this issue May 15, 2018 · 70 comments
@nttstar (Collaborator) commented May 15, 2018

A new training dataset, 'insightv2' (code name emore, still largely based on ms1m), is available on Baidu Cloud and OneDrive (from @gdshen); it can achieve better accuracy easily. I hope everyone who uses insightface will post their training accuracy and details here to show the strength of our network backbone, dataset and loss function. (Currently I will not provide pretrained models.)
(More training dataset contributions are welcome; please email me.)

The format may look like the example below (taken from one of my experiments):

  1. dataset: emore (or faces_ms1m for the old one)
  2. network backbone: r50 ( res_unit=3, output=E, emb_size=512, prelu )
  3. loss function: arcface(m=0.5)
  4. training pipeline: straightforward (lr drop at 100K, 140K, 160K), batch-size:512
  5. Highest LFW: 99.83%; Highest CFP_FP: 97.67%; Highest AgeDB30: 98.10%

nttstar added the Example label on May 15, 2018

@Wisgon commented May 16, 2018

OK, I will try it as soon as you release the new dataset.

@Drsea123 commented May 16, 2018

Excuse me, what should I do if I want to make new train.rec and train.idx files by myself? I can't find the code for #make_list(args) in face2rec2.py.
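In case it helps anyone with the same question, here is a minimal, hypothetical sketch (not taken from face2rec2.py) that writes an im2rec-style .lst file from a directory of per-identity folders of aligned faces. face2rec2.py may expect additional columns or a property file, so check the script before relying on this format:

import os

def write_lst(root, out_path):
    # Assumed directory layout: root/<identity>/<image>.jpg
    # Each output line: "<index>\t<label>\t<relative_path>"
    idx = 0
    with open(out_path, 'w') as f:
        for label, person in enumerate(sorted(os.listdir(root))):
            person_dir = os.path.join(root, person)
            if not os.path.isdir(person_dir):
                continue
            for img in sorted(os.listdir(person_dir)):
                f.write('%d\t%d\t%s\n' % (idx, label, os.path.join(person, img)))
                idx += 1

write_lst('../datasets/my_faces', '../datasets/my_faces/train.lst')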

@Drsea123 commented May 16, 2018

nttstar changed the title from "Report your verification accuracy for new training dataset 'faces_emore'." to "Report your verification accuracy of new training dataset 'insightv2_emore'." on May 16, 2018

@HaoLiuHust commented May 16, 2018

Do you have a plan to release the raw images rather than the rec file?

@nttstar (Collaborator, Author) commented May 16, 2018

@HaoLiuHust no such plan.

@meanmee commented May 17, 2018

Could you use a different cloud drive or another sharing method? Baidu Netdisk throttles download speed unless you have a premium membership.

@xxllp commented May 17, 2018

Could you describe what this new dataset contains?

@yxchng commented May 17, 2018

What is the difference between this dataset and MS-Celeb? Is it cleaner?

@Wisgon commented May 19, 2018

  1. dataset: emore
  2. network backbone: MobileFaceNet(y1)
  3. loss function: arcface
  4. training pipeline: batch-size:320
  5. Highest LFW: 99.35%; Highest CFP_FP: 91.27%; Highest AgeDB30: 94.43%; training accuracy stays around 0.20 to 0.21

I only have two 1070 Ti GPUs, so my batch size can't be larger than about 350. My training command is:

CUDA_VISIBLE_DEVICES='0,1' python -u train_softmax.py --network y1 --loss-type 4 --margin-s 64.0 --margin-m 0.5 --per-batch-size 160 --emb-size 128 --ckpt 2 --data-dir ../datasets/faces_emore --wd 0.00004 --fc7-wd-mult 10.0 --lr 0.01 --prefix ../mobile/model-mobilefacenet0001

The final argument list:

(batch_size=320, beta=1000.0, beta_freeze=0, beta_min=5.0, bn_mom=0.9, ckpt=2, ctx_num=2, cutoff=0, data_dir='../datasets/faces_emore', easy_margin=0, emb_size=128, end_epoch=100000, fc7_wd_mult=10.0, gamma=0.12, image_channel=3, image_h=112, image_w=112, loss_type=4, lr=0.01, lr_steps='', margin=4, margin_a=1.0, margin_b=0.0, margin_m=0.5, margin_s=64.0, max_steps=0, mom=0.9, network='y1', num_classes=85742, num_layers=1, per_batch_size=160, power=1.0, prefix='../fine_turn_0001/model-mobilefacenet0001-fine', pretrained='', rand_mirror=1, rescale_threshold=0, scale=0.9993, target='lfw,cfp_fp,agedb_30', use_deformable=0, verbose=2000, version_act='prelu', version_input=1, version_output='E', version_se=0, version_unit=3, wd=4e-05)

Which argument can I change to improve the result?

@ShiyangZhang commented May 19, 2018

@Wisgon you may try to change the learning rate. For example, set --lr 0.1 --lr-step 55000,85000,100000,110000
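For reference, a hypothetical version of the earlier command with those flags added (the flag may be spelled --lr-steps in train_softmax.py, so check the script's argument parser first):

CUDA_VISIBLE_DEVICES='0,1' python -u train_softmax.py --network y1 --loss-type 4 --margin-s 64.0 --margin-m 0.5 --per-batch-size 160 --emb-size 128 --ckpt 2 --data-dir ../datasets/faces_emore --wd 0.00004 --fc7-wd-mult 10.0 --lr 0.1 --lr-steps 55000,85000,100000,110000 --prefix ../mobile/model-mobilefacenet0001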

@Wisgon commented May 19, 2018

Thank you, I will try it later @ShiyangZhang

@auroua commented May 21, 2018

How can I get the image list? I'm using a face detection and alignment method that is different from MTCNN. Thanks!

@yangfly commented May 21, 2018

  1. dataset: emore
  2. network backbone: r34 ( res_unit=3, output=E, emb_size=256, prelu)
  3. loss function: arcface(m=0.5)
  4. training pipeline: straightforward (lr drop at 100K, 125K, 150K), batch-size:1024
  5. Highest LFW: 99.817%; Highest CFP_FP: 97.371%; Highest AgeDB30: 97.867%

@bruinxiong commented May 22, 2018

@nttstar
Could you share more detailed information about this new dataset? I only have a little information:
dataset: emore
It includes 85742 identities, more than the previous 85164.
It includes 5822653 images, more than the previous 3804846.
As you mentioned, this emore dataset is still largely based on ms1m. Identities increase by only 578, while images increase by more than 2 million.
Did you consider data balance when you collected, cleaned and tidied up the dataset, such as removing identities that have very few or very many images?

@xmuszq commented May 22, 2018

@yangfly Did you use alignment, and how much does the alignment contribute to the final accuracy? Also, you used a large batch size of 1024; does this also contribute to the accuracy compared with a smaller batch size like 50 or 100?

@yangfly commented May 22, 2018

@xmuszq

  • No extra alignment is used. Accuracy is obtained via --target in train_softmax.py.
  • The highest accuracy is actually evaluated using one of the final models.
  • I found that batch_size=1024 was slightly more accurate than batch_size=512.

@RunningLeon commented May 22, 2018

@nttstar
Could you kindly share your trained model on emore?
Thanks a lot.

@blessxu commented May 24, 2018

I think BaiduYun is not a good place to share such a dataset. For some reason, Baidu blocks some IPs; for example, one of my IPs is on the Taiwan education network, so BaiduYun gives me a 404.

@meanmee commented May 24, 2018

dataset: emore
network backbone: r100 ( res_unit=3, output=E, emb_size=512, prelu)
loss function: cosine(m=0.35)
training pipeline: straightforward , batch-size:128*8
Highest LFW: 99.8%+; Highest CFP_FP: 98%+; Highest AgeDB30: 98%+

@YaqiLYU commented May 25, 2018

dataset: eMS1M (emore)
network backbone: y2 (a simplified version of MobileFaceNet, which has 0.74M params and 94M M-Adds)
loss function: arcface (m=0.5, s=64)
input_size: 112*112
batch_size: 512
training pipeline:
Stage 1: pre-training with m=0.1. lr keep 0.1, end when acc plateaus.
Stage 2: normal-training with m=0.5. lr starts from 0.1 and is divided by 10 when the acc plateaus.
Highest LFW: 99.55%
Highest CFP_FP: 92.23%
Highest AgeDB30: 95.20%
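A rough sketch of how such a two-stage schedule could be launched with train_softmax.py (the --pretrained flag appears in the argument dump earlier in this thread; its exact format, the y2 network name, and the paths and epoch number below are assumptions that should be checked against the script):

# Stage 1: pre-train with a small margin and a constant lr of 0.1 until accuracy plateaus
CUDA_VISIBLE_DEVICES='0,1,2,3' python -u train_softmax.py --network y2 --loss-type 4 --margin-m 0.1 --margin-s 64.0 --per-batch-size 128 --data-dir ../datasets/faces_emore --lr 0.1 --prefix ../models/y2-stage1

# Stage 2: resume from a stage-1 checkpoint with the full margin m=0.5, dropping the lr by 10x whenever accuracy plateaus
CUDA_VISIBLE_DEVICES='0,1,2,3' python -u train_softmax.py --network y2 --loss-type 4 --margin-m 0.5 --margin-s 64.0 --per-batch-size 128 --data-dir ../datasets/faces_emore --lr 0.1 --pretrained ../models/y2-stage1,20 --prefix ../models/y2-stage2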

@gdshen commented May 26, 2018

I have uploaded the dataset to OneDrive; I hope it helps.

@allenxcp commented May 28, 2018

dataset: emore
network backbone: Resnet 152
loss function: arcface (m=0.5, s=64)
input_size: 112*112
batch_size: 512

call reset()
testing verification..
(12000, 512)
infer time 20.323823
[lfw][307045]XNorm: 22.521362
[lfw][307045]Accuracy-Flip: 0.99800+-0.00245
testing verification..
(14000, 512)
infer time 22.382543
[cfp_ff][307045]XNorm: 20.382610
[cfp_ff][307045]Accuracy-Flip: 0.99857+-0.00181
testing verification..
(14000, 512)
infer time 22.360029
[cfp_fp][307045]XNorm: 22.274495
[cfp_fp][307045]Accuracy-Flip: 0.98114+-0.00657
testing verification..
(12000, 512)
infer time 18.985229
[agedb_30][307045]XNorm: 22.957849
[agedb_30][307045]Accuracy-Flip: 0.98300+-0.00733

@blessxu commented May 30, 2018

  1. dataset: emore
  2. network backbone: y1 ( emb_size=512, prelu )
  3. loss function: arcface(m=0.5)
  4. training pipeline: [40000, 54000, 60000] batch-size:1024
  5. Highest LFW:0.99383; Highest CFP_FP: 0.91871; Highest AgeDB30: 0.95783

@lzg188 commented Jun 1, 2018

How is the Highest LFW computed? Is the variance added to it?

@qidiso commented Jun 10, 2018

Now we get even higher accuracy using my modified MobileNet network:

[lfw][12000]Accuracy-Flip: 0.99617+-0.00358
[agedb_30][12000]Accuracy-Flip: 0.96017+-0.00893

@bigbao9494 commented Jun 11, 2018

@meanmee There is a cracked version of Baidu Netdisk that does not limit download speed.

@YaqiLYU commented Jun 12, 2018

dataset: eMS1M (emore)
network backbone: private CNN, which has 0.75M params (3/4 of y1) and 57M M-Adds (1/4 of y1)
loss function: arcface (m=0.5, s=64)
input_size: 96x80
batch_size: 512
training pipeline:

  • Stage 1: pre-training with m=0.1. lr kept at 0.1 for [50K] iters
  • Stage 2: normal training with m=0.5. lr 0.1, divided by 10 at [250K, 300K, 340K, 370K, 400K] iters

result:

  • Highest LFW: 99.42%
  • Highest CFP_FP: 91.39%
  • Highest AgeDB30: 95.10%

@yoookoo commented Jun 12, 2018

  1. dataset: EMore
  2. network backbone: r100
  3. loss function: arcface (m=0.5, scale=64)
  4. input size: 112x112
  5. batch size: 2024(32 GPUs)
  6. lr: 0.4
  7. lr steps: 37500, 52500, 60000
  8. max steps: 75000
  9. One megaface: 97.43
  10. One LFW: 99.8; One AGEDB_30: 98.13; One CFP_FP: 95.37, One CFP_FF:99.93

@HaoLiuHust commented Jul 24, 2018

@bruinxiong can you share your implementation for c116?

@HaoLiuHust commented Jul 31, 2018

@bruinxiong how is your training speed?

@alvenchen commented Aug 16, 2018

dataset: emore (or faces_ms1m for the old one)
network backbone: r100 ( output=E, emb_size=512, prelu )
loss function: arcface(m=0.5)
lr_steps [105000, 125000, 150000], batch-size:256, 4gpu
on epoch 21:
one LFW: 99.817; CFP_FP: 98.17; AgeDB30: 98.06

@bruinxiong commented Aug 22, 2018

We plan to release the modified CRU-Net-116 architecture to support this community. If you are interested, please follow this link: https://github.com/bruinxiong/Modified-CRUNet-and-Residual-Attention-Network.mxnet. Thanks!

@xmuszq commented Aug 22, 2018

@bruinxiong May I ask what kind of GPU you are using to train this model?

dataset: EMore
network backbone: c116 ( res_unit=3, output=E, emb_size=256, prelu ), CRU_Net
loss function: arcface(m=0.5)
aug_color = 0.1 for data color augmentation
random_crop = 0.9
input size: 112x112
batch size: 80(8 GPUs)
lr: 0.1
lr steps: [80000, 160000, 320000, 400000, 560000]
LFW: 0.99867+-0.00180; CFP_FP: 0.98386+-0.00542; CFP_FF: 0.99729+-0.00216; CALFW: 0.95933+-0.01119; CPLFW:0.92833+-0.01624; AgeDB30: 0.97983+-0.00765; all @epoch 36

@bruinxiong commented Aug 22, 2018

@xmuszq We use 8 Nvidia Titan X GPUs (Pascal architecture) with 12 GB of memory each.

@mariolew commented Aug 27, 2018

@bruinxiong Hi, did you figure out the difference between ms1m and emore?

@sharonjunjun commented Sep 30, 2018

dataset: emore (or faces_ms1m for the old one)
network backbone: r100 ( output=E, emb_size=512, prelu )
loss function: arcface(m=0.5)
lr_steps [105000, 125000, 150000], batch-size:256, 4gpu
on epoch 21:
one LFW: 99.817; CFP_FP: 98.17; AgeDB30: 98.06

@nttstar @alvenchen can you tell me how you adjusted the lr_steps? I reimplemented it in TensorFlow but cannot get a good result; I think the problem may be the lr_steps.

@erichouyi commented Sep 30, 2018

dataset: emore
network backbone: mobilefacenet + GNAP block
loss function: arcface(m=0.5)
training pipeline: finetune (lr drop at 100K, 140K, 160K), batch-size:512
on epoch 52: LFW 99.60%, CFP-FP 93.46%, AgeDB 95.45%

@stupiding commented Sep 30, 2018

dataset: emore
network backbone: r100 ( output=E, emb_size=512, prelu )
loss function: arcface(m=0.5)
lr_steps [105000, 125000, 150000], end with 180001, batch-size:256, 4gpu
then retrain with lr = 0.01, lr_steps[200000, 300000, 400000]
one LFW: 99.82; CFP_FP: 98.50; AgeDB30: 98.25

@tuoyuxiang commented Sep 30, 2018

dataset: emore
network backbone: r100 ( output=E, emb_size=512, prelu )
loss function: arcface(m=0.5)
lr_steps [105000, 125000, 150000], end with 180001, batch-size:256, 4gpu
then retrain with lr = 0.01, lr_steps[200000, 300000, 400000]
one LFW: 99.82; CFP_FP: 98.50; AgeDB30: 98.25

How about on MegaFace?

@sharonjunjun commented Sep 30, 2018

dataset: emore
network backbone: r100 ( output=E, emb_size=512, prelu )
loss function: arcface(m=0.5)
lr_steps [105000, 125000, 150000], end with 180001, batch-size:256, 4gpu
then retrain with lr = 0.01, lr_steps[200000, 300000, 400000]
one LFW: 99.82; CFP_FP: 98.50; AgeDB30: 98.25

@stupiding can you tell me how to debug the lr_steps? Which indicators should I look at?

@stupiding commented Sep 30, 2018

dataset: emore
network backbone: r100 ( output=E, emb_size=512, prelu )
loss function: arcface(m=0.5)
lr_steps [105000, 125000, 150000], end with 180001, batch-size:256, 4gpu
then retrain with lr = 0.01, lr_steps[200000, 300000, 400000]
one LFW: 99.82; CFP_FP: 98.50; AgeDB30: 98.25

How about on MegaFace?

I have no MegaFace account, so it has not been tested for now.

@stupiding commented Sep 30, 2018

dataset: emore
network backbone: r100 ( output=E, emb_size=512, prelu )
loss function: arcface(m=0.5)
lr_steps [105000, 125000, 150000], end with 180001, batch-size:256, 4gpu
then retrain with lr = 0.01, lr_steps[200000, 300000, 400000]
one LFW: 99.82; CFP_FP: 98.50; AgeDB30: 98.25

@stupiding can you tell me how to debug the lr_steps? Which indicators should I look at?

@sharonjunjun I don't understand what you mean by "debug the lr_steps"; I just adjust the lr with SGD when the loss plateaus.
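For anyone who prefers a fixed schedule instead of watching the loss, here is a minimal sketch (assumed, not taken from train_softmax.py) of how lr_steps like [105000, 125000, 150000] map onto MXNet's built-in scheduler with SGD; the momentum, weight decay and batch size below are illustrative placeholders:

import mxnet as mx

# The learning rate starts at 0.1 and is multiplied by 0.1 at each listed iteration.
lr_scheduler = mx.lr_scheduler.MultiFactorScheduler(step=[105000, 125000, 150000], factor=0.1)

# Plug the scheduler into plain SGD.
optimizer = mx.optimizer.SGD(
    learning_rate=0.1,
    momentum=0.9,
    wd=0.0005,
    rescale_grad=1.0 / 256,  # assuming a global batch size of 256
    lr_scheduler=lr_scheduler)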

@sharonjunjun commented Oct 10, 2018

dataset: emore
network backbone: r100 ( res_unit=3, output=E, emb_size=512, prelu)
loss function: cosine(m=0.35)
training pipeline: straightforward , batch-size:128*8
Highest LFW: 99.8%+; Highest CFP_FP: 98%+; Highest AgeDB30: 98%+

@meanmee regarding "training pipeline: straightforward": is your lr schedule just 0.1, or do you change the lr at different global steps?

@guangdongliang commented Nov 8, 2018

  1. dataset: EMore
  2. network backbone: r100
  3. loss function: arcface (m=0.5, scale=64)
  4. input size: 112x112
  5. batch size: 2024(32 GPUs)
  6. lr: 0.4
  7. lr steps: 37500, 52500, 60000
  8. max steps: 75000
  9. One megaface: 97.43
  10. One LFW: 99.8; One AGEDB_30: 98.13; One CFP_FP: 95.37, One CFP_FF:99.93

@yoookoo did your accuracy reach 1 during training? If so, did you clean EMore yourself before training?

@LicheeX commented Dec 13, 2018

dataset: EMore
network backbone: mobilefacenet res-4-8-16-8
Model size:30.1M
loss function: arcface (m=0.5, scale=64)
input size: 112x112
LFW: 99.850
AGEDB_30: 98.167
CFP_FP: 97.729
Megaface: 96.8392

@twmht commented Dec 18, 2018

@LicheeX

Can you share the architecture?

@twmht commented Dec 18, 2018

@LicheeX

what do you mean by res-4-8-16-8?

@chenbiaolong commented Dec 26, 2018

@nttstar @LicheeX can you report your training accuracy?

@KiXiang commented Jan 14, 2019

@LicheeX can you share the training log and model?

@jackchinor commented Apr 3, 2019

dataset: eMS1M (emore)
network backbone: y2 (a simplified version of MobileFaceNet, which has 0.74M params and 94M M-Adds)
loss function: arcface (m=0.5, s=64)
input_size: 112*112
batch_size: 512
training pipeline:
Stage 1: pre-training with m=0.1. lr keep 0.1, end when acc plateaus.
Stage 2: normal-training with m=0.5. lr starts from 0.1 and is divided by 10 when the acc plateaus.
Highest LFW: 99.55%
Highest CFP_FP: 92.23%
Highest AgeDB30: 95.20%

Can you tell me where the y2 network is? Was the y2 network designed by you? I can't find it in the author's source code. Really appreciate it.

@xiezheng-cs commented Apr 29, 2019

dataset: eMS1M (emore)
network backbone: y2 (a simplified version of MobileFaceNet, which has 0.74M params and 94M M-Adds)
loss function: arcface (m=0.5, s=64)
input_size: 112*112
batch_size: 512
training pipeline:
Stage 1: pre-training with m=0.1. lr keep 0.1, end when acc plateaus.
Stage 2: normal-training with m=0.5. lr starts from 0.1 and is divided by 10 when the acc plateaus.
Highest LFW: 99.55%
Highest CFP_FP: 92.23%
Highest AgeDB30: 95.20%

Can you tell me where the y2 network is? Was the y2 network designed by you? I can't find it in the author's source code. Really appreciate it.

https://github.com/deepinsight/insightface/blob/master/recognition/symbol/fmobilefacenet.py

@heyanglearning commented May 24, 2019

dataset: emore
network backbone: r100 ( output=E, emb_size=512, prelu )
loss function: arcface(m=0.5)
lr_steps [105000, 125000, 150000], end with 180001, batch-size:256, 4gpu
then retrain with lr = 0.01, lr_steps[200000, 300000, 400000]
one LFW: 99.82; CFP_FP: 98.50; AgeDB30: 98.25

Before you retrain, what is your lr (lr=0.1)?

@xsacha commented May 26, 2019

Does this emore dataset overlap with the test datasets? It seems to perform abnormally well on LFW and CFP-FF, which suggests overlap with frontal celebrity images.
It doesn't perform any differently from cleaned ms1m on private test sets.

@clhne commented May 29, 2019

dataset: EMore
network backbone: mobilefacenet res-4-8-16-8
Model size:30.1M
loss function: arcface (m=0.5, scale=64)
input size: 112x112
LFW: 99.850
AGEDB_30: 98.167
CFP_FP: 97.729
Megaface: 96.8392

Which GPU did you use, and how many GPUs?
What batch size?
How long did training take?

@heyanglearning commented May 29, 2019

dataset: EMore
network backbone: c116 ( res_unit=3, output=E, emb_size=256, prelu ), CRU_Net
loss function: arcface(m=0.5)
aug_color = 0.1 for data color augmentation
random_crop = 0.9
input size: 112x112
batch size: 80(8 GPUs)
lr: 0.1
lr steps: [80000, 160000, 320000, 400000, 560000]
LFW: 0.99867+-0.00180; CFP_FP: 0.98386+-0.00542; CFP_FF: 0.99729+-0.00216; CALFW: 0.95933+-0.01119; CPLFW:0.92833+-0.01624; AgeDB30: 0.97983+-0.00765; all @epoch 36

Did you test on MegaFace?

@gehaocool commented Jun 13, 2019

dataset: EMore
network backbone: mobilefacenet res-4-8-16-8
Model size:30.1M
loss function: arcface (m=0.5, scale=64)
input size: 112x112
LFW: 99.850
AGEDB_30: 98.167
CFP_FP: 97.729
Megaface: 96.8392

Hi, what embedding setting do you use? GDC, GNAP or E?
Thank you!
