
train error: AttributeError: 'tuple' object has no attribute 'log_softmax' #6

Closed
lxy5513 opened this issue Jun 7, 2021 · 5 comments

lxy5513 commented Jun 7, 2021

Hi, thanks for your great work. When I run the training script, an error occurs: AttributeError: 'tuple' object has no attribute 'log_softmax'

with amp_autocast():
    output = model(input)
    loss = loss_fn(output, target)  # error occurs here

and the loss function is train_loss_fn = LabelSmoothingCrossEntropy(smoothing=0.0).cuda()
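
For reference, a minimal sketch that reproduces the same error, assuming timm's LabelSmoothingCrossEntropy and hypothetical output shapes (timm's log_softmax call fails as soon as the model returns a tuple instead of a tensor):

    import torch
    from timm.loss import LabelSmoothingCrossEntropy

    loss_fn = LabelSmoothingCrossEntropy(smoothing=0.0)
    target = torch.tensor([3, 7])

    # Hypothetical stand-in for the model output: an LV-ViT model trained for
    # token labeling returns a tuple (class_logits, aux_token_logits) rather
    # than a single logits tensor.
    output = (torch.randn(2, 10), torch.randn(2, 196, 10))

    loss = loss_fn(output, target)
    # AttributeError: 'tuple' object has no attribute 'log_softmax'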

By the way, could you please tell me why we need to specify smoothing=0.0?

zihangJiang (Owner) commented Jun 7, 2021

Hi,
Can you post your training script? The label smoothing is handled in Mixup & Cutmix here

TokenLabeling/main.py

Lines 484 to 488 in 9f71792

if mixup_active:
    mixup_args = dict(
        mixup_alpha=args.mixup, cutmix_alpha=args.cutmix, cutmix_minmax=args.cutmix_minmax,
        prob=args.mixup_prob, switch_prob=args.mixup_switch_prob, mode=args.mixup_mode,
        label_smoothing=args.smoothing, num_classes=args.num_classes)

and here

TokenLabeling/main.py

Lines 678 to 690 in 9f71792

    if args.token_label and args.token_label_data:
        target = create_token_label_target(target, num_classes=args.num_classes,
                                           smoothing=args.smoothing, label_size=args.token_label_size)
    if len(target.shape) == 1:
        target = create_token_label_target(target, num_classes=args.num_classes,
                                           smoothing=args.smoothing)
else:
    if args.token_label and args.token_label_data and not loader.mixup_enabled:
        target = create_token_label_target(target, num_classes=args.num_classes,
                                           smoothing=args.smoothing, label_size=args.token_label_size)
    if len(target.shape) == 1:
        target = create_token_label_target(target, num_classes=args.num_classes,
                                           smoothing=args.smoothing)
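
In other words, the smoothing is folded into the soft target tensors themselves (by Mixup/CutMix or by create_token_label_target) rather than applied inside the loss, which is why the loss can be constructed with smoothing=0.0. A minimal sketch of the Mixup side, assuming timm's Mixup API and illustrative hyperparameter values:

    import torch
    from timm.data import Mixup

    # label_smoothing here is what makes smoothing=0.0 safe in the loss:
    # the targets produced below are already smoothed soft distributions.
    mixup_fn = Mixup(mixup_alpha=0.8, cutmix_alpha=1.0,
                     label_smoothing=0.1, num_classes=1000)

    images = torch.randn(8, 3, 224, 224)       # batch size must be even
    labels = torch.randint(0, 1000, (8,))
    mixed_images, soft_targets = mixup_fn(images, labels)
    print(soft_targets.shape)  # torch.Size([8, 1000]): soft, smoothed targets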

If you train without token labeling, we suggest you add --mixup 0.8 or --cutmix 1.0 as regularization, and use an LV-ViT model. Models like lvvit_s return all tokens by default, which will cause an error if you train without token labeling. An illustrative command for this case is sketched below.
If you train with token labeling, please add the --token-label flag.
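
For the no-token-labeling case, a command along these lines might work (dataset path hypothetical; the flags are the ones suggested above):

    python main.py /path/to/your/dataset --model lvvit_s -b 64 \
        --apex-amp --mixup 0.8 --cutmix 1.0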

lxy5513 (Author) commented Jun 8, 2021

@zihangJiang Thanks for your response.
My training script is here:
python main.py /home/liuxingyu/data/patrol/comfort_classification/nursery_v16 --model lvvit_m -b 64 --apex-amp --img-size 384 --drop-path 0.2 --token-label-size 24 --model-ema --finetune lvvit_m-56M-384-85.4.pth.tar --token-label-data ''

I train without token labeling because I have not generated token labels for my custom dataset.

"Models like lvvit_s return all tokens by default, which will cause an error if you train without token labeling"

Does this mean that if I want to train without token labeling, I can't use the pretrained model?

zihangJiang (Owner) commented

We haven't tested on other datasets yet, but you can of course use the pre-trained model. I think you can add the --token-label --dense-weight 0 flags. This can work without token label data.

However, some other bugs may occur. You can wait for our further update supporting transfer learning.

lxy5513 (Author) commented Jun 8, 2021

Well, thanks. Looking forward to the further update.

zihangJiang (Owner) commented

Hi @lxy5513,

We've updated and tested support for transfer learning. The dataset folder structure should be the same as the ImageNet structure (https://github.com/zihangJiang/TokenLabeling#requirements), with train and val splits. You can clone the main branch, specify the --num-classes flag, and add --token-label --dense-weight 0 for fine-tuning.
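
For example, adapting the script posted earlier in this thread, a fine-tuning command might look like (dataset path and --num-classes value hypothetical):

    python main.py /path/to/custom_dataset --model lvvit_m -b 64 --apex-amp \
        --img-size 384 --drop-path 0.2 --model-ema \
        --finetune lvvit_m-56M-384-85.4.pth.tar \
        --num-classes 10 --token-label --dense-weight 0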

Let me know if you have any further questions.
