Low performance when using minimum code to reuse your models #28

Closed
kondela opened this issue Mar 23, 2020 · 6 comments

kondela commented Mar 23, 2020

I am trying to reuse your pre-trained Pix2Vox-A model, but the output quality is very low (close to random). Below is the bare minimum of code I used.

# Imports assumed from the Pix2Vox repository layout
import os
from collections import OrderedDict

import numpy as np
import torch
from PIL import Image

import utils.binvox_visualization
import utils.data_transforms
from config import cfg
from models.decoder import Decoder
from models.encoder import Encoder
from models.merger import Merger
from models.refiner import Refiner

encoder = Encoder(cfg)
decoder = Decoder(cfg)
refiner = Refiner(cfg)
merger = Merger(cfg)

cfg.CONST.WEIGHTS = '/Projects/Pix2Vox/pretrained_weights/Pix2Vox-A-ShapeNet.pth'
checkpoint = torch.load(cfg.CONST.WEIGHTS, map_location=torch.device('cpu'))

# Strip the 'module.' prefix that torch.nn.DataParallel adds to parameter names
fix_checkpoint = {}
for name in ('encoder_state_dict', 'decoder_state_dict', 'refiner_state_dict', 'merger_state_dict'):
    fix_checkpoint[name] = OrderedDict((k.split('module.')[1:][0], v) for k, v in checkpoint[name].items())

epoch_idx = checkpoint['epoch_idx']
encoder.load_state_dict(fix_checkpoint['encoder_state_dict'])
decoder.load_state_dict(fix_checkpoint['decoder_state_dict'])
# The refiner and merger need their weights too, otherwise they run randomly initialized
refiner.load_state_dict(fix_checkpoint['refiner_state_dict'])
merger.load_state_dict(fix_checkpoint['merger_state_dict'])

encoder.eval()
decoder.eval()
refiner.eval()
merger.eval()

img1_path = '/ShapeNetRendering/02691156/1a04e3eab45ca15dd86060f189eb133/rendering/00.png'
img1_np = np.asarray(Image.open(img1_path))

sample = np.array([img1_np])

IMG_SIZE = cfg.CONST.IMG_H, cfg.CONST.IMG_W
CROP_SIZE = cfg.CONST.CROP_IMG_H, cfg.CONST.CROP_IMG_W

test_transforms = utils.data_transforms.Compose([
    utils.data_transforms.CenterCrop(IMG_SIZE, CROP_SIZE),
    utils.data_transforms.RandomBackground(cfg.TEST.RANDOM_BG_COLOR_RANGE),
    utils.data_transforms.Normalize(mean=cfg.DATASET.MEAN, std=cfg.DATASET.STD),
    utils.data_transforms.ToTensor(),
])

rendering_images = test_transforms(rendering_images=sample)
rendering_images = rendering_images.unsqueeze(0)

with torch.no_grad():
    image_features = encoder(rendering_images)
    raw_features, generated_volume = decoder(image_features)

    if cfg.NETWORK.USE_MERGER and epoch_idx >= cfg.TRAIN.EPOCH_START_USE_MERGER:
        generated_volume = merger(raw_features, generated_volume)
    else:
        generated_volume = torch.mean(generated_volume, dim=1)

    if cfg.NETWORK.USE_REFINER and epoch_idx >= cfg.TRAIN.EPOCH_START_USE_REFINER:
        generated_volume = refiner(generated_volume)

For visualization I used the binvox_visualization from utilities:

generated_volume = generated_volume.squeeze(0)

img_dir = '/sample_images'
gv = generated_volume.cpu().numpy()
rendering_views = utils.binvox_visualization.get_volume_views(gv, img_dir, epoch_idx)

This is the model's output:
[image: generated voxel volume]

This is the input:
[image: input rendering 00.png]

There were a few problems with loading the pre-trained weights, as mentioned in other issues, but other than that everything seems to work properly except for the output quality. I guess I must be missing something?

kondela commented Mar 28, 2020

After digging a bit deeper, the difference turned out to be in the image loading: I had used Pillow instead of OpenCV, and switching to OpenCV made all the difference. For the visualization, all I had to do was swap axes with np.swapaxes(voxels, 2, 1).
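
For reference, a minimal sketch of both fixes, reusing the variables from the script above (the cv2.imread call mirrors what the repository's data loader does; the exact flags there may differ):

import cv2
import numpy as np

# Load the image the way the Pix2Vox data pipeline does: OpenCV, float32 in
# [0, 1], keeping the alpha channel (IMREAD_UNCHANGED) so RandomBackground can use it
img1_np = cv2.imread(img1_path, cv2.IMREAD_UNCHANGED).astype(np.float32) / 255.
sample = np.array([img1_np])

# ... transforms, encoder, decoder, merger, refiner exactly as above ...

# Swap axes before visualizing so the volume orientation matches the plot
gv = generated_volume.squeeze(0).cpu().numpy()
gv = np.swapaxes(gv, 2, 1)
rendering_views = utils.binvox_visualization.get_volume_views(gv, img_dir, epoch_idx)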

[image: corrected voxel output]

@kondela kondela closed this as completed Mar 28, 2020
@aashishbohra10

Hi @kondela, I am also facing this issue. If you could share your test.py, I would be very grateful; it would help me generate correct results.

ahmedshingaly commented Sep 8, 2020

I would be grateful if the script above could be collected into a single test file; I tried to reproduce the test results but failed.
@hzxie @kondela @aashishbohra10

encoder = Encoder(cfg)
decoder = Decoder(cfg)
refiner = Refiner(cfg)
merger = Merger(cfg)

cfg.CONST.WEIGHTS = 'pretrained/Pix2Vox-A-ShapeNet.pth'
checkpoint = torch.load(cfg.CONST.WEIGHTS, map_location=torch.device('cpu'))

fix_checkpoint = {}
fix_checkpoint['encoder_state_dict'] = OrderedDict((k.split('module.')[1:][0], v) for k, v in checkpoint['encoder_state_dict'].items())
fix_checkpoint['decoder_state_dict'] = OrderedDict((k.split('module.')[1:][0], v) for k, v in checkpoint['decoder_state_dict'].items())

epoch_idx = checkpoint['epoch_idx']
encoder.load_state_dict(fix_checkpoint['encoder_state_dict'])
decoder.load_state_dict(fix_checkpoint['decoder_state_dict'])

encoder.eval()
decoder.eval()
refiner.eval()
merger.eval()

img1_path = '/datasets/ShapeNetRendering/02691156/1a04e3eab45ca15dd86060f189eb133/rendering/00.png'
img1_np = np.asarray(Image.open(img1_path))

sample = np.array([img1_np])

IMG_SIZE = cfg.CONST.IMG_H, cfg.CONST.IMG_W
CROP_SIZE = cfg.CONST.CROP_IMG_H, cfg.CONST.CROP_IMG_W

test_transforms = utils.data_transforms.Compose([
    utils.data_transforms.CenterCrop(IMG_SIZE, CROP_SIZE),
    utils.data_transforms.RandomBackground(cfg.TEST.RANDOM_BG_COLOR_RANGE), 
    utils.data_transforms.Normalize(mean=cfg.DATASET.MEAN, std=cfg.DATASET.STD),
    utils.data_transforms.ToTensor(),
])

rendering_images = test_transforms(rendering_images=sample)
rendering_images = rendering_images.unsqueeze(0)

with torch.no_grad():
    image_features = encoder(rendering_images)
    raw_features, generated_volume = decoder(image_features)

    if cfg.NETWORK.USE_MERGER and epoch_idx >= cfg.TRAIN.EPOCH_START_USE_MERGER:
        generated_volume = merger(raw_features, generated_volume)
    else:
        generated_volume = torch.mean(generated_volume, dim=1)

    if cfg.NETWORK.USE_REFINER and epoch_idx >= cfg.TRAIN.EPOCH_START_USE_REFINER:
        generated_volume = refiner(generated_volume)


generated_volume = generated_volume.squeeze(0)

img_dir = '/output/myresults'
gv = generated_volume.cpu().numpy()
rendering_views = utils.binvox_visualization.get_volume_views(gv, img_dir, epoch_idx)

Thank you in advance.

saisai1002 commented Sep 28, 2020

> After digging a bit deeper, the difference turned out to be in the image loading: I had used Pillow instead of OpenCV, and switching to OpenCV made all the difference. For the visualization, all I had to do was swap axes with np.swapaxes(voxels, 2, 1).

I am also facing this issue. This is what appeared when I tested it:
[image: incorrect voxel output]
Where should the np.swapaxes(voxels, 2, 1) line be added? Thanks.

@LiyingCV

> I am also facing this issue. Where should the np.swapaxes(voxels, 2, 1) line be added?

Did you resolve this issue? I am also hitting it and still cannot resolve it.

b7leung commented Mar 19, 2021

In my case, this was resolved when I found out that Pix2Vox is trained such that, for transparent PNG input images, the RGB channels must be black wherever the alpha channel is transparent. In my case they were white. This is impossible to tell visually (because of the alpha), but it can be double-checked in code (e.g. with matplotlib): if you display only the RGB channels, the background should be black.
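
A minimal sketch of that check, assuming an RGBA PNG such as the ShapeNet renderings (the file name is a placeholder):

import numpy as np
from PIL import Image
import matplotlib.pyplot as plt

img = np.asarray(Image.open('00.png').convert('RGBA')).astype(np.float32) / 255.
rgb, alpha = img[..., :3], img[..., 3]

# Display only the RGB channels: the background should look black, not white
plt.imshow(rgb)
plt.show()

# If it is white, force the RGB channels to black wherever the pixel is transparent
rgb[alpha == 0] = 0.
fixed = np.dstack((rgb, alpha))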
