
Image Quality Degrades #40

Open

nerdogram opened this issue Apr 27, 2020 · 9 comments

@nerdogram

Hi, amazing work here! I was experimenting with higher-resolution images by increasing the longer_side_len argument to 1920. However, the video quality is a little blurry and does not match the input images, which are high-res. Is this because you downsample and then upsample them in the process, or am I missing some other argument that needs to be updated?

@cless-zor

Same problem.
I tried changing some arguments, but the final result is blurry and low quality.
I tried on Colab.
Any ideas?

@victusfate

victusfate commented Jun 4, 2020

I ended up setting the output video resolution to match the input:

    # frac = config['longer_side_len'] / max(config['output_h'], config['output_w'])
    # config['output_h'], config['output_w'] = int(config['output_h'] * frac), int(config['output_w'] * frac)
    frac = 1  # skip the downscale so the output stays at the input resolution
    config['original_h'], config['original_w'] = config['output_h'], config['output_w']

I was going to look at the output video bitrate next, but any help on increasing the sharpness of the output video to more closely approximate the input image would be great.

Trying libx264 with a higher output bitrate (2+ Mbps):

        clip.write_videofile(os.path.join(output_dir, output_name), fps=config['fps'], codec='libx264',bitrate='2000k')
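
If bitrate alone doesn't sharpen things, a constant-quality encode is another option. A sketch using moviepy's ffmpeg_params passthrough (the -crf flag is a standard libx264 option; this is my suggestion, not something the repo does):

        # alternative: constant-quality encode; CRF 17 is visually near-lossless,
        # lower values mean higher quality and larger files
        clip.write_videofile(os.path.join(output_dir, output_name), fps=config['fps'],
                             codec='libx264', ffmpeg_params=['-crf', '17'])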

Also going to explore skipping the Gaussian blur (it may be required by the algorithm):

    # img = cv2.GaussianBlur(img,(int(init_factor//2 * 2 + 1), int(init_factor//2 * 2 + 1)), 0)
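
For what it's worth, that kernel-size expression just forces an odd window, which cv2.GaussianBlur requires. A quick illustration of the arithmetic:

    # int(init_factor//2 * 2 + 1) rounds init_factor down to an even number, then adds 1,
    # guaranteeing the odd kernel size that cv2.GaussianBlur requires
    for init_factor in (3, 4, 5, 8):
        print(init_factor, '->', int(init_factor // 2 * 2 + 1))  # 3, 5, 5, 9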

@victusfate

Ended up passing the original image width to run_depth and using it instead of the hardcoded 640.
Testing it now to ensure the output resolution matches the input.

After that I'll circle back to the output video quality.

in main.py

    image_width = image.shape[1]
    run_depth(device,[sample['ref_img_fi']], config['src_folder'], config['depth_folder'],
              config['MiDaS_model_ckpt'], MonoDepthNet, MiDaS_utils, target_w=image_width)

in run.py run_depth

        scale = target_w / max(img.shape[0], img.shape[1])
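
For anyone following along, a minimal sketch of how that scale would drive the resize inside run_depth (the rounding and the multiple-of-32 constraint are my assumptions about the usual MiDaS-style preprocessing, not the repo's exact code):

    def depth_input_size(img, target_w):
        # scale relative to the longer side, replacing the hardcoded 640
        scale = target_w / max(img.shape[0], img.shape[1])
        # round to multiples of 32, which MiDaS-style networks typically expect (assumption)
        h = max(32, int(round(img.shape[0] * scale / 32)) * 32)
        w = max(32, int(round(img.shape[1] * scale / 32)) * 32)
        return h, w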

@victusfate

victusfate commented Jun 5, 2020

That did it; I'm getting a large, high-quality video now.
It took 14 min locally vs. 2–3 min before (both CPU), and also needed more RAM, up to 22–24 GB.

@turkeyphant

(quoting @victusfate's run_depth changes above)

This still seems to limit my output to a width of 960 px. Did you make any other changes?

@turkeyphant

I've also changed the frac variable, but now main.py won't complete:

    running on device 0
      0% 0/1 [00:00<?, ?it/s]Current Source ==>  2mars
    Running depth extraction at 1595581689.5904121
    initialize
    device: cpu
    start processing
      processing image/2mars.jpg (1/1)
    torch.Size([1, 3, 352, 384])
    finished
    Start Running 3D_Photo ...
    Loading edge model at 1595581910.257932
    Loading depth model at 1595581919.6054657
    Loading rgb model at 1595581925.925136
    Writing depth ply (and basically doing everything) at 1595581932.251731
    ^C

@peymanrah

peymanrah commented Jul 27, 2020

@victusfate I tried your way and the results are better! For others to follow, these are the steps I took (a consolidated sketch follows the list):

  1. I hardcoded frac in main.py to 1 (frac = 1). This makes sure the input and output sizes are the same.

  2. Passed the original image width into run_depth in main.py:

     image_width = image.shape[1]
     run_depth(device, [sample['ref_img_fi']], config['src_folder'], config['depth_folder'],
               config['MiDaS_model_ckpt'], MonoDepthNet, MiDaS_utils, target_w=image_width)

  3. Changed run.py (run_depth) to:

     scale = target_w / max(img.shape[0], img.shape[1])

  4. In mesh.py I commented out the Gaussian blur:

     # img = cv2.GaussianBlur(img, (int(init_factor//2 * 2 + 1), int(init_factor//2 * 2 + 1)), 0)

  5. In mesh.py I also modified the output video settings:

     clip.write_videofile(os.path.join(output_dir, output_name), fps=config['fps'], codec='libx264', bitrate='2000k')

  6. Commented out the resize in main.py:

     # image = cv2.resize(image, (config['output_w'], config['output_h']), interpolation=cv2.INTER_AREA)
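
Putting steps 1–6 together, a consolidated sketch of the edits (file names per my reading of the stock repo layout; treat this as a summary of the changes above, not a tested patch):

    # --- main.py ---
    frac = 1  # step 1: keep input and output sizes the same
    # step 6: skip the downscale
    # image = cv2.resize(image, (config['output_w'], config['output_h']), interpolation=cv2.INTER_AREA)
    image_width = image.shape[1]  # step 2: pass the true width to the depth stage
    run_depth(device, [sample['ref_img_fi']], config['src_folder'], config['depth_folder'],
              config['MiDaS_model_ckpt'], MonoDepthNet, MiDaS_utils, target_w=image_width)

    # --- run.py, inside run_depth ---
    scale = target_w / max(img.shape[0], img.shape[1])  # step 3: was 640 / max(...)

    # --- mesh.py ---
    # step 4: Gaussian blur disabled
    # img = cv2.GaussianBlur(img, (int(init_factor//2 * 2 + 1), int(init_factor//2 * 2 + 1)), 0)
    clip.write_videofile(os.path.join(output_dir, output_name), fps=config['fps'],
                         codec='libx264', bitrate='2000k')  # step 5: higher bitrate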

P.S. If I push the longer_side_len argument to a much larger resolution (4500), the code takes forever (I have one GPU). RAM shoots up to over 64 GB and then the process exits, presumably out of memory. Is there any way to work with high-resolution images within 64 GB of RAM, or do I need much more? If so, how much? This looks like a memory leak for large files!

Another issue: if you output the video at the same size as the input, some pixels have no depth info when you zoom in or circle around the object. This does not happen when you resize the input image. However, if you output at the input size by following the steps above, the video shows "gray blocks" with no depth info. I don't know why; any idea?

@wux12

wux12 commented Mar 16, 2022

(quoting @peymanrah's steps 1–6 above)

Hi, I tried your changes, but I get the following error. Have you encountered this problem?

    Traceback (most recent call last):
      File "main.py", line 89, in <module>
        vis_photos, vis_depths = sparse_bilateral_filtering(depth.copy(), image.copy(), config, num_iter=config['sparse_iter'], spdb=False)
      File "/home/w/3d-photo-inpainting/bilateral_filtering.py", line 31, in sparse_bilateral_filtering
        vis_image[u_over > 0] = np.array([0, 0, 0])
    IndexError: boolean index did not match indexed array along dimension 0; dimension is 2848 but corresponding boolean dimension is 425

@psx2

psx2 commented Jun 25, 2022

(quoting step 6 above: commenting out the cv2.resize in main.py)

Ensure the following code has not been commented out in main.py.

    image = cv2.resize(image, (config['output_w'], config['output_h']), interpolation=cv2.INTER_AREA)

This should fix your issue.
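
That IndexError is a shape mismatch: the depth map was produced at the old downscaled resolution (425 px) while the image kept its full 2848 px dimension. Besides restoring the resize as above, another option is to upsample the depth map to the image size before the filtering step; a hypothetical guard (my suggestion, not repo code):

    # before sparse_bilateral_filtering in main.py: align depth with the image
    if depth.shape[:2] != image.shape[:2]:
        depth = cv2.resize(depth, (image.shape[1], image.shape[0]),
                           interpolation=cv2.INTER_LINEAR)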
