
The example in the (pre-)release is much slower than the previous method. #22

Closed
hooke007 opened this issue Jan 18, 2023 · 8 comments

Comments

@hooke007
Contributor

From the v13 (pre-)release notes: "perform the necessary padding, YUV/RGB conversion and FP16 conversion in one go"

https://github.com/AmusementClub/vs-mlrt/releases/tag/v13

On my device, "pad first and then crop" (283 fps) is faster than the "all-in-one" processing (243 fps).

[benchmark screenshots: 1080p_p10 and 2160p_p10]
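For context, a minimal sketch (in VapourSynth Python) of the two pipelines being compared; it assumes a 1080p 10-bit source padded to 1920x1088 and assumes zimg edge-extends when the active source window reaches past the frame, so it may not match the release example exactly:

```python
import vapoursynth as vs

core = vs.core

# stand-in for the real 1080p 10-bit source clip
src = core.std.BlankClip(format=vs.YUV420P10, width=1920, height=1080)

# "all-in-one": padding, YUV->RGB and FP16 conversion in a single zimg call,
# by extending the active source window 8 pixels past the bottom edge
all_in_one = core.resize.Bicubic(
    src, width=1920, height=1088,
    src_width=1920, src_height=1088,
    format=vs.RGBH, matrix_in_s="709",
)

# "pad first and then crop": convert, pad, run the model, then crop the
# padding back off (AddBorders pads with a solid colour here; the original
# script may pad differently)
rgbh = core.resize.Bicubic(src, format=vs.RGBH, matrix_in_s="709")
padded = core.std.AddBorders(rgbh, bottom=8)
# ... model inference on `padded` (or `all_in_one`) would go here ...
restored = core.std.Crop(padded, bottom=8)
```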

@WolframRhodium
Contributor

Thanks. zimg probably does not fuse these operations efficiently.

hooke007 closed this as not planned on Feb 9, 2023
@hooke007
Contributor Author

@WolframRhodium Hi, I was testing RIFE v2 (bd0ff98) but it failed. I'm not sure if I missed something.

trtexec_230227_151717.log

@WolframRhodium
Contributor

Does it work if dynamic shapes or faster_dynamic_shapes is disabled? I have only tested the former.
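For reference, a hedged sketch of how those two modes might be toggled through vsmlrt.py's TRT backend; the parameter names and shape ordering (static_shape, faster_dynamic_shapes, opt_shapes, max_shapes as (width, height)) are assumptions based on this thread and may not match the installed version:

```python
import vapoursynth as vs
import vsmlrt

core = vs.core

# stand-in for a float RGB clip as expected by the RIFE wrapper
clip = core.std.BlankClip(format=vs.RGBS, width=1920, height=1088)

# dynamic shapes disabled entirely (assumed parameter name)
backend_static = vsmlrt.Backend.TRT(fp16=True, static_shape=True)

# dynamic shapes enabled, but the faster_dynamic_shapes optimisation turned off
backend_dynamic = vsmlrt.Backend.TRT(
    fp16=True,
    static_shape=False,
    faster_dynamic_shapes=False,
    opt_shapes=(1920, 1088),  # assumed (width, height) ordering
    max_shapes=(1920, 1088),
)

flt = vsmlrt.RIFE(clip, backend=backend_dynamic)
```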

@hooke007
Contributor Author

hooke007 commented Feb 28, 2023

With faster_dynamic_shapes disabled, it still failed.

@hooke007
Contributor Author

hooke007 commented Feb 28, 2023

Though I cannot make it work with dynamic shapes, v2 (output 16000 frames in 53.22 seconds, 300.66 fps) is much faster than v1 (output 16000 frames in 68.05 seconds, 235.12 fps).
Tested with a 1080p 10-bit source.

@WolframRhodium
Contributor

WolframRhodium commented Feb 28, 2023

Hi, it seems that the Range operator does not support fp16 precision; removing --layerPrecisions=*:fp16 --layerOutputTypes=*:fp16 --precisionConstraints=obey from your command will make it work. You may have to manually exclude those layers in the precision specification.

Edit: replacing --precisionConstraints=obey with --precisionConstraints=prefer also works, with the following warnings:

[02/28/2023-10:40:14] [W] [TRT] No implementation of layer /Range obeys the requested constraints. I.e. no conforming implementation was found for requested layer computation precision and output precision. Using fastest implementation instead.
[02/28/2023-10:40:14] [W] [TRT] No implementation of layer /Range_1 obeys the requested constraints. I.e. no conforming implementation was found for requested layer computation precision and output precision. Using fastest implementation instead.
[02/28/2023-10:40:14] [W] [TRT] No implementation of layer [ShapeHostToDeviceCopy 0] obeys the requested constraints. I.e. no conforming implementation was found for requested layer computation precision and output precision. Using fastest implementation instead.
[02/28/2023-10:40:28] [W] [TRT] Skipping tactic 0x0000000000000000 due to Myelin error: [] Mismatched type for tensor (Unnamed Layer_ 83) [Constant]_output', f16 vs. expected type:f32.
[02/28/2023-10:41:58] [W] [TRT] Skipping tactic 0x0000000000000000 due to Myelin error: [] Mismatched type for tensor (Unnamed Layer_ 706) [Shuffle]_output', f16 vs. expected type:f32.

The first three warnings (and the related source code) confirm the observation; the last two warnings may be a bug in TRT's experimental graph compiler.
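For the manual-exclusion route, a sketch (written here as a Python wrapper around trtexec) that keeps every other layer in fp16 but leaves the two Range layers alone; the ONNX/engine paths are placeholders, and the assumption that later --layerPrecisions/--layerOutputTypes entries override the '*' wildcard should be checked against the trtexec help:

```python
import subprocess

onnx_model = "rife_v2.onnx"      # placeholder path
engine_path = "rife_v2.engine"   # placeholder path

args = [
    "trtexec",
    f"--onnx={onnx_model}",
    "--fp16",
    # keep the Range layers (flagged in the warnings above) out of fp16;
    # assumes later entries override the '*' wildcard
    "--layerPrecisions=*:fp16,/Range:fp32,/Range_1:fp32",
    "--layerOutputTypes=*:fp16,/Range:fp32,/Range_1:fp32",
    # 'prefer' instead of 'obey', as noted above, avoids hard failures
    "--precisionConstraints=prefer",
    f"--saveEngine={engine_path}",
]
subprocess.run(args, check=True)
```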

@hooke007
Contributor Author

@WolframRhodium Hello, I have a question about the v2 model.

For the v1 model, I use 1920x1088 as the value of opt_shape. Should I use 1920x1080 instead for v2?

@WolframRhodium
Contributor

I think 1920x1080 should be used for V2.
