AVS+ Support #7

WolframRhodium · 2021-07-25T09:24:38Z

This issue tracks information related to AviSynth+ support. Please discuss in the doom9's thread.

WolframRhodium · 2021-07-25T09:27:46Z

experimental releases

test1 (ad93e6a)
test2 (28db68e, +CPU version)
test3 (8339115, separates VAggregate and compiles for AVX)
test4 (611f566, fix array parameter)
- convert array to arguments, and others issues about array AviSynth/AviSynthPlus#226
test5 (4b31d23, bm_range now defaults to 9 instead of 8, fixes parameter check)
~~test6 (13fabee, add support for cc 3.5, links to the static msvc rt, remove avx)~~ (don't use; accidentally build with debug config)
test7 (c307c22, fix performance regression introduced in test6, restore avx on win64 and msvc rt dynamic linking)
test8 (72e24c1, remove avx)
test9 (aabebd9, fix temporal padding) (x86 build is deprecating)
test10 (58dbc0a, update to cuda 12, bug fixes, add support for Ada Lovelace, remove support for Kepler and x86)
test10-cuda118 (2a89613, backport to cuda 11.8.0)

kedaitinh12 · 2021-07-25T13:10:34Z

Thanks, can you update avisynth cpu ver??

kedaitinh12 · 2021-07-26T10:41:19Z

Deleted, sr for disturb, don't care about sse3, sse4

kedaitinh12 · 2021-09-02T17:04:58Z

Can you make auto build for avs+ x86?? Thanks
https://github.com/WolframRhodium/VapourSynth-BM3DCUDA/actions/runs/1168740493

WolframRhodium · 2021-09-03T05:57:52Z

Yes, I will do it in the near future.

kedaitinh12 · 2021-09-03T10:00:09Z

Thanks

WolframRhodium · 2021-09-03T10:32:26Z

Done 26a5015.

kedaitinh12 · 2021-09-03T11:11:18Z

Thanks, i don't think "near future" only 5 hours 😂😂😂

kedaitinh12 · 2021-09-03T11:26:02Z

But x86 ver don't have bm3d_vaggregate_avs.dll

WolframRhodium · 2021-09-04T03:01:39Z

But x86 ver don't have bm3d_vaggregate_avs.dll

Thanks.

Thanks, i don't think "near future" only 5 hours 😂😂😂

I thought there must be some compilation errors but everything goes smoothly.

kedaitinh12 · 2021-09-04T04:19:17Z

x86 ver have bm3d_vaggregate_avs now. Thanks

mysteryx93 · 2021-10-15T03:23:16Z

In Avisynth, bm_range is limited to 1-8 while it can easily be 16 in VapourSynth

#7 (comment)

WolframRhodium · 2021-10-17T01:37:33Z

In Avisynth, bm_range is limited to 1-8 while it can easily be 16 in VapourSynth

Fixed. Thanks for the information.

Reel-Deal · 2021-10-17T02:21:32Z

@WolframRhodium

Thank you for putting up the new releases on here.

tormento · 2022-06-18T12:33:26Z

Port 2.8 with internal VAggregate to AVS+ :)

tormento · 2022-10-11T20:54:18Z

@WolframRhodium please? :)

madey83 · 2022-10-18T15:34:03Z

hi,

i use BM3DCUDA_AVS-test9 on my RTX 2060 with below call as prefilter:
BM3D(sigma=10,preset="normal",radius=3,UV=1,gpuid=0,tv_range=true)

and i saw that clock of RTX is set to 855MHz only....

mysteryx93 · 2022-10-19T02:01:07Z

GPU may be limited by transfer bandwidth. It's designed for outputting graphics, not to transfer massive data back and forth.

WolframRhodium · 2022-10-19T04:20:38Z

hi,

i use BM3DCUDA_AVS-test9 on my RTX 2060 with below call as prefilter: BM3D(sigma=10,preset="normal",radius=3,UV=1,gpuid=0,tv_range=true)

and i saw that clock of RTX is set to 855MHz only....

@madey83

Hi. You should always use Prefetch() to enable multi-threading. The difference in speed is huge. Check the example here.

Port 2.8 with internal VAggregate to AVS+ :)

Various wrappers for AVS+ exist and I don't think there is a need to introduce it.

madey83 · 2022-10-19T08:55:55Z

@WolframRhodium
at the end of my script i use this: Prefetch(6,12) and it can't brake 855MHz

WolframRhodium · 2022-10-19T09:41:38Z

The Prefetch call should follow the BM3D_VAggregate call immediately.

madey83 · 2022-10-19T11:11:27Z

hi,
this is my script call:

WolframRhodium · 2022-10-19T11:28:55Z

...
ex_BM3D(...)
Prefetch(...)
...

madey83 · 2022-10-19T11:34:42Z

sorry, but i do not catch your answer....

WolframRhodium · 2022-10-19T11:40:11Z

Sorry about that.

The script should be

...

pre=ex_BM3D(sigma=10,preset="normal",radius=3,UV=1,gpuid=0,tv_range=true)

pre=Prefetch(pre,6,12) # <= ** new line here **

SMDgrain(prefilter=pre, LFR=false, limits=false, DFTFlicker=false, tr=2, thSAD=)

...

madey83 · 2022-10-19T11:48:12Z

thank you for answer. this not help but maybe this is the problem with WinOS,
i will test it again when i will have clean Win installation done.

WolframRhodium · 2022-10-19T11:51:01Z

What if you remove all the following filters and output pre directly?

tormento · 2022-10-19T15:26:51Z

The real bottleneck is the aggregate part (i.e. the temporal part of BM3D), that is still done in CPU.

tormento · 2023-06-12T12:34:59Z

@WolframRhodium sorry to bother you again but I'd like to see the porting with internal aggregation :)

WolframRhodium · 2023-06-12T12:41:56Z

It is simply a kind of wrapper in terms of avisynth, in which scripts and plugins are treated equally.

tormento · 2023-06-12T13:35:29Z

@WolframRhodium so there is no speed advantage in the so called BM3Dv2?

WolframRhodium · 2023-06-12T14:09:43Z

Yep.

tormento · 2023-06-12T15:55:09Z

And has this part

Improve performance of VAggregate() and BM3Dv2() for temporal denoising.
This VAggregate() implementation is measured to be ~40% faster than the original implementation, resulting in 0 ~ 5% speedup overall.

been ported? :)

WolframRhodium · 2023-06-12T16:25:32Z

Previously bm3dcpu/cuda on vs are using VapourSynth-BM3D for VAggregate computation, which is never available for avs.

newcapricasean · 2024-01-28T16:24:09Z

Any chance of this avisynth+ version continuing to be under development? I particularly would love to see, for example, like the vapoursynth one, specific cpu type optimized versions made. But, also, with any other improvements included in the vapoursynth compiles. If you could show me / us how to compile the avisynth+ version, from the vapoursynth source code (if that's how you did it - you didn't provide source code for these avisynth+ versions), then I / we could do it ourselves... Why hold on to the obsolete avisynth+? Well, for the moment, I've found that, the TemporalDegrain2 avisynth+ script, with the BM3D_CPU, does better than the BM3D_CUDA/CPU, alone. That script also produces the best output, with the BM3D_CUDA/CPU. So, if you could possibly help in some way, that would be great! Thanks, in advance!

WolframRhodium · 2024-01-28T22:42:19Z

The source code for the avisynth+ version is in the avs+ branch, and the corresponding automatic compilation script is here.

BM3D_CPU should not produce noticeable result compared to BM3D_CUDA. That is a design objective.

newcapricasean · 2024-01-28T23:16:48Z

I thought the cuda version created non-deterministic results, whereas the CPU one always created deterministic...???

…

On Sun, Jan 28, 2024, 5:42 PM WolframRhodium ***@***.***> wrote: The source code for the avisynth+ version is in the avs+ <https://github.com/WolframRhodium/VapourSynth-BM3DCUDA/tree/avs%2B> branch, and the corresponding automatic compilation script is here <https://github.com/WolframRhodium/VapourSynth-BM3DCUDA/blob/avs%2B/.github/workflows/windows.yml> . BM3D_CPU should not produce noticeable result compared to BM3D_CUDA. That is a design objective. — Reply to this email directly, view it on GitHub <#7 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5VH7CGRUKU24Y3ZRIADU7LYQ3H5NAVCNFSM5A6LMPTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TCOJRGM3TINJZGEYA> . You are receiving this because you commented.Message ID: ***@***.***>

WolframRhodium · 2024-01-28T23:24:09Z

The cuda one can be made deterministic by setting the extractor_exp parameter to 3 or higher.

newcapricasean · 2024-01-29T00:06:17Z

How do you know how high you have to set that parameter? Is there a max number, for that parameter, that could just always be used? Does that parameter, in any way, negatively affect the results?

…

On Sun, Jan 28, 2024 at 6:24 PM WolframRhodium ***@***.***> wrote: The cuda one can be made deterministic by setting the extractor_exp parameter to 3 or higher. — Reply to this email directly, view it on GitHub <#7 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5VH7CB7CWTKHXR2OTUR45TYQ3M2JAVCNFSM5A6LMPTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TCOJRGM3TKNZQGUYQ> . You are receiving this because you commented.Message ID: ***@***.***>

WolframRhodium · 2024-01-29T00:58:13Z

There is no max number because the least value required for reproducible sum depends on the number of summation operands, which may increase as the value of parameters radius, bm_range, block_step, ps_num, ps_range increase.

This parameter does reduce accuracy, because this is the price of deterministic result. However, this error is marginal compared to the error of conventional fp32 -> uint16/uint10/uint8 conversion.

newcapricasean · 2024-01-29T01:24:40Z

How do I run that automatic compilation script? Do I run it from inside MSYS? I'm not familiar with a yml file. Additionally, can I also modify the yml file to replace...

…

-D CMAKE_CXX_FLAGS="/fp:fast" with...

-D CMAKE_CXX_FLAGS="-march=znver3"

On Sun, Jan 28, 2024 at 5:42 PM WolframRhodium ***@***.***> wrote: The source code for the avisynth+ version is in the avs+ <https://github.com/WolframRhodium/VapourSynth-BM3DCUDA/tree/avs%2B> branch, and the corresponding automatic compilation script is here <https://github.com/WolframRhodium/VapourSynth-BM3DCUDA/blob/avs%2B/.github/workflows/windows.yml> . BM3D_CPU should not produce noticeable result compared to BM3D_CUDA. That is a design objective. — Reply to this email directly, view it on GitHub <#7 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5VH7CGRUKU24Y3ZRIADU7LYQ3H5NAVCNFSM5A6LMPTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TCOJRGM3TINJZGEYA> . You are receiving this because you commented.Message ID: ***@***.***>

WolframRhodium · 2024-01-29T01:30:27Z

The yml file uses GitHub actions to compile the plugins on GitHub-hosted runners. You may check individual cmake commands in that file to compile on your host.

You can change the compilation flags.

newcapricasean · 2024-01-29T04:29:10Z

I'm giving up, for now... Been trying for a couple hours, to get it to compile, but no matter what I do, it tells me that there are indentation errors in the windows.yml file... Maybe I'll figure it out Wednesday... I work the next two days, and won't really have time to look at it...

…

On Sun, Jan 28, 2024 at 8:30 PM WolframRhodium ***@***.***> wrote: The yml file uses GitHub actions to compile the plugins on GitHub-hosted runners. You may check individual cmake commands in that file to compile on your host. You can change the compilation flags. — Reply to this email directly, view it on GitHub <#7 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5VH7CGSZCLXS2MXAUPSBO3YQ33T7AVCNFSM5A6LMPTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TCOJRGM4DCMZSGQ3A> . You are receiving this because you commented.Message ID: ***@***.***>

WolframRhodium · 2024-01-29T04:40:18Z

git clone -b avs+ https://github.com/WolframRhodium/VapourSynth-BM3DCUDA

cd VapourSynth-BM3DCUDA

cmake -S . -B build -G Ninja -D CMAKE_BUILD_TYPE=Release -D USE_NVRTC_STATIC=ON -D ENABLE_AVISYNTHPLUS=ON -D AVISYNTHPLUS_INCLUDE_DIRECTORY="%cd%\avisynth+\avs_core\include" -D ENABLE_VAPOURSYNTH=OFF -D CMAKE_CXX_FLAGS="/fp:fast" -D CMAKE_CUDA_FLAGS="--threads 0 --use_fast_math --resource-usage -Wno-deprecated-gpu-targets" -D CMAKE_CUDA_ARCHITECTURES="50;61-real;75-real;86-real;89-real"

cmake --build build

newcapricasean · 2024-01-29T13:51:17Z

I'll try those as soon as I can. Now, do I put -march=znver3 -O3 in the cxx flags only, or can I also put it in the cuda flags? Should I add a c flags with those? Should I replace the /fp fast or fast math?

…

On Sun, Jan 28, 2024, 11:40 PM WolframRhodium ***@***.***> wrote: git clone -b avs+ https://github.com/WolframRhodium/VapourSynth-BM3DCUDA cd VapourSynth-BM3DCUDA cmake -S . -B build -G Ninja -D CMAKE_BUILD_TYPE=Release -D USE_NVRTC_STATIC=ON -D ENABLE_AVISYNTHPLUS=ON -D AVISYNTHPLUS_INCLUDE_DIRECTORY="%cd%\avisynth+\avs_core\include" -D ENABLE_VAPOURSYNTH=OFF -D CMAKE_CXX_FLAGS="/fp:fast" -D CMAKE_CUDA_FLAGS="--threads 0 --use_fast_math --resource-usage -Wno-deprecated-gpu-targets" -D CMAKE_CUDA_ARCHITECTURES="50;61-real;75-real;86-real;89-real" cmake --build build — Reply to this email directly, view it on GitHub <#7 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5VH7CAI2RVFFI6IZ4HABYDYQ4R33AVCNFSM5A6LMPTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TCOJRGM4TIOJTGA2A> . You are receiving this because you commented.Message ID: ***@***.***>

newcapricasean · 2024-01-29T16:01:43Z

Alternatively, is there a way for avisynth scripts, like the TemporalDegrain2.avsi to be imported into vapoursynth?

…

On Mon, Jan 29, 2024, 8:51 AM Sean Lloyd ***@***.***> wrote: I'll try those as soon as I can. Now, do I put -march=znver3 -O3 in the cxx flags only, or can I also put it in the cuda flags? Should I add a c flags with those? Should I replace the /fp fast or fast math? On Sun, Jan 28, 2024, 11:40 PM WolframRhodium ***@***.***> wrote: > git clone -b avs+ https://github.com/WolframRhodium/VapourSynth-BM3DCUDA > cd VapourSynth-BM3DCUDA > > cmake -S . -B build -G Ninja -D CMAKE_BUILD_TYPE=Release -D USE_NVRTC_STATIC=ON -D ENABLE_AVISYNTHPLUS=ON -D AVISYNTHPLUS_INCLUDE_DIRECTORY="%cd%\avisynth+\avs_core\include" -D ENABLE_VAPOURSYNTH=OFF -D CMAKE_CXX_FLAGS="/fp:fast" -D CMAKE_CUDA_FLAGS="--threads 0 --use_fast_math --resource-usage -Wno-deprecated-gpu-targets" -D CMAKE_CUDA_ARCHITECTURES="50;61-real;75-real;86-real;89-real" > > cmake --build build > > — > Reply to this email directly, view it on GitHub > <#7 (comment)>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/A5VH7CAI2RVFFI6IZ4HABYDYQ4R33AVCNFSM5A6LMPTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TCOJRGM4TIOJTGA2A> > . > You are receiving this because you commented.Message ID: > ***@***.***> >

WolframRhodium · 2024-01-30T02:43:26Z

Only cxx flags will be used. The flags depend on the compiler you use.

avs+ code should be re-implemented in vapoursynth for maximal performance in general.

newcapricasean · 2024-01-30T02:52:46Z

Is it possible to import or translate the process, filters, and scripts necessary for the temporaldegrain2 script to vapoursynth?

…

On Mon, Jan 29, 2024, 9:43 PM WolframRhodium ***@***.***> wrote: Only cxx flags will be used. The flags depend on the compiler you use. avs+ code should be re-implemented in vapoursynth for maximal performance in general. — Reply to this email directly, view it on GitHub <#7 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A5VH7CFW6D5KR2UJT3GSBRDYRBM5TAVCNFSM5A6LMPTKU5DIOJSWCZC7NNSXTN2JONZXKZKDN5WW2ZLOOQ5TCOJRGU4TMNZSG42Q> . You are receiving this because you commented.Message ID: ***@***.***>

WolframRhodium · 2024-01-30T03:26:27Z

This is not related to this repository.

tormento · 2024-05-17T12:38:59Z

Any wish to get AVS+ version on par with VS release?

WolframRhodium · 2024-05-17T12:54:43Z

The cuda backend is equivalent.

tormento · 2024-05-17T13:53:20Z

VAggregate is still external, plus you didn't add all the subsequent fixes and new cards.

WolframRhodium · 2024-05-17T14:16:55Z

VAggregate is external but it can be easily handled by various wrappers in the field.
Which fixes and new cards are not included?

tormento · 2024-05-18T11:43:37Z

I was reading:

bm3d.VAggregate should be called after temporal filtering, as in VapourSynth-BM3D. Alternatively, you may use the BM3Dv2() interface for both spatial and temporal denoising in one step.

Isn't that faster that a wrapper?

WolframRhodium · 2024-05-18T12:58:22Z

No.

tormento · 2024-05-19T08:39:08Z

About GPU support, forgive me if I am wrong but the last AVS+ build was on Jan 31, 2023 (R12.3.test).

Later you released VS builds up to R12.4, introducing AMD and Intel support.

WolframRhodium · 2024-05-19T09:09:52Z

I am not intended to port these implementations to AVS+.

kedaitinh12 · 2024-05-20T03:07:20Z

About GPU support, forgive me if I am wrong but the last AVS+ build was on Jan 31, 2023 (R12.3.test).

Later you released VS builds up to R12.4, introducing AMD and Intel support.

You can ask Asd-g if he is interested in it.

kedaitinh12 mentioned this issue Aug 11, 2021

vs DotKill Asd-g/AviSynth-bifrost#1

Closed

WolframRhodium added a commit that referenced this issue Oct 17, 2021

Fix AVS+ parameter processing (bm_range: 8 -> 9)

97efb4a

#7 (comment)

AVS+ Support #7

AVS+ Support #7

Comments

WolframRhodium commented Jul 25, 2021

WolframRhodium commented Jul 25, 2021 • edited

experimental releases

kedaitinh12 commented Jul 25, 2021

kedaitinh12 commented Jul 26, 2021

kedaitinh12 commented Sep 2, 2021

WolframRhodium commented Sep 3, 2021

kedaitinh12 commented Sep 3, 2021

WolframRhodium commented Sep 3, 2021

kedaitinh12 commented Sep 3, 2021 • edited

kedaitinh12 commented Sep 3, 2021 • edited

WolframRhodium commented Sep 4, 2021

kedaitinh12 commented Sep 4, 2021

mysteryx93 commented Oct 15, 2021

WolframRhodium commented Oct 17, 2021

Reel-Deal commented Oct 17, 2021

tormento commented Jun 18, 2022

tormento commented Oct 11, 2022

madey83 commented Oct 18, 2022

mysteryx93 commented Oct 19, 2022 • edited

WolframRhodium commented Oct 19, 2022 • edited

madey83 commented Oct 19, 2022

WolframRhodium commented Oct 19, 2022 • edited

madey83 commented Oct 19, 2022

WolframRhodium commented Oct 19, 2022 • edited

madey83 commented Oct 19, 2022

WolframRhodium commented Oct 19, 2022

madey83 commented Oct 19, 2022

WolframRhodium commented Oct 19, 2022

tormento commented Oct 19, 2022

tormento commented Jun 12, 2023

WolframRhodium commented Jun 12, 2023

tormento commented Jun 12, 2023

WolframRhodium commented Jun 12, 2023

tormento commented Jun 12, 2023

WolframRhodium commented Jun 12, 2023

newcapricasean commented Jan 28, 2024

WolframRhodium commented Jan 28, 2024

newcapricasean commented Jan 28, 2024 via email

WolframRhodium commented Jan 28, 2024

newcapricasean commented Jan 29, 2024 via email

WolframRhodium commented Jan 29, 2024

newcapricasean commented Jan 29, 2024 via email

WolframRhodium commented Jan 29, 2024

newcapricasean commented Jan 29, 2024 via email

WolframRhodium commented Jan 29, 2024

newcapricasean commented Jan 29, 2024 via email

newcapricasean commented Jan 29, 2024 via email

WolframRhodium commented Jan 30, 2024

newcapricasean commented Jan 30, 2024 via email

WolframRhodium commented Jan 30, 2024

tormento commented May 17, 2024

WolframRhodium commented May 17, 2024

tormento commented May 17, 2024

WolframRhodium commented May 17, 2024

tormento commented May 18, 2024

WolframRhodium commented May 18, 2024

tormento commented May 19, 2024

WolframRhodium commented May 19, 2024

kedaitinh12 commented May 20, 2024

WolframRhodium commented Jul 25, 2021 •

edited

kedaitinh12 commented Sep 3, 2021 •

edited

kedaitinh12 commented Sep 3, 2021 •

edited

mysteryx93 commented Oct 19, 2022 •

edited

WolframRhodium commented Oct 19, 2022 •

edited

WolframRhodium commented Oct 19, 2022 •

edited

WolframRhodium commented Oct 19, 2022 •

edited