Assets 3

MvTools2 + Depan + DepanEstimate

Change log

2.7.35 (20181113)

  • MFlowXXX: Slight speed gain by putting the out-of-frame vector check to resizer
  • Fix: MAnalyze: Fix a possible internal overflow on larger blocksizes with small overlap e.g. BlkSize=32, OverlapX=0, OverlapY=4
  • MSuper: Planar RGB support.
  • MFlowFPS: Planar RGB support: generate vectors in YUV, use RGB input and super clip
  • MFlowFPS: less memory for 4:4:4 and greyscale
  • MFlowFPS: a bit faster 4:4:4, much faster greyscale
  • MFlowFPS: support different bit depth for clip and the vectors (use vectors from 8 bit analysis for a 16 bit clip)

2.7.34 (20181108)

  • MFlowInter: Use less memory, eliminates ten full-frame internal buffers

2.7.33 (20181021)

  • Fix: MFlowXX: random access violation caused by enlarged vectors pointing on out-of-frame positions

2.7.32 (20181018)

  • New: MAnalyze: Enhance mt mode report for Avisynth+: MT_SERIALIZED instead of MT_MULTI_INSTANCE when temporal=true or using output file.
  • Fix: MAnalyze: fix a possible internal overflow on larger blocksizes and lambda combinations. e.g. truemotion=true with blksize=32

2.7.31 (20180409)

  • Fix: MFlow: SC detection after having the mv clip. Fixed in 2.5.11.22 but was missed during 2.6.0.5 merge.
  • Fix: MFlow: crash in 16bit 4:2:0, mode=1
  • Fix: MDegrain, out16=true: Green bottom lines when overlap blocks are not covering the full vertical area

2.7.30 (20180405)

  • Fix: crash in MFlowInter (and possibly other MFlow...). v2.7.29 revelead this additional bug (which was not even 100% reproducible), this fix is basically the 2nd part of the solution.

2.7.29 (20180403)

  • Fix: MFlowInter (and possibly other MFlow...) crash with specific combination of analyze parameters (e.g. blkSize=16,overlapv=4,divide=1). Bug existed since at least 2.5.11.22

2.7.28 (20180323)

  • Fix: in MDegrain1-6/N allow Y8 input for out16 parameter

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Nov 8, 2018 · 10 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

Change log

2.7.34 (20181108)

  • MFlowInter: Use less memory, eliminates ten full-frame internal buffers

2.7.33 (20181021)

  • Fix: MFlowXX: random access violation caused by enlarged vectors pointing on out-of-frame positions

2.7.32 (20181018)

  • New: MAnalyze: Enhance mt mode report for Avisynth+: MT_SERIALIZED instead of MT_MULTI_INSTANCE when temporal=true or using output file.
  • Fix: MAnalyze: fix a possible internal overflow on larger blocksizes and lambda combinations. e.g. truemotion=true with blksize=32

2.7.31 (20180409)

  • Fix: MFlow: SC detection after having the mv clip. Fixed in 2.5.11.22 but was missed during 2.6.0.5 merge.
  • Fix: MFlow: crash in 16bit 4:2:0, mode=1
  • Fix: MDegrain, out16=true: Green bottom lines when overlap blocks are not covering the full vertical area

2.7.30 (20180405)

  • Fix: crash in MFlowInter (and possibly other MFlow...). v2.7.29 revelead this additional bug (which was not even 100% reproducible), this fix is basically the 2nd part of the solution.

2.7.29 (20180403)

  • Fix: MFlowInter (and possibly other MFlow...) crash with specific combination of analyze parameters (e.g. blkSize=16,overlapv=4,divide=1). Bug existed since at least 2.5.11.22

2.7.28 (20180323)

  • Fix: in MDegrain1-6/N allow Y8 input for out16 parameter

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Oct 22, 2018 · 13 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

Change log

2.7.33 (20181021)

  • Fix: MFlowXX: random access violation caused by enlarged vectors pointing on out-of-frame positions

2.7.32 (20181018)

  • New: MAnalyze: Enhance mt mode report for Avisynth+: MT_SERIALIZED instead of MT_MULTI_INSTANCE when temporal=true or using output file.
  • Fix: MAnalyze: fix a possible internal overflow on larger blocksizes and lambda combinations. e.g. truemotion=true with blksize=32

2.7.31 (20180409)

  • Fix: MFlow: SC detection after having the mv clip. Fixed in 2.5.11.22 but was missed during 2.6.0.5 merge.
  • Fix: MFlow: crash in 16bit 4:2:0, mode=1
  • Fix: MDegrain, out16=true: Green bottom lines when overlap blocks are not covering the full vertical area

2.7.30 (20180405)

  • Fix: crash in MFlowInter (and possibly other MFlow...). v2.7.29 revelead this additional bug (which was not even 100% reproducible), this fix is basically the 2nd part of the solution.

2.7.29 (20180403)

  • Fix: MFlowInter (and possibly other MFlow...) crash with specific combination of analyze parameters (e.g. blkSize=16,overlapv=4,divide=1). Bug existed since at least 2.5.11.22

2.7.28 (20180323)

  • Fix: in MDegrain1-6/N allow Y8 input for out16 parameter

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Oct 19, 2018 · 17 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

Change log
2.7.32 (20181018)

  • New: MAnalyze: Enhance mt mode report for Avisynth+: MT_SERIALIZED instead of MT_MULTI_INSTANCE when temporal=true or using output file.
  • Fix: MAnalyze: fix a possible internal overflow on larger blocksizes and lambda combinations. e.g. truemotion=true with blksize=32

2.7.31 (20180409)

  • Fix: MFlow: SC detection after having the mv clip. Fixed in 2.5.11.22 but was missed during 2.6.0.5 merge.
  • Fix: MFlow: crash in 16bit 4:2:0, mode=1
  • Fix: MDegrain, out16=true: Green bottom lines when overlap blocks are not covering the full vertical area

2.7.30 (20180405)

  • Fix: crash in MFlowInter (and possibly other MFlow...). v2.7.29 revelead this additional bug (which was not even 100% reproducible), this fix is basically the 2nd part of the solution.

2.7.29 (20180403)

  • Fix: MFlowInter (and possibly other MFlow...) crash with specific combination of analyze parameters (e.g. blkSize=16,overlapv=4,divide=1). Bug existed since at least 2.5.11.22

2.7.28 (20180323)

  • Fix: in MDegrain1-6/N allow Y8 input for out16 parameter

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Apr 9, 2018 · 28 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

2.7.31 (20180409)

  • Fix: MFlow: SC detection after having the mv clip. Fixed in 2.5.11.22 but was missed during 2.6.0.5 merge.
  • Fix: MFlow: crash in 16bit 4:2:0, mode=1
  • Fix: MDegrain, out16=true: Green bottom lines when overlap blocks are not covering the full vertical area

2.7.30 (20180405)

  • Fix: crash in MFlowInter (and possibly other MFlow...). v2.7.29 revelead this additional bug (which was not even 100% reproducible), this fix is basically the 2nd part of the solution.

2.7.29 (20180403)

  • Fix: MFlowInter (and possibly other MFlow...) crash with specific combination of analyze parameters (e.g. blkSize=16,overlapv=4,divide=1). Bug existed since at least 2.5.11.22

2.7.28 (20180323)

  • Fix: in MDegrain1-6/N allow Y8 input for out16 parameter

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Apr 5, 2018 · 34 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

2.7.30 (20180405)

  • Fix: crash in MFlowInter (and possibly other MFlow...). v2.7.29 revelead this additional bug (which was not even 100% reproducible), this fix is basically the 2nd part of the solution.

2.7.29 (20180403)

  • Fix: MFlowInter (and possibly other MFlow...) crash with specific combination of analyze parameters (e.g. blkSize=16,overlapv=4,divide=1). Bug existed since at least 2.5.11.22

2.7.28 (20180323)

  • Fix: in MDegrain1-6/N allow Y8 input for out16 parameter

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Apr 3, 2018 · 37 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

2.7.29 (20180403)

  • Fix: MFlowInter (and possibly other MFlow...) crash with specific combination of analyze parameters (e.g. blkSize=16,overlapv=4,divide=1). Bug existed since at least 2.5.11.22

2.7.28 (20180323)

  • Fix: in MDegrain1-6/N allow Y8 input for out16 parameter

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

2.7.21.22 (20170629)

  • Fix: [MDegrainN] fix chroma plane processing, regression since 2.7.1.22

2.7.20.22 (20170526) - hotfix

  • Fix: [MMask] greyscale input resulted in AV when filter exiting

2.7.19.22 (20170525)
New: [MMask] Support any planar input video formats e.g. greyscale, Planar RGB.
Input clip can even be of different bit depth or format from vector's original format
For kind==5 where U and V is filled, the greyscale option is not allowed

Mod: [MMask] Faster: request source frame only for kind=5.

Fix: [MxxxxFPS,MMask]: MakeVectorOcclusionMaskTime garbage in bottom blocks (30 hrs of debugging)

Fix: [MMask] bottom padding garbage for padded frame dimension

Fix: [MMask] proper 10+ bits scene change values (for default: 1023, 4095, 16383, 65535. Was: 65535)
Parameter is still in 8-bit range

Fix: [MRecalculate] prevent overflow during thSAD scaling in 16 bits or large block sizes (32, 48...)

Fix: [DepanEstimate] Giving wrong motion instead of scene change detection

Fix: [MAnalyze] Possible overflow in MAnalyze 8 bit, block size 48x48 and above.
Overflow-safe predictor recalc for big block sizes

New: [General] Add block size 12x3 for SAD, allow 6x24
List of available block sizes
64x64, 64x48, 64x32, 64x16
48x64, 48x48, 48x24, 48x12
32x64, 32x32, 32x24, 32x16, 32x8
24x48, 24x24, 24x32, 24x12, 24x6
16x64, 16x32, 16x16, 16x12, 16x8, 16x4, 16x2
12x48, 12x24, 12x16, 12x12, 12x6, 12x3
8x32, 8x16, 8x8, 8x4, 8x2, 8x1
6x24, 6x12, 6x6, 6x3
4x8, 4x4, 4x2
3x6, 3x3
2x4, 2x2

Mod: [Internal] Reorganized 10-16 bit SAD simd intrinsics, faster 8-12% for BlkSize 12-32

2.7.18.22 (20170512)
Fix: 10-16 bit: DCT buffer possible overflow

Fix: DCT is fast again for non 8x8 blocksizes. Regression since 2.7.5.22.

New: Chroma SAD is now always half of luma SAD, regardless of video format.
Without this: YV24's luma:chroma SAD ratio is 4:8 instead of 4:2 (of YV12)

New: MAnalyze, MRecalculate new parameter: "scaleCSAD" integer, default 0
Fine tune chroma SAD weight relative to luma SAD.
ScaleCSAD values for luma:chroma SAD ratio
-2: 4:0.5
-1: 4:1
0: 4:2 (default, same as the native ratio for YV12)
1: 4:4
2: 4:8

New: Block sizes 64, 48, 24, 12, 6
MAnalyze/MRecalculate new block sizes (SATD support mod4 sizes)
List of available block sizes
64x64, 64x48, 64x32, 64x16
48x64, 48x48, 48x24, 48x12
32x64, 32x32, 32x24, 32x16, 32x8
24x48, 24x24, 24x32, 24x12, 24x6
16x64, 16x32, 16x16, 16x12, 16x8, 16x4, 16x2
12x48, 12x24, 12x16, 12x12, 12x6
8x32, 8x16, 8x8, 8x4, 8x2, 8x1
6x24, 6x12, 6x6, 6x3
4x8, 4x4, 4x2
3x6, 3x3
2x4, 2x2

Note: some smaller block sizes can only be available in 4:4:4 formats, due to block size division (chroma subsampling)

New: All block sizes are supported in MDegrain1-6, MDegrainN, and MScaleVect

New: Changed to 2017 version of asm files for 8 bit SAD/SATD functions from x265 project.
Added not implemented asm code for 12, 24, 48 sizes
For some block sizes AVX2 and SSE4 is supported (AVX2 if reported under AviSynth+)
e.g. BlkSize 32 is faster now.

New: MMask SAD Mask to give identical weights for other-than-YV12 formats, e.g. for YV24

MvTools2 2.7.17.22 (20170426)
Fix: Regression in 2.7.16.22: MDegrain right pixel artifacts on non-modulo 16 widths
Misc: MMask, mode SADMask output is normalized further by video subsampling (YV16/YV24 has larger SAD value due to bigger chroma part that classic YV12)

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Mar 23, 2018 · 52 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

2.7.28 (20180323)

  • Fix: in MDegrain1-6/N allow Y8 input for out16 parameter

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

2.7.21.22 (20170629)

  • Fix: [MDegrainN] fix chroma plane processing, regression since 2.7.1.22

2.7.20.22 (20170526) - hotfix

  • Fix: [MMask] greyscale input resulted in AV when filter exiting

2.7.19.22 (20170525)
New: [MMask] Support any planar input video formats e.g. greyscale, Planar RGB.
Input clip can even be of different bit depth or format from vector's original format
For kind==5 where U and V is filled, the greyscale option is not allowed

Mod: [MMask] Faster: request source frame only for kind=5.

Fix: [MxxxxFPS,MMask]: MakeVectorOcclusionMaskTime garbage in bottom blocks (30 hrs of debugging)

Fix: [MMask] bottom padding garbage for padded frame dimension

Fix: [MMask] proper 10+ bits scene change values (for default: 1023, 4095, 16383, 65535. Was: 65535)
Parameter is still in 8-bit range

Fix: [MRecalculate] prevent overflow during thSAD scaling in 16 bits or large block sizes (32, 48...)

Fix: [DepanEstimate] Giving wrong motion instead of scene change detection

Fix: [MAnalyze] Possible overflow in MAnalyze 8 bit, block size 48x48 and above.
Overflow-safe predictor recalc for big block sizes

New: [General] Add block size 12x3 for SAD, allow 6x24
List of available block sizes
64x64, 64x48, 64x32, 64x16
48x64, 48x48, 48x24, 48x12
32x64, 32x32, 32x24, 32x16, 32x8
24x48, 24x24, 24x32, 24x12, 24x6
16x64, 16x32, 16x16, 16x12, 16x8, 16x4, 16x2
12x48, 12x24, 12x16, 12x12, 12x6, 12x3
8x32, 8x16, 8x8, 8x4, 8x2, 8x1
6x24, 6x12, 6x6, 6x3
4x8, 4x4, 4x2
3x6, 3x3
2x4, 2x2

Mod: [Internal] Reorganized 10-16 bit SAD simd intrinsics, faster 8-12% for BlkSize 12-32

2.7.18.22 (20170512)
Fix: 10-16 bit: DCT buffer possible overflow

Fix: DCT is fast again for non 8x8 blocksizes. Regression since 2.7.5.22.

New: Chroma SAD is now always half of luma SAD, regardless of video format.
Without this: YV24's luma:chroma SAD ratio is 4:8 instead of 4:2 (of YV12)

New: MAnalyze, MRecalculate new parameter: "scaleCSAD" integer, default 0
Fine tune chroma SAD weight relative to luma SAD.
ScaleCSAD values for luma:chroma SAD ratio
-2: 4:0.5
-1: 4:1
0: 4:2 (default, same as the native ratio for YV12)
1: 4:4
2: 4:8

New: Block sizes 64, 48, 24, 12, 6
MAnalyze/MRecalculate new block sizes (SATD support mod4 sizes)
List of available block sizes
64x64, 64x48, 64x32, 64x16
48x64, 48x48, 48x24, 48x12
32x64, 32x32, 32x24, 32x16, 32x8
24x48, 24x24, 24x32, 24x12, 24x6
16x64, 16x32, 16x16, 16x12, 16x8, 16x4, 16x2
12x48, 12x24, 12x16, 12x12, 12x6
8x32, 8x16, 8x8, 8x4, 8x2, 8x1
6x24, 6x12, 6x6, 6x3
4x8, 4x4, 4x2
3x6, 3x3
2x4, 2x2

Note: some smaller block sizes can only be available in 4:4:4 formats, due to block size division (chroma subsampling)

New: All block sizes are supported in MDegrain1-6, MDegrainN, and MScaleVect

New: Changed to 2017 version of asm files for 8 bit SAD/SATD functions from x265 project.
Added not implemented asm code for 12, 24, 48 sizes
For some block sizes AVX2 and SSE4 is supported (AVX2 if reported under AviSynth+)
e.g. BlkSize 32 is faster now.

New: MMask SAD Mask to give identical weights for other-than-YV12 formats, e.g. for YV24

MvTools2 2.7.17.22 (20170426)
Fix: Regression in 2.7.16.22: MDegrain right pixel artifacts on non-modulo 16 widths
Misc: MMask, mode SADMask output is normalized further by video subsampling (YV16/YV24 has larger SAD value due to bigger chroma part that classic YV12)

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Mar 18, 2018 · 55 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

2.7.27 (20180318)

  • Fix: MDepan: use zerow parameter. The parameter had no effect probably since it had been introduced. (veins1)
  • MDepan: report MT mode for Avisynth+. MT_MULTI_INSTANCE, except for logfile writing output mode when it reports MT_SERIALIZED. (other filters already have proper registration, MDepan was missed)

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-6 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

2.7.21.22 (20170629)

  • Fix: [MDegrainN] fix chroma plane processing, regression since 2.7.1.22

2.7.20.22 (20170526) - hotfix

  • Fix: [MMask] greyscale input resulted in AV when filter exiting

2.7.19.22 (20170525)
New: [MMask] Support any planar input video formats e.g. greyscale, Planar RGB.
Input clip can even be of different bit depth or format from vector's original format
For kind==5 where U and V is filled, the greyscale option is not allowed

Mod: [MMask] Faster: request source frame only for kind=5.

Fix: [MxxxxFPS,MMask]: MakeVectorOcclusionMaskTime garbage in bottom blocks (30 hrs of debugging)

Fix: [MMask] bottom padding garbage for padded frame dimension

Fix: [MMask] proper 10+ bits scene change values (for default: 1023, 4095, 16383, 65535. Was: 65535)
Parameter is still in 8-bit range

Fix: [MRecalculate] prevent overflow during thSAD scaling in 16 bits or large block sizes (32, 48...)

Fix: [DepanEstimate] Giving wrong motion instead of scene change detection

Fix: [MAnalyze] Possible overflow in MAnalyze 8 bit, block size 48x48 and above.
Overflow-safe predictor recalc for big block sizes

New: [General] Add block size 12x3 for SAD, allow 6x24
List of available block sizes
64x64, 64x48, 64x32, 64x16
48x64, 48x48, 48x24, 48x12
32x64, 32x32, 32x24, 32x16, 32x8
24x48, 24x24, 24x32, 24x12, 24x6
16x64, 16x32, 16x16, 16x12, 16x8, 16x4, 16x2
12x48, 12x24, 12x16, 12x12, 12x6, 12x3
8x32, 8x16, 8x8, 8x4, 8x2, 8x1
6x24, 6x12, 6x6, 6x3
4x8, 4x4, 4x2
3x6, 3x3
2x4, 2x2

Mod: [Internal] Reorganized 10-16 bit SAD simd intrinsics, faster 8-12% for BlkSize 12-32

2.7.18.22 (20170512)
Fix: 10-16 bit: DCT buffer possible overflow

Fix: DCT is fast again for non 8x8 blocksizes. Regression since 2.7.5.22.

New: Chroma SAD is now always half of luma SAD, regardless of video format.
Without this: YV24's luma:chroma SAD ratio is 4:8 instead of 4:2 (of YV12)

New: MAnalyze, MRecalculate new parameter: "scaleCSAD" integer, default 0
Fine tune chroma SAD weight relative to luma SAD.
ScaleCSAD values for luma:chroma SAD ratio
-2: 4:0.5
-1: 4:1
0: 4:2 (default, same as the native ratio for YV12)
1: 4:4
2: 4:8

New: Block sizes 64, 48, 24, 12, 6
MAnalyze/MRecalculate new block sizes (SATD support mod4 sizes)
List of available block sizes
64x64, 64x48, 64x32, 64x16
48x64, 48x48, 48x24, 48x12
32x64, 32x32, 32x24, 32x16, 32x8
24x48, 24x24, 24x32, 24x12, 24x6
16x64, 16x32, 16x16, 16x12, 16x8, 16x4, 16x2
12x48, 12x24, 12x16, 12x12, 12x6
8x32, 8x16, 8x8, 8x4, 8x2, 8x1
6x24, 6x12, 6x6, 6x3
4x8, 4x4, 4x2
3x6, 3x3
2x4, 2x2

Note: some smaller block sizes can only be available in 4:4:4 formats, due to block size division (chroma subsampling)

New: All block sizes are supported in MDegrain1-6, MDegrainN, and MScaleVect

New: Changed to 2017 version of asm files for 8 bit SAD/SATD functions from x265 project.
Added not implemented asm code for 12, 24, 48 sizes
For some block sizes AVX2 and SSE4 is supported (AVX2 if reported under AviSynth+)
e.g. BlkSize 32 is faster now.

New: MMask SAD Mask to give identical weights for other-than-YV12 formats, e.g. for YV24

MvTools2 2.7.17.22 (20170426)
Fix: Regression in 2.7.16.22: MDegrain right pixel artifacts on non-modulo 16 widths
Misc: MMask, mode SADMask output is normalized further by video subsampling (YV16/YV24 has larger SAD value due to bigger chroma part that classic YV12)

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356

@pinterf pinterf released this Mar 14, 2018 · 60 commits to mvtools-pfmod since this release

Assets 3

MvTools2 + Depan + DepanEstimate

2.7.26 (20180314)

  • New: MDegrain1-6 and N: new parameter bool "out16" = false. If set, 8 bit input results in native 16bit output (like lsb=true hack but this is native). Faster than lsb=true by up to 12% (i7-7700)
  • Faster: special 10 bit SAD functions instead of the generic 10-16bit ones. Depending on the block size, 4-17% gain for a typical MDegrain1-5 session (for exactly 10 bit clips)

2.7.25 (20180227)

  • Fix: x64: not-cleared mmx state in MSuper assembly code would cause crash later, e.g. in x264 encoding, depending on following filters.
  • Fix: MSCDetection SC value parameter name to Ysc from Yth (must be an ancient typo), docs are OK, but the fix is mentioned in docs
  • MSuper: import 8 bit sse2 interpolators from mvtools-vs. Extend them for 10-16bits (faster super clip). Some filters are still todo.
  • MSuper: support 32bit float clips, which can be used later by MDegrains (but not for MAnalyse)
  • MDegrains: allow degraining clip with different bit depth from vectors. Clip and Super must be the same bit depth
  • MDegrains: consistently use limit and limitC, 255 do nothing, otherwise scale 0-254 value to the current bit-depth range
  • Overlaps: more correct internal rounding for 8 bits:
    old: pixel = Sum( (tmp + 256) >> 6) >> 5
    new: pixel = (Sum( (tmp + 32) >> 6) + 16) >> 5
  • Overlaps: round for 16bits
    old: pixel = Sum(tmp) >> 11
    new: pixel = (Sum(tmp) + 1024) >> 11
  • Overlaps: 32bit float (but still use the original 11 bit window constants)
  • Project: change from yasm to nasm.

2.7.24 (20171205)

  • Fix: MFlowBlur: possible access violation crash when nPel>1
  • New: MScaleVect parameter 'bits'. e.g. Analyze 8 bit clips, use their vectors for 16 bits
  • Move project to VS2017

2.7.23 (20171012)

  • Fix: MScaleVect wrong rounding of scaled motion vectors with negative components. e.g. proper scaling (-1;-2) to (-2;-4) instead of (-1,-3)

2.7.22 (20170830)

  • Misc: Stop using version suffix .22
  • Fix: [DCT 8x8@8bit] garbage on x64: internal assembly code did not save xmm6/xmm7
  • Fix: [DCT 8x8@8bit] safe multithreading for integer DCT (8x8 block size, 8 bit video): assembly had a single working buffer.
  • Fix: [MDegrain] did not release input motion vector clips in destructor, possible hang at script closing. Bug since 2.7.1.22 (introducing MDegrain4/5)
  • Mod: fftw conversion constant of sqrt(2)/2 is more accurate (was:0.707), 16 bit formats may benefit (by feisty2)
    Fix: SSE4 assembly instructions in x64, broke on non-SSE4 processors

2.7.21.22 (20170629)

  • Fix: [MDegrainN] fix chroma plane processing, regression since 2.7.1.22

2.7.20.22 (20170526) - hotfix

  • Fix: [MMask] greyscale input resulted in AV when filter exiting

2.7.19.22 (20170525)
New: [MMask] Support any planar input video formats e.g. greyscale, Planar RGB.
Input clip can even be of different bit depth or format from vector's original format
For kind==5 where U and V is filled, the greyscale option is not allowed

Mod: [MMask] Faster: request source frame only for kind=5.

Fix: [MxxxxFPS,MMask]: MakeVectorOcclusionMaskTime garbage in bottom blocks (30 hrs of debugging)

Fix: [MMask] bottom padding garbage for padded frame dimension

Fix: [MMask] proper 10+ bits scene change values (for default: 1023, 4095, 16383, 65535. Was: 65535)
Parameter is still in 8-bit range

Fix: [MRecalculate] prevent overflow during thSAD scaling in 16 bits or large block sizes (32, 48...)

Fix: [DepanEstimate] Giving wrong motion instead of scene change detection

Fix: [MAnalyze] Possible overflow in MAnalyze 8 bit, block size 48x48 and above.
Overflow-safe predictor recalc for big block sizes

New: [General] Add block size 12x3 for SAD, allow 6x24
List of available block sizes
64x64, 64x48, 64x32, 64x16
48x64, 48x48, 48x24, 48x12
32x64, 32x32, 32x24, 32x16, 32x8
24x48, 24x24, 24x32, 24x12, 24x6
16x64, 16x32, 16x16, 16x12, 16x8, 16x4, 16x2
12x48, 12x24, 12x16, 12x12, 12x6, 12x3
8x32, 8x16, 8x8, 8x4, 8x2, 8x1
6x24, 6x12, 6x6, 6x3
4x8, 4x4, 4x2
3x6, 3x3
2x4, 2x2

Mod: [Internal] Reorganized 10-16 bit SAD simd intrinsics, faster 8-12% for BlkSize 12-32

2.7.18.22 (20170512)
Fix: 10-16 bit: DCT buffer possible overflow

Fix: DCT is fast again for non 8x8 blocksizes. Regression since 2.7.5.22.

New: Chroma SAD is now always half of luma SAD, regardless of video format.
Without this: YV24's luma:chroma SAD ratio is 4:8 instead of 4:2 (of YV12)

New: MAnalyze, MRecalculate new parameter: "scaleCSAD" integer, default 0
Fine tune chroma SAD weight relative to luma SAD.
ScaleCSAD values for luma:chroma SAD ratio
-2: 4:0.5
-1: 4:1
0: 4:2 (default, same as the native ratio for YV12)
1: 4:4
2: 4:8

New: Block sizes 64, 48, 24, 12, 6
MAnalyze/MRecalculate new block sizes (SATD support mod4 sizes)
List of available block sizes
64x64, 64x48, 64x32, 64x16
48x64, 48x48, 48x24, 48x12
32x64, 32x32, 32x24, 32x16, 32x8
24x48, 24x24, 24x32, 24x12, 24x6
16x64, 16x32, 16x16, 16x12, 16x8, 16x4, 16x2
12x48, 12x24, 12x16, 12x12, 12x6
8x32, 8x16, 8x8, 8x4, 8x2, 8x1
6x24, 6x12, 6x6, 6x3
4x8, 4x4, 4x2
3x6, 3x3
2x4, 2x2

Note: some smaller block sizes can only be available in 4:4:4 formats, due to block size division (chroma subsampling)

New: All block sizes are supported in MDegrain1-6, MDegrainN, and MScaleVect

New: Changed to 2017 version of asm files for 8 bit SAD/SATD functions from x265 project.
Added not implemented asm code for 12, 24, 48 sizes
For some block sizes AVX2 and SSE4 is supported (AVX2 if reported under AviSynth+)
e.g. BlkSize 32 is faster now.

New: MMask SAD Mask to give identical weights for other-than-YV12 formats, e.g. for YV24

MvTools2 2.7.17.22 (20170426)
Fix: Regression in 2.7.16.22: MDegrain right pixel artifacts on non-modulo 16 widths
Misc: MMask, mode SADMask output is normalized further by video subsampling (YV16/YV24 has larger SAD value due to bigger chroma part that classic YV12)

For full changelog see documentation.

Bundled filters:
Depan: 2.13.1.3 (20170525)
DepanEstimate: 2.10.0.2 (20170525)
General support of 10-16 bit formats with Avisynth Plus (r2294 or newer recommended)

x86 and x64 builds
Windows XP is still supported

forum:
http://forum.doom9.org/showthread.php?t=173356