Skip to content

Train new model #320

@Agamemnus-uA

Description

@Agamemnus-uA

Hi, I've been on a mission to benchmark different encoders and their settings using VMAF as well as some other metrics. I get 2 main points of feedback. 1, which I think is incorrect, is that you can't use model 0.6.1 on a reference/distorted pair in 720p. I believe most people are actually incorrect in this sense, since I don't compare absolute scores across resolutions, I only compare 720p reference/distorted with other distorted 720p encodes of the same source, and see which one performs higher. Same goes for 1080p and 1440p, all distorted encodes are the same resolution as the reference in my benchmarks. Can you confirm this is fine or not?

The second main point of feedback I get is that VMAF isn't tuned to predict gameplay footage. Since I'm focused on encoding choices/options for gameplay footage, I can't help but shake the feeling that the dataset you've used to train the models you offer is in fact not suited to this. Can you confirm this is true, to either a small or large degree?

If so, I'm very interested in training a new model but I have no idea how to do it and would need help. I would think something like 1080p at 2xH viewing distance or maybe 1440p at 1.5xH viewing distance might be an appropriate underlying assumption for most computer/desk viewer situations. I have access to hundreds of people I can use to gather opinion score data, with a variety of distortion versions from an original, and can do it across multiple references (say 3 or 4 different games). The thing is, I have no idea how to store these scores in a way that a training mechanism like libsvm will read, and I have no idea how to generate a final model file that I can then actually use through FFMPEG and the libvmaf filter.

If this idea is a waste of time, and there's little reason to create a new model for gameplay footage for desk/pc viewers, then I'd love to have that confirmed here. But if there's value in doing this, I'm very interested in being a part of it, but I'd essentially need a n00b tutorial on how to create the model. I will happily share any and all opinion and benchmark data you like with you here or elsewhere.

If you want to see some of the benchmarks I've been doing here's a few recent ones:
https://unrealaussies.com/tech/x264-presets-and-settings-quality-analysis-of-720p60-apex-legends/
https://unrealaussies.com/tech/nvenc-x264-quicksync-qsv-vp9-av1/
https://unrealaussies.com/tech/nvenc-x264-obs/

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions