Use more interesting videos for README experiment #351

scotts · 2024-11-08T14:56:50Z

Before this PR, our README experiments were using a video that was just a single blue frame for 30 seconds. This PR changes the README experiment to use two different videos:

A Mandelbrot video generated from FFmpeg.
The long version of the NASA video we often use for testing. It's got a lot of variety, and I think it's a great representative video.

I also experimented with a third video, using FFmpeg's rgbtestsrc option. The results were roughly the same as Mandelbrot - interesting for us, and some evidence that our existing videos are not outliers, but it doesn't tell our users much new, and it adds 30% to the size of the charts. I think even the current size is maybe too big.

The other big change is that our old charts were using the median with error bars representing p25 and p75 percentile. I changed our charts to use the mean and the error bars are the standard deviation. In my experience reporting performance in computer systems, I have always used mean and standard deviation rather than the median. I think including the outliers is important. You can have a great median and terrible p90 performance.

After discussing, it's easier to go with the median. We have a challenge that we're turning the times into a rate, and average that properly will take more thinking.

ahmadsharif1

I prefer median because it makes the default pytorch.utils.benchmark table results look identical to the charts. Mean may cause user confusion.

Also see my comment that it's not really the arith mean fps -- it's the harmonic mean

Using the higher res nasa video looks good to me

benchmarks/decoders/benchmark_decoders_library.py

NicolasHug · 2024-11-08T15:53:24Z

benchmarks/decoders/benchmark_decoders_library.py


            # Set the title for the subplot
-            base_video = os.path.basename(video)
+            base_video = os.path.basename(video).removesuffix(".mp4")


We should stop using os.path, I had tried to migrate awa from those already in #249

I know this wasn't introduced in this PR but do you mind relyin on pathlib for this?

NicolasHug · 2024-11-08T15:55:32Z

benchmarks/decoders/generate_readme_data.py


    # These are the number of uniform seeks we do in the seek+decode benchmark.
    num_samples = 10
+    video_files_paths = glob.glob(f"{videos_dir_path}/*.mp4")


We can use pathlib.glob here too instead of glob.glob

benchmarks/decoders/benchmark_decoders_library.py

ahmadsharif1 · 2024-11-08T19:35:57Z

README.md

 ![benchmark_results](./benchmarks/decoders/benchmark_readme_chart.png)

+The top row is a [Mandelbrot](https://ffmpeg.org/ffmpeg-filters.html#mandelbrot) video
+generated from FFmpeg that has a resolution of 1280x720 at 60 fps and is 120 seconds long.


Add the codec and pixel format too if you can

ahmadsharif1 · 2024-11-08T19:36:47Z

README.md


+The top row is a [Mandelbrot](https://ffmpeg.org/ffmpeg-filters.html#mandelbrot) video
+generated from FFmpeg that has a resolution of 1280x720 at 60 fps and is 120 seconds long.
+The bottom row is [promotional video from NASA](https://download.pytorch.org/torchaudio/tutorial-assets/stream-api/NASAs_Most_Scientifically_Complex_Space_Observatory_Requires_Precision-MP4_small.mp4)


The two charts look very similar. I would remove the fractal and just use the real video

I'm going to merge as-is. Let's discuss later what we want long-term. I have a slight preference to show two videos, to give some indication that the NASA video isn't a fluke and to include a higher resolution video. Put another way, I want someone to look at both videos and think "Huh, they're about the same performance."

scotts added 3 commits November 8, 2024 06:43

Use more interesting videos for README experiment

d4cdc20

Merge branch 'main' of github.com:pytorch/torchcodec into readme_video

2d6b82f

Apply linting.

d537a90

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 8, 2024

scotts marked this pull request as ready for review November 8, 2024 14:57

ahmadsharif1 reviewed Nov 8, 2024

View reviewed changes

benchmarks/decoders/benchmark_decoders_library.py Outdated Show resolved Hide resolved

NicolasHug approved these changes Nov 8, 2024

View reviewed changes

scotts added 2 commits November 8, 2024 10:28

Revert mean back to median

74db54d

Refactor out uses of os library

9d5ed07

ahmadsharif1 approved these changes Nov 8, 2024

View reviewed changes

Add encoding and pixel format to README

51b5146

scotts merged commit 43ee807 into meta-pytorch:main Nov 8, 2024
18 checks passed

scotts deleted the readme_video branch November 8, 2024 20:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use more interesting videos for README experiment #351

Use more interesting videos for README experiment #351

Uh oh!

scotts commented Nov 8, 2024 •

edited

Loading

Uh oh!

ahmadsharif1 left a comment

Uh oh!

Uh oh!

NicolasHug Nov 8, 2024

Uh oh!

NicolasHug Nov 8, 2024

Uh oh!

Uh oh!

ahmadsharif1 Nov 8, 2024

Uh oh!

ahmadsharif1 Nov 8, 2024

Uh oh!

scotts Nov 8, 2024 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Use more interesting videos for README experiment #351

Use more interesting videos for README experiment #351

Uh oh!

Conversation

scotts commented Nov 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ahmadsharif1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

NicolasHug Nov 8, 2024

Choose a reason for hiding this comment

Uh oh!

NicolasHug Nov 8, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ahmadsharif1 Nov 8, 2024

Choose a reason for hiding this comment

Uh oh!

ahmadsharif1 Nov 8, 2024

Choose a reason for hiding this comment

Uh oh!

scotts Nov 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

scotts commented Nov 8, 2024 •

edited

Loading

scotts Nov 8, 2024 •

edited

Loading