Allow audio decoder to seek backwards #550

NicolasHug · 2025-03-12T14:49:32Z

Towards #549.

Changes are quite subtle, but I'm fairly confident they work as expected, since they pass our robust tests. I'll make sure to write some solid documentation around the mechanisms involved eventually.

Benchmarks results show no perf hit

Duration: 13s
torchcodec: med = 8.05ms +- 1.13
torchaudio: med = 12.27ms +- 0.51

Duration: 13s
torchcodec: med = 4.20ms +- 0.91
torchaudio: med = 7.21ms +- 0.58

Duration: 2m11s
torchcodec: med = 28.26ms +- 0.95
torchaudio: med = 45.72ms +- 0.89

Duration: 1h27m
torchcodec: med = 1046.49ms +- 66.00
torchaudio: med = 1746.49ms +- 22.55

Benchmark code is the same as in #538, I'm not benchmarking the "backwards seeking" logic.

NicolasHug · 2025-03-12T14:50:24Z

test/decoders/test_ops.py

+            # indices.
+            # Ultimately, this test compares a "stateful decoder" which calls
+            # `get_frames_by_pts_in_range_audio()`` multiple times with a
+            # "stateless decoder" (the one here, treated as the reference)


I've convinced myself that we should actually keep this helper instead of doing the conversion. Thoughts?

I think that's fine - then what we're testing is that decoder behaves the same when it seeks to a location "fresh" versus having to seek from some given location, including backwards. That seems reasonable.

NicolasHug · 2025-03-12T14:53:07Z

src/torchcodec/decoders/_core/VideoDecoder.cpp

+    // of the stream.
+    // TODO-AUDIO: document why this is needed in a big comment.
+    setCursorPtsInSeconds(INT64_MIN);
+  }


Note that this is INT64_MIN and not 0, because some packets actually start before 0. In one of our assets the first packet is at -1024.
I noticed that passing an arbitrary low value like -999999 makes FFmpeg unhappy and raise and error, but INT64_MIN seems to be understood and correct (although I haven't found docs on this).

NicolasHug added 3 commits March 12, 2025 14:32

WELL THIS WORKS

fec8a70

Enable backwards seeks

39e7414

Comment

f512912

NicolasHug requested a review from scotts March 12, 2025 14:49

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Mar 12, 2025

NicolasHug commented Mar 12, 2025

View reviewed changes

NicolasHug mentioned this pull request Mar 12, 2025

Audio decoding TODOs #549

Closed

7 tasks

NicolasHug commented Mar 12, 2025

View reviewed changes

Fix compilation

31d025b

scotts approved these changes Mar 12, 2025

View reviewed changes

NicolasHug merged commit c6de04a into pytorch:main Mar 12, 2025
46 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow audio decoder to seek backwards #550

Allow audio decoder to seek backwards #550

NicolasHug commented Mar 12, 2025 •

edited

Loading

NicolasHug Mar 12, 2025

scotts Mar 12, 2025

NicolasHug Mar 12, 2025

Allow audio decoder to seek backwards #550

Allow audio decoder to seek backwards #550

Conversation

NicolasHug commented Mar 12, 2025 • edited Loading

NicolasHug Mar 12, 2025

Choose a reason for hiding this comment

scotts Mar 12, 2025

Choose a reason for hiding this comment

NicolasHug Mar 12, 2025

Choose a reason for hiding this comment

NicolasHug commented Mar 12, 2025 •

edited

Loading