Video stream bframe support by caiopiccirillo · Pull Request #12700 · rerun-io/rerun · GitHub

caiopiccirillo · 2026-03-20T12:00:48Z

What

Adds B-frame support for H.264/H.265 streams on the VideoStream archetype.

New types:

VideoPresentationTimestampOffset datatype (transparent i64) and matching component — represents the per-sample offset from decode timestamp (DTS) to presentation timestamp (PTS), in timeline units.

Archetype changes (`VideoStream`):

sample field is now array-typed VideoSample, allowing multiple frames to be batched into a single row in decode order.

New optional presentation_time_offset field `VideoPresentationTimestampOffset — when present, PTS for sample i = DTS_i + offset_i. When absent, PTS == DTS (the common no-B-frame case, fully backward compatible).

Cache/decoder pipeline (`VideoStreamCache`):

Frame numbers are now assigned by presentation order (PTS-sorted) instead of decode order.
Sample durations are computed from PTS-sorted timestamps so scrubbing and seek work correctly.

read_pts_offsets_from_chunk extracts offset arrays from incoming chunks.

Incremental chunk additions properly update frame numbers and SamplesStatistics.
demux/mod.rs: video duration is now computed from actual min/max PTS to handle reordered frames.

… definitions

…sample array-typed

…am changes

…r B-frames

…ditions

…test - update_sample_durations: only clear the last sample's duration when the range extends to the actual end of the sample deque. Previously a sub-range update (from split/compacted chunk handling) would unconditionally clear a duration already computed by a wider pass. - Add #[expect(clippy::cast_sign_loss)] annotation on Arrow offset cast in read_pts_offsets_from_chunk for convention compliance. - Add video_stream_cache_bframe_incremental_buildup test that exercises the on_store_events path with B-frame offsets, verifying frame_nr, PTS offsets, and statistics stay consistent as frames arrive one at a time (with both compaction enabled and disabled).

github-actions

Hi! Thanks for opening this pull request.

Because this is your first time contributing to this repository, make sure you've read our Contributor Guide and Code of Conduct.

Wumpf · 2026-03-20T12:20:03Z

Thanks for wanting to contribute! This is a fairly complex and large piece that comes with quite some design choices & testing needs. Your description seems to be entirely generated, which can be fine but in the future make sure to add your own thoughts about the tradeoffs involved, what's good, what's bad, how this was tested, some screenshots & examples etc.

Either way we don't have time to process that right now, so I'll put it on draft

See also https://github.com/rerun-io/rerun/blob/main/CONTRIBUTING.md#what-to-contribute

Wumpf · 2026-03-20T12:24:18Z

Your description seems to be entirely generated, which can be fine

To clarify, I don’t care that it’s generated but I deeply care about it being reviewable and indicating that thought went into the things where thought is required! Right now that's just the usual "agent did things" listing which doesn't help understanding the relevant pieces much

caiopiccirillo · 2026-03-21T15:03:15Z

@Wumpf Sorry for the generated description. I was in a hurry yesterday and thought it was more important to review the code with a concise description.

But I took some time to describe the decisions/considerations that were made in the design.

I chose to implement DTS on the timeline with PTS offset per sample. The timeline timestamp is the decoding timestamp, but an optional field (presentation_time_offset, which is an int64 with the timeline unit) was added that gives the delta PTS - DTS per sample, and when is absent, PTS == DTS enabling backward compatibility at no cost for no B-frames.

Two alternatives were considered for the problem, the first being the one suggested in the issue itself, which would be to add the decoder_timeline, but I see that it solves another problem that can be addressed in the future (which would be if the user logs the samples out of decoding order), but the current implementation assumes that the samples arrive in DTS order on the timeline. The second alternative would be to separate the DTS and PTS fields, but I thought that wouldn't be efficient in the sense of duplicating the metadata, and the decoder doesn't need the DTS.

Making sample an array was necessary because B-frame encoders naturally output batches, thus allowing multiple samples per row and keeping the sample stream in decoding order within a log call (which aligns with how encoder outputs data).

I see some limitations in my implementation:

I haven't implemented the decoder_timeline field.
I haven't tested for AV1/VP9.
The frame number assignment is done by PTS order, which can be negligible for "typical" streams, but is worth noting for very large batches.

Finally, I tested using unit tests for the cases I found relevant, but I also did a manual e2e test using an H.264 file (re-encoded with libx264 with max_b_frames=3) and streaming it through the Python SDK and it seems to be working.

I'm open to discuss further if changes are needed 😄

caiopiccirillo and others added 12 commits March 19, 2026 15:44

feat: add VideoPresentationTimestampOffset datatype and component FBS…

a90ca4f

… definitions

feat: add presentation_time_offset to VideoStream archetype and make …

ba57952

…sample array-typed

chore: run codegen for VideoPresentationTimestampOffset and VideoStre…

9ccd8db

…am changes

feat: implement B-frame PTS offset support in VideoStreamCache

d4030af

fix: compute video duration from actual min/max PTS to handle B-frames

44b6ee3

fix: migrate downstream callers from with_many_sample to with_sample

42e54b2

test: update video visual test to use DTS timeline and PTS offsets fo…

538c418

…r B-frames

docs: update documentation and snippets for B-frame PTS offset support

34fc310

fix: update frame_nr and SamplesStatistics after incremental chunk ad…

a823bf8

…ditions

docs: add B-frame support section to video concept page

0a4e78e

Merge branch 'main' into feat/video-stream-bframe-support

25eed25

github-actions Bot reviewed Mar 20, 2026

View reviewed changes

Wumpf marked this pull request as draft March 20, 2026 12:20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Video stream bframe support#12700

Video stream bframe support#12700
caiopiccirillo wants to merge 12 commits into
rerun-io:mainfrom
caiopiccirillo:feat/video-stream-bframe-support

caiopiccirillo commented Mar 20, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

Wumpf commented Mar 20, 2026

Uh oh!

Wumpf commented Mar 20, 2026

Uh oh!

caiopiccirillo commented Mar 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

caiopiccirillo commented Mar 20, 2026

Related

What

New types:

Archetype changes (VideoStream):

Cache/decoder pipeline (VideoStreamCache):

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Wumpf commented Mar 20, 2026

Uh oh!

Wumpf commented Mar 20, 2026

Uh oh!

caiopiccirillo commented Mar 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Archetype changes (`VideoStream`):

Cache/decoder pipeline (`VideoStreamCache`):