comma-lab

Scorer-backed compression experiments for the comma.ai video compression challenge. This page is the brief: current state, search path, and supporting evidence.

Contest

comma.ai’s public challenge asks entrants to ship an archive.zip that inflates to video. The published score combines archive bytes, SegNet distortion, and PoseNet distortion on the public test clip.

Who we are

comma-lab is a public experiment log and submission repo maintained by Alejandro Pena. It publishes measured runs, rejected branches, and the current promoted operating point in one place.

Last updated

Apr 6, 2026, 1:57 PM CDT

Generated from repository state plus scorer-backed artifacts stored in this repo.

  • Track B current_workflow: 2.12 at 864,486 bytes
  • Track B rule_faithful: 2.142 at 897,745 bytes
  • Delta vs published baseline: -2.27 (4.39 → 2.12)
  • Delta vs prior floor: -0.06 (+31 bytes)

Track A remains separate: `current_workflow` 0.00 at 167 bytes. The robust run ledger currently contains 35 measured runs, 10 promotions, and 15 explicit rejections.

robust_current · libsvtav1 · 524x394 · film-grain=22 · lanczos/lanczos · rgb24(pc)

Score vs bytes

Track B runs only. Better runs move toward the lower-left. The x-axis uses log scaling. The y-axis is linear. The severe AV1 bug run at 97.45 is omitted here so the operating range stays legible; it remains documented in the search-path section.

[Scatter plot. x-axis: archive size (bytes, log scale). y-axis: current_workflow score (linear, lower is better). Legend: promotion · explicit rejection · measured run.]

Lower is better. The plot is generated directly from scorer-backed artifacts in the repository.

Search path

A branch view of the measured search path: x265 reductions, ROI failures, the AV1 byte-layout failure, and the later hardening step that moved the floor to 2.12.

Legend: x265 / earlier promotions · current floor · diagnostic bug node · rejection / failed branch

| Node | Run | Score | Bytes | Settings |
| --- | --- | --- | --- | --- |
| baseline | robust_current-baseline-cpu-2026-04-03 | 4.06 | 3,735,828 | 512x384 · crf 22 · k32 · lanczos/bicubic |
| 512x384 / crf23 | robust_current-medium23-cpu-2026-04-03 | 3.62 | 2,819,374 | 512x384 · crf 23 · k32 · lanczos/bicubic |
| 448x336 | robust_current-448x336-medium23-cpu-2026-04-03 | 3.56 | 1,978,141 | 448x336 · crf 23 · k32 · lanczos/bicubic |
| lanczos/lanczos | robust_current-lanczos-lanczos-cpu-2026-04-04 | 3.54 | 1,901,606 | 448x336 · crf 23 · k48 · lanczos/lanczos |
| 432x324 | robust_current-432x324-cpu-2026-04-04 | 3.33 | 1,781,129 | 432x324 · crf 23 · k48 · lanczos/lanczos |
| x265 floor | robust_current-cand-424x318-crf23-g48-b4-r4-cpu-2026-04-05 | 3.25 | 1,669,984 | 424x318 · crf 23 · k48 · lanczos/lanczos |
| dynamic ROI | robust_current-dynamic-main-roi-cpu-2026-04-05 | 4.47 | 2,660,388 | 432x324 · crf 23 · k48 · lanczos/lanczos |
| nearby scale | robust_current-cand-428x320-crf23-g48-b4-r4-cpu-2026-04-05 | 3.32 | 1,741,924 | 428x320 · crf 23 · k48 · lanczos/lanczos |
| AV1 failure | robust_current-av1-524x394-cpu-2026-04-05 | 97.45 | 920,457 | 524x394 · crf 33 · svtav1-p0 · lanczos/bicubic |
| current floor | robust_current-av1-524x394-colorspace-hardening-promoted-cpu-2026-04-06 | 2.12 | 864,486 | 524x394 · crf 34 · svtav1-p0 · explicit color |

This graph is selective by design. It shows the runs that changed the operating point or changed the lab’s understanding of the evaluator.

Original vs inflated output

Browser preview assets for inspection only. Official scoring still comes from the scorer-backed reports.

Original

Source contest video excerpt.

Inflated output

Current promoted inflate path rendered to a browser preview.

Original zoom

Crop focused on task-relevant structure.

Inflated zoom

Same crop after compression and inflate.

Compression: 864,486 bytes. Distortion: PoseNet 0.09384175, SegNet 0.00575313. Total score: 2.12.

Walkthrough

  1. Evaluator path

    The scorer resizes frames and measures task distortion. It is sensitive to pipeline details, not just visual appearance.

    PoseNet sees both frames. SegNet sees only the last frame in each pair. That asymmetry is why small encoding or decode-path changes can move the score.

  2. Critical bug

    The main AV1 failure was a byte-layout error, not a codec-limit problem.

    The failed path emitted rawvideo as yuv444p bytes. The corrected path forces rgb24, which matches the evaluator’s raw-frame expectation.

  3. Current operating point

    The current honest floor is 2.12 at 864,486 bytes.

    Against the first honest baseline (4.06 at 3,735,828 bytes), the lab cut the archive by about 77% while preserving task signal well enough to lower the score by 1.94.
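
The byte-layout failure described in step 2 can be illustrated with a toy frame. The sketch below is not the repository's code: the helper names (`pack_rgb24`, `planar_444`, `read_as_rgb24`) and the sample values are hypothetical, and it deliberately ignores the YUV-to-RGB color conversion so that only the layout problem is visible. It shows that a planar 4:4:4 buffer and a packed rgb24 buffer have the same length, so a size check cannot catch the mismatch, while the pixels come out scrambled when the planar bytes are read as packed.

```python
# Toy 2x2 frame; each tuple holds three 8-bit channel samples (made-up values).
pixels = [(10, 20, 30), (11, 21, 31), (12, 22, 32), (13, 23, 33)]


def pack_rgb24(px):
    """Packed layout: the three samples of each pixel are interleaved (rgb24-style)."""
    out = bytearray()
    for a, b, c in px:
        out += bytes((a, b, c))
    return bytes(out)


def planar_444(px):
    """Planar layout: all of plane 0, then plane 1, then plane 2 (yuv444p-style)."""
    return (
        bytes(p[0] for p in px)
        + bytes(p[1] for p in px)
        + bytes(p[2] for p in px)
    )


def read_as_rgb24(raw):
    """Interpret a raw buffer as packed 3-byte pixels, as an rgb24 reader would."""
    return [tuple(raw[i:i + 3]) for i in range(0, len(raw), 3)]


packed = pack_rgb24(pixels)
planar = planar_444(pixels)

# Both buffers are width * height * 3 bytes, so length alone reveals nothing.
print(len(packed), len(planar))
# Reading the packed buffer round-trips; reading the planar one does not.
print(read_as_rgb24(packed) == pixels)
print(read_as_rgb24(planar))
```

Because the two layouts are indistinguishable by size, a failure of this kind tends to surface only as a score collapse rather than as a decode error, which is consistent with the 97.45 run.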

Why 2.12 beat 2.18

Prior floor: 2.18 at 864,455 bytes (implicit color handling)

Current floor: 2.12 at 864,486 bytes (explicit bt709/tv encode tags · explicit rgb24(pc) decode)

| Metric | Prior | Current | Delta |
| --- | --- | --- | --- |
| current_workflow score | 2.18 | 2.12 | -0.06 |
| archive bytes | 864,455 | 864,486 | +31 |
| PoseNet distortion | 0.10656800 | 0.09384175 | -0.01272625 |
| SegNet distortion | 0.00569617 | 0.00575313 | +0.00005696 |

The bytes barely moved (+31). The score change came primarily from lower PoseNet distortion; SegNet distortion rose slightly.
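
A rough per-term breakdown makes the PoseNet claim concrete. The sketch below rests on an assumption that is not stated on this page: that the distortion part of the score has the form 100 × SegNet + sqrt(10 × PoseNet), with a separate rate term for archive bytes. Treat the weights and the helper name `distortion_terms` as hypothetical and check them against the official scorer before relying on the numbers. The rate term is left out because the archive grew by only 31 bytes.

```python
import math


def distortion_terms(posenet, segnet):
    """Distortion contributions under the ASSUMED score form (verify against the scorer)."""
    return {
        "posenet": math.sqrt(10.0 * posenet),  # assumed PoseNet weighting
        "segnet": 100.0 * segnet,              # assumed SegNet weighting
    }


# Distortion values copied from the prior and current floors on this page.
prior = distortion_terms(posenet=0.10656800, segnet=0.00569617)
current = distortion_terms(posenet=0.09384175, segnet=0.00575313)

delta = {name: current[name] - prior[name] for name in prior}
net = sum(delta.values())

for name, value in delta.items():
    print(f"{name:8s} {value:+.4f}")
print(f"{'net':8s} {net:+.4f}")
```

Under that assumption the PoseNet term falls by roughly 0.064 and the SegNet term rises by roughly 0.006, for a net change near -0.058, which rounds to the reported -0.06. The agreement is consistent with the assumed form but does not confirm it.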

Local neighborhood

This table isolates the local AV1 neighborhood around the promoted floor. It shows which nearby changes improved the result and which did not.

| Variant | Changed axis | Score | Bytes | Δ score | Δ bytes | Verdict |
| --- | --- | --- | --- | --- | --- | --- |
| crf 33 | crf 33 | 2.20 | 920,457 | | | promoted |
| crf 34 | crf 34 | 2.19 | 864,455 | -0.01 | -56,002 | promoted |
| crf 35 | crf 35 | 2.21 | 808,036 | +0.02 | -56,419 | rejected |
| unsharp 0.30 | postfilter unsharp=9:9:0.30:9:9:0.0 | 2.20 | 864,455 | -0.01 | +56,419 | rejected |
| film-grain 0 | film-grain 0 | 3.33 | 719,096 | +1.13 | -145,359 | rejected |
| 522x392 | geometry 522x392 | 2.23 | 862,238 | +0.04 | -2,217 | rejected |
| lanczos upscale | upscale lanczos | 2.18 | 864,455 | -0.01 | +0 | promoted |
| color hardening | explicit bt709/tv -> rgb24(pc) | 2.12 | 864,486 | -0.06 | +31 | promoted |
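
The verdicts in this neighborhood are consistent with a simple rule: a run is promoted only when its score is strictly below the current floor. The sketch below replays the rows in table order under that rule. The rule and the `replay` helper are inferred from the table and are hypothetical; the repository's actual promotion logic may use unrounded scores or further criteria.

```python
# Rows copied from the local-neighborhood table: (variant, score, archive bytes).
RUNS = [
    ("crf 33", 2.20, 920_457),
    ("crf 34", 2.19, 864_455),
    ("crf 35", 2.21, 808_036),
    ("unsharp 0.30", 2.20, 864_455),
    ("film-grain 0", 3.33, 719_096),
    ("522x392", 2.23, 862_238),
    ("lanczos upscale", 2.18, 864_455),
    ("color hardening", 2.12, 864_486),
]


def replay(runs):
    """Hypothetical rule: promote a run only if it scores strictly below the floor."""
    floor = None
    verdicts = []
    for variant, score, size in runs:
        if floor is None or score < floor[1]:
            floor = (variant, score, size)  # new operating point
            verdicts.append((variant, "promoted"))
        else:
            verdicts.append((variant, "rejected"))
    return floor, verdicts


floor, verdicts = replay(RUNS)
for variant, verdict in verdicts:
    print(f"{variant:16s} {verdict}")
print("floor:", floor)
```

Replaying the eight rows this way reproduces the Verdict column and ends on the color-hardening run at 2.12 and 864,486 bytes. Note that the rule compares scores only; a smaller archive at a worse score, such as crf 35, is still rejected.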


References

Turning points

  • Initial honest floor — robust_current-baseline-cpu-2026-04-03 (4.06)
  • 512x384 / crf23 — robust_current-medium23-cpu-2026-04-03 (3.62)
  • ROI branch rejection — robust_current-dynamic-main-roi-cpu-2026-04-05 (4.47)
  • Current floor — robust_current-av1-524x394-colorspace-hardening-promoted-cpu-2026-04-06 (2.12)

The landing page stays as the brief. Full detail lives in the linked artifacts.