Neural Video Codecs produce compressed bitstreams that are fundamentally different from H.264, HEVC, or AV1 — entropy-coded latent tensors, not NAL units or fMP4 boxes. Existing packaging formats can't represent them without unnecessary overhead and semantic mismatch.
NMSF extends MSF
(MOQT Streaming Format) with a new "nvc" packaging type,
following the same pattern as
CMSF for CMAF.
NVC tracks sit alongside LOC and CMAF tracks in the same MoQ Broadcast.
Neural keyframes (Intra) are the first Object in each MoQ Group. Delta frames (Inter) follow. Any MoQ relay forwards them without modification.
| Aspect | CMAF (CMSF) | NVC (NMSF) |
|---|---|---|
| Container | fMP4 (moof + mdat) | None (17-byte header) |
| Init segment | moov box | Catalog metadata only |
| Codec signaling | avc1.42c01e | dcvc-rt |
| Keyframe | IDR NAL unit | Neural Intra (context init) |
| Per-frame overhead | ~100-200 bytes | 17 bytes |
| Relay awareness | None | None |
Works with any NVC: DCVC-RT, SSF, FVC, RLVC. The packaging doesn't depend on the codec.
Standard MoQ relays forward NMSF Objects without modification. No plugins or codec awareness.
Every MoQ Group starts with an Intra frame. Subscribers begin decoding from any Group boundary.
NVC video tracks coexist with LOC and CMAF audio tracks in the same Broadcast catalog.
17-byte fixed header per Object. No container boxes, no padding, no alignment constraints.
IETF Internet-Draft. draft-herz-moq-nmsf-00
| Codec | Full Name | Status |
|---|---|---|
dcvc-rt | DCVC Real-Time (Microsoft) | Tested |
dcvc-fm | DCVC Feature Modulation | Compatible |
ssf | Scale-Space Flow | Compatible |
fvc | Feature-space Video Coding | Compatible |
elfvc | Efficient Learned Flexible VC | Compatible |
Pre-built Linux binaries for dcvc-pub and dcvc-sub
will be available with a push key from
moqcdn.net.
Requirements: Ubuntu 24.04, Intel Xeon (AMX recommended), OpenVINO 2025.1+, DCVC-RT model weights from Microsoft.