---
canonical: "https://yuanhaochen.dev/notes/semantic-video-meaning-layer"
path: "/notes/semantic-video-meaning-layer"
section: "Notes"
title: "Universal Semantic Video as a portable meaning layer"
language: "en"
agentUse: "summary, retrieval, citation, hiring evaluation"
---

# Universal Semantic Video as a portable meaning layer

Why a portable meaning layer can matter more than richer captions when media moves across AI and human tools.

The problem

Video workflows already have containers, captions, streams, and provenance standards, but they still struggle to carry object meaning, segment intent, speaker context, rights, consent rules, and fallback behavior through many AI and human tools.

When that meaning disappears, the next tool has to infer too much. The workflow becomes brittle even if the media file itself remains intact.

The artifact

Universal Semantic Video explores a sidecar approach: keep the media in existing delivery systems, then add a small validated semantic layer that can be inspected independently.

The useful boundary is that USV is not trying to become a codec, a player, or a hosted AI translation service. It is a portable meaning layer that can survive tool boundaries.

What changed

The project made the unit clearer for me. A richer caption is not enough. The inspectable object is the relationship between segment, speaker, object, right, fallback, and provenance.

Inspect the repository

https://github.com/89325516/universal-semantic-video
