Notes from Demuxed 2020

Last week was Demuxed, a conference about digital video from an engineering perspective. It was my third time attending and it’s still an exhilarating conference: every speaker brings unique perspectives and expertise; I can never predict which talks will impress me most.

The full conference is now available for viewing on YouTube, but here are a few talks and topics that stood out to me as especially interesting:

How WebRTC works (Sean DuBois, Amazon)

There are many introductions to the WebRTC APIs (Google has a nice one and MDN is our bible) but this talk is something else: an engineering overview describing the fundamental building blocks of WebRTC itself and how they fit together.

This talk fills some huge knowledge gaps. WebRTC is built on layers of existing protocols and technologies, and without a talk like this it’s difficult to understand the technology properly short of reading 20+ years of RFCs and connecting the dots. For the same reason, it’s a dense talk; I plan to rewatch it a few times myself.
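
For orientation, here’s a minimal sketch of the browser-side API that sits on top of all those layers. The STUN server URL is just a commonly used public example, and the signalling channel (how the offer and candidates reach the other peer) is left out entirely; the protocols underneath are exactly what the talk explains.

```typescript
// Minimal sketch: create a peer connection, open a data channel, and
// generate an SDP offer. Everything beneath this (ICE, DTLS, SRTP, ...)
// is what Sean's talk actually covers.
async function createOffer(): Promise<void> {
  const pc = new RTCPeerConnection({
    // A public STUN server, used here purely as an example.
    iceServers: [{ urls: "stun:stun.l.google.com:19302" }],
  });

  // A data channel is the simplest way to exercise the connection.
  const channel = pc.createDataChannel("chat");
  channel.onopen = () => channel.send("hello");

  // ICE candidates trickle in as network paths are discovered; each one
  // would normally be forwarded to the remote peer over your signalling channel.
  pc.onicecandidate = (event) => {
    if (event.candidate) {
      console.log("candidate:", event.candidate.candidate);
    }
  };

  // The offer is an SDP blob describing the media/data we want to exchange.
  const offer = await pc.createOffer();
  await pc.setLocalDescription(offer);
  console.log(offer.sdp);
}

createOffer();
```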

You can watch the talk on YouTube, and Sean has compiled a lot of this information into a community book called WebRTC for the Curious.

Editing video by editing text (Roderick Hodgson, Simon Says)

Roderick’s company produces transcripts for use in post-production, e.g. for editors to use as a guide for navigating a video, rather than relying solely on an A/V scrubber. Recently, he’s been working on a web-based tool for editorial staff to create rough edits from multiple video clips, simply by highlighting transcript text and then dragging and dropping to reorder clips into a narrative – think of the way interviews are stitched together in a documentary or news bulletin. These rough cuts can then be passed on to video editors (e.g. as FCPXML) for creative work.

Much of Roderick’s talk focused on solving the technical issue of representing clip selections in a web-based editor. This was interesting, but the product itself caught my imagination. It’s an innovative way to close the gap between two phases of editorial work.
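
To make the idea concrete, here’s a hypothetical sketch of how such a selection might be modelled (these names are mine, not Simon Says’): each highlighted run of transcript words carries word-level timings, which resolve to in/out points on the source clip.

```typescript
// Hypothetical data model for a transcript-driven rough cut.
// None of these names come from Simon Says; they're purely illustrative.
interface WordTiming {
  text: string;
  start: number; // seconds into the source clip
  end: number;
}

interface Selection {
  clipId: string;      // which source video the words belong to
  words: WordTiming[]; // the highlighted run of transcript words
}

// A rough cut is just an ordered list of selections; dragging text around
// in the UI amounts to reordering this array.
function toTimeline(selections: Selection[]) {
  return selections.map((s) => ({
    clipId: s.clipId,
    // The clip's in/out points fall out of the first and last selected word.
    inPoint: s.words[0].start,
    outPoint: s.words[s.words.length - 1].end,
  }));
}
```

An exporter could then walk this timeline and emit clip references (e.g. as FCPXML) for editors to refine.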

Watch Roderick’s talk on YouTube.

Video Vectorization (Sam Bhattacharyya, Vectorly)

Inspired by the Flash animations of yesteryear (Homestar Runner, anyone?), Sam’s been working on an SVG video codec: storing animation as vector graphics in MP4, then rendering it on the client using WebGL. Sam’s primarily interested in delivering complete animations in high quality at low bitrates, but he also mentioned how the idea could apply to video overlays or be used to deliver more readable and accessible titles at heavy compression levels.

This talk’s a window into a lot of deep and interesting topics (vectorization, custom codecs, a custom library to parse and render SVG via WebGL). Conversation afterward also pointed to an in-development SVG Streaming spec, which could make adoption of these ideas easier and more widespread in the future.
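
This isn’t Vectorly’s codec, but the core idea (vector frames keyed by time, rasterized on the client) can be sketched with a plain 2D canvas; their real pipeline packs the vector data into MP4 and renders it with WebGL instead.

```typescript
// Naive sketch of time-keyed vector frames drawn on a 2D canvas.
// Vectorly's actual approach stores vector data in MP4 and renders via WebGL.
interface VectorFrame {
  time: number; // seconds from the start of the animation
  svg: string;  // a complete <svg> document for this frame
}

function play(frames: VectorFrame[], canvas: HTMLCanvasElement): void {
  const ctx = canvas.getContext("2d")!;
  const start = performance.now();

  const tick = async () => {
    const t = (performance.now() - start) / 1000;
    // Show the latest frame whose timestamp has passed (frames assumed sorted).
    const frame = [...frames].reverse().find((f) => f.time <= t);
    if (frame) {
      // Rasterize the SVG by loading it as a data-URL image (no caching here;
      // a real renderer would decode each frame only once).
      const img = new Image();
      img.src =
        "data:image/svg+xml;charset=utf-8," + encodeURIComponent(frame.svg);
      await img.decode();
      ctx.clearRect(0, 0, canvas.width, canvas.height);
      ctx.drawImage(img, 0, 0, canvas.width, canvas.height);
    }
    requestAnimationFrame(tick);
  };
  requestAnimationFrame(tick);
}
```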

Watch Sam’s talk on YouTube.

Audio Description (Jun Heider, RealEyes)

Audio description is an industry term for narrating video to describe visual content for accessibility purposes, or to make it available in new contexts, e.g. watching a TV show while doing the dishes.

Jun has been compiling a list of standards and other resources to help organizations get started on the path to producing more audio description content.

Watch and hear Jun’s talk on YouTube.

How to live stream a LaserDisc (Vanessa Pyne, Daily.co)

This talk’s as fun as it is fascinating! I hadn’t realized just how analog LaserDisc is: its video is stored entirely as an analog FM signal, stamped into the disc as pits and lands that have to be read back at a precise rate, like an optical record player with a laser for a needle.

Watch Vanessa’s talk on YouTube.

Resources

Finally, here are a few of the links I scribbled down without context, mostly from lightning talks and side conversations:

  • Awesome Video - an awesome list of learning resources, tools, libraries, and much more about video tech
  • Media Fragments - did you know there’s a standard for deep-linking to specific times, tracks, and even for cropping media via URL? (see the short example after this list)
  • HLSQ - CLI client that renders HLS manifests with syntax highlighting and some basic filtering (think jq for video streams)
  • “Simultaneous Interpreting” by Lynn Visson - came up in a conversation about stenography; strictly tangential but super interesting
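
As promised above, here’s what a Media Fragment looks like in practice. The URL is hypothetical, and browser support varies (the #t= form is the most widely implemented), so treat this as a quick illustration rather than a compatibility guarantee.

```typescript
// Media Fragment URIs are plain URL fragments that the user agent interprets.
// #t=10,20 asks for playback of seconds 10 through 20 only.
const video = document.createElement("video");
video.src = "https://example.com/talk.mp4#t=10,20"; // hypothetical URL
video.controls = true;
document.body.appendChild(video);

// The spec also defines track selection (#track=) and spatial cropping (#xywh=),
// though native browser support for those is far more limited than for #t=.
```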

If you’re interested in video or just want a new perspective on engineering problems, I can’t recommend this conference enough.

Questions/corrections? Reach out!