All Glossary Terms
GlossaryDefinition

Audio Synchronization Definition

Audio synchronization is the process of aligning separately recorded audio tracks with video footage using timecode matching, waveform analysis, or manual slate alignment to ensure lip-sync accuracy.

Why audio synchronization matters

Professional video production frequently records audio separately from video. On-camera microphones capture ambient sound and reference audio, while dedicated sound recorders capture high-quality audio from lavalier microphones, boom operators, and room mics. This dual-system recording produces superior audio quality but creates a synchronization challenge — the audio and video must be precisely aligned in post-production.

Even a single frame of sync drift is perceptible to viewers as a disconnect between lip movement and speech. At 24 fps, that is roughly 42 milliseconds — a threshold where audiences unconsciously sense something is wrong even if they cannot articulate what. Professional sync must be frame-accurate at minimum, and ideally sub-frame accurate for demanding content like music performances.

For teams managing large volumes of dual-system footage — documentary productions, multi-camera events, episodic series — manual synchronization is prohibitively time-consuming. A production shooting 10 hours of footage across 4 cameras with separate audio generates hundreds of individual sync relationships that must be established correctly.

Best practices

Use timecode as your primary sync reference. When cameras and audio recorders share jam-synced timecode, synchronization is automatic and frame-accurate. Invest in timecode synchronization hardware (Tentacle Sync, Deity TC-1, or similar) for any production using dual-system audio. The cost is trivial compared to the time saved in post.

Always capture a reference audio track on camera even when using separate recorders. This provides a waveform-matching fallback when timecode is unavailable or has drifted. Clap slates serve the same purpose visually — the spike in audio waveform aligns with the visual frame where the slate closes.

Verify sync periodically throughout long recordings. Timecode generators can drift relative to each other over hours. Checking sync every 30-60 minutes during long-form recording identifies drift before it becomes severe, and known drift points allow editors to apply correction.

How ShotAI relates

ShotAI preserves audio-video synchronization relationships in its index, ensuring that search results pointing to specific moments reflect both the visual and audio content at that precise timecode.

Related Terms

Written by the ShotAI team. Last updated May 2026.

今日からShotAIを無料で始めましょう