AI highlight detection for gaming streams
Don\u2019t scrub through hours of footage. Clippper\u2019s multi-signal pipeline analyzes audio, motion, transcript, and silence patterns to find your best moments automatically.
Four signals, one decision
Most tools use a single metric (like audio volume) to find highlights. Clippper combines four independent signals for better accuracy.
Audio energy
Per-second RMS and VAD detect moments when you or your chat get loud — clutch plays, team wipes, rage moments. Rolling z-scores flag spikes against the baseline.
Visual motion
Frame-by-frame difference analysis at 1 fps detects action sequences. Fast-moving gameplay, camera shakes, or sudden scene changes register as motion peaks.
Transcript analysis
Word-level Deepgram output flags reaction words ("let’s go", "no way", "oh my god"), sentence boundaries for natural clip start/end, and pauses that indicate moment breaks.
Silence detection
FFmpeg silencedetect identifies quiet stretches. These serve as natural clip boundaries and help avoid cutting mid-sentence or mid-action.
The highlight detection pipeline
From raw video to scored clips in five steps.
- 1
Signal extraction
FFmpeg extracts 16 kHz mono audio. Per-second analysis computes RMS energy, VAD (voice activity), motion from frame differences, silence detection, and rolling z-scores. Output: a timeline of raw signals.
- 2
Annotation
Signals merge with word-level transcript data from Deepgram. Each second gets tags: excitement peaks, audio spikes, action peaks, reaction words, sentence boundaries, pauses, and scene labels (active gameplay, talking, idle).
- 3
Candidate generation
Weighted trigger tags are clustered. Nearby triggers merge into candidate windows. Sentence and pause boundaries refine start/end points. Windows target 30–90 seconds with natural openings and closings.
- 4
LLM scoring
Each candidate window is scored by an LLM for hook quality, energy, narrative arc, and viral potential. Sub-scores are weighted and combined. Candidates below the threshold are filtered out.
- 5
Extraction
Overlapping candidates are deduplicated. Clip count scales with video length. FFmpeg extracts each clip as a separate MP4, with padding applied for natural starts and endings.
Register and get 10 credits per week
No credit card required. Start clipping in minutes.
AI highlight detection FAQ
How Clippper finds the best moments in your gaming VODs.
