Building Video Content Moderation with Amazon Rekognition - Ensuring Safety on UGC Platforms

Automate video moderation on UGC platforms and streamline media workflows with face search and segment detection. Learn how to build event-driven pipelines with S3 and Lambda.

Rekognition Video API Overview

Rekognition's Video API asynchronously analyzes video files stored in S3. You initiate analysis with Start APIs such as StartContentModeration, StartLabelDetection, and StartFaceSearch, and a completion notification is sent to an SNS topic. After completion, you retrieve results using the corresponding Get API. Results include per-frame timestamps, allowing you to pinpoint exactly when something was detected in the video. Supported video formats are H.264-encoded MOV and MP4, with a maximum file size of 10 GB and a maximum video length of 6 hours.
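The Start/Get flow described above might look like the following minimal sketch. It assumes `client` is a boto3 Rekognition client (for example `boto3.client("rekognition")`); the helper names `start_moderation` and `iter_moderation_labels`, and the default `min_confidence` of 50, are illustrative choices rather than anything prescribed by the service.

```python
from typing import Any, Dict, Iterator


def start_moderation(client: Any, bucket: str, key: str,
                     topic_arn: str, role_arn: str,
                     min_confidence: float = 50.0) -> str:
    """Start an asynchronous moderation job and return its JobId."""
    response = client.start_content_moderation(
        Video={"S3Object": {"Bucket": bucket, "Name": key}},
        MinConfidence=min_confidence,
        # Rekognition publishes the completion status to this SNS topic,
        # using the IAM role given here.
        NotificationChannel={"SNSTopicArn": topic_arn, "RoleArn": role_arn},
    )
    return response["JobId"]


def iter_moderation_labels(client: Any, job_id: str) -> Iterator[Dict[str, Any]]:
    """Yield every detection of a finished job, following NextToken pages."""
    kwargs: Dict[str, Any] = {"JobId": job_id, "SortBy": "TIMESTAMP"}
    while True:
        page = client.get_content_moderation(**kwargs)
        if page["JobStatus"] != "SUCCEEDED":
            raise RuntimeError(f"job {job_id} is {page['JobStatus']}")
        # Each item carries a Timestamp (ms from video start) and a
        # ModerationLabel with Name and Confidence.
        yield from page.get("ModerationLabels", [])
        token = page.get("NextToken")
        if not token:
            return
        kwargs["NextToken"] = token
```

Passing the client in as a parameter keeps the helpers easy to exercise with a stub. In practice `iter_moderation_labels` would be called only after the SNS notification reports that the job has finished.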

Implementing Content Moderation

UGC (user-generated content) platforms need to automatically check whether user-uploaded videos contain inappropriate content. You can build a pipeline that triggers a Lambda function on S3 video upload to call StartContentModeration, detects completion via SNS notification, and evaluates the results. Detection results include inappropriate content categories (Explicit Nudity, Violence, Visually Disturbing, etc.) and confidence scores. Set a confidence threshold and design a workflow that automatically unpublishes videos exceeding the threshold or routes them to a human review queue.
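As a rough sketch of that pipeline, the code below shows the two pieces of logic a Lambda function would need: starting a job for each uploaded object in an S3 event, and turning the detections into a decision. The thresholds (90 and 60), the action names, and the function names are all hypothetical; `client` is assumed to be a boto3 Rekognition client, and `detections` is assumed to be the `ModerationLabels` list returned by GetContentModeration.

```python
import urllib.parse
from typing import Any, Dict, Iterable, List, Tuple

BLOCK_THRESHOLD = 90.0   # hypothetical: unpublish at or above this confidence
REVIEW_THRESHOLD = 60.0  # hypothetical: human review at or above this confidence


def videos_from_s3_event(event: Dict[str, Any]) -> List[Tuple[str, str]]:
    """Extract (bucket, key) pairs from an S3 event notification."""
    pairs = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = urllib.parse.unquote_plus(record["s3"]["object"]["key"])
        pairs.append((bucket, key))
    return pairs


def handle_upload(event: Dict[str, Any], client: Any,
                  topic_arn: str, role_arn: str) -> List[str]:
    """Start one moderation job per uploaded video; return the JobIds."""
    job_ids = []
    for bucket, key in videos_from_s3_event(event):
        response = client.start_content_moderation(
            Video={"S3Object": {"Bucket": bucket, "Name": key}},
            NotificationChannel={"SNSTopicArn": topic_arn, "RoleArn": role_arn},
        )
        job_ids.append(response["JobId"])
    return job_ids


def decide(detections: Iterable[Dict[str, Any]]) -> Dict[str, Any]:
    """Map detections to 'unpublish', 'review', or 'approve'."""
    # The single most confident detection drives the decision.
    top = max(detections,
              key=lambda d: d["ModerationLabel"]["Confidence"],
              default=None)
    if top is None:
        return {"action": "approve", "label": None, "confidence": 0.0}
    label = top["ModerationLabel"]
    if label["Confidence"] >= BLOCK_THRESHOLD:
        action = "unpublish"
    elif label["Confidence"] >= REVIEW_THRESHOLD:
        action = "review"
    else:
        action = "approve"
    return {"action": action, "label": label["Name"],
            "confidence": label["Confidence"],
            "timestamp_ms": top["Timestamp"]}
```

Using two thresholds rather than one is a design choice: only high-confidence detections trigger automatic unpublishing, while borderline cases go to reviewers. Per-category thresholds would be a natural extension.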

Face Search and Segment Detection

By pre-registering face images in a face collection, StartFaceSearch can detect scenes in a video where registered faces appear. This is useful for media companies searching for scenes featuring specific performers, or for security applications tracking specific individuals. Segment detection (StartSegmentDetection) analyzes video structure, automatically identifying shot boundaries (camera cuts), black frames, color bars, slates, studio logos, and end credits. It's effective for automating video editing at broadcast stations and streaming services, and for detecting ad insertion points.
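To make the two result shapes concrete, here is a minimal sketch of post-processing. It assumes `persons` is the `Persons` list from GetFaceSearch and `segments` is the `Segments` list from GetSegmentDetection. The function names, the similarity cutoff of 90, and the 2-second merge gap are illustrative assumptions, and treating black-frame runs as ad-break candidates is one simple heuristic, not a built-in feature.

```python
from collections import defaultdict
from typing import Any, Dict, List, Tuple

Interval = Tuple[int, int]  # (start_ms, end_ms)


def appearance_intervals(persons: List[Dict[str, Any]],
                         min_similarity: float = 90.0,
                         max_gap_ms: int = 2000) -> Dict[str, List[Interval]]:
    """Group per-timestamp face matches into continuous appearances."""
    stamps_by_face: Dict[str, List[int]] = defaultdict(list)
    for person in persons:
        for match in person.get("FaceMatches") or []:
            if match["Similarity"] < min_similarity:
                continue
            face = match["Face"]
            # Prefer the ID assigned when the face was indexed, if any.
            name = face.get("ExternalImageId") or face["FaceId"]
            stamps_by_face[name].append(person["Timestamp"])

    result: Dict[str, List[Interval]] = {}
    for name, stamps in stamps_by_face.items():
        stamps.sort()
        intervals = [[stamps[0], stamps[0]]]
        for t in stamps[1:]:
            if t - intervals[-1][1] <= max_gap_ms:
                intervals[-1][1] = t        # extend the current appearance
            else:
                intervals.append([t, t])    # start a new appearance
        result[name] = [(start, end) for start, end in intervals]
    return result


def black_frame_breaks(segments: List[Dict[str, Any]]) -> List[Interval]:
    """Return black-frame segments as candidate ad insertion points."""
    return [
        (seg["StartTimestampMillis"], seg["EndTimestampMillis"])
        for seg in segments
        if seg["Type"] == "TECHNICAL_CUE"
        and seg["TechnicalCueSegment"]["Type"] == "BlackFrames"
    ]
```

The merge gap matters because face search reports matches at sampled timestamps rather than as continuous ranges, so nearby hits are joined into a single scene.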

Rekognition Video Pricing

Rekognition Video API pricing is based on the number of minutes of video processed. Content moderation costs approximately $0.12 per minute, label detection approximately $0.10 per minute, and face search approximately $0.10 per minute. At that rate, running content moderation on a 10-minute video costs about $1.20 (10 × $0.12). For UGC platforms processing large volumes of video, two optimizations are effective: setting appropriate confidence thresholds to reduce unnecessary analysis, and, for short videos (under 30 seconds), analyzing thumbnails with the Image API instead of running the Video API.
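The arithmetic can be captured in a small estimator. This is a sketch only: the rates are the approximate figures quoted above, the function name is hypothetical, and it assumes cost scales linearly with duration, which may not match how AWS actually rounds billed minutes.

```python
from decimal import Decimal
from typing import Iterable

# Approximate per-minute rates quoted in this article (USD).
RATES_PER_MINUTE = {
    "content_moderation": Decimal("0.12"),
    "label_detection": Decimal("0.10"),
    "face_search": Decimal("0.10"),
}


def estimate_cost(duration_seconds: int, apis: Iterable[str]) -> Decimal:
    """Estimate the cost of running the given analyses on one video."""
    minutes = Decimal(duration_seconds) / 60
    total = sum((RATES_PER_MINUTE[api] * minutes for api in apis), Decimal("0"))
    return total.quantize(Decimal("0.01"))  # round to whole cents


# A 10-minute video, content moderation only.
print(estimate_cost(600, ["content_moderation"]))  # 1.20
```

Multiplying the result by the expected monthly upload count gives a first-order budget, which makes it easier to judge whether thumbnail-based analysis for short clips is worth the extra complexity.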

Summary

The Rekognition Video API enables serverless, automated analysis of video content. Content moderation helps keep UGC platforms safe, while face search and segment detection automate media workflows. By combining S3 and Lambda in an event-driven architecture, you can fully automate the pipeline from video upload to analysis completion.