AI Production Engineer
Zero-touch: text prompt to upscaled, interpolated, character-consistent video, fully automated.
Fully automated content pipeline, concept to ready-to-post video, no manual intervention.
5-stage pipeline via ComfyUI API. PuLID + IP-Adapter for dual consistency.
Hundreds of outputs, one recognizable person, across months of production.
Stage 1 text-to-image (LoRA), Stage 2 image-to-video (WAN 2.2/AnimateDiff), Stage 3 4x upscale (ESRGAN before interpolation, correct order), Stage 4 RIFE interpolation (24fps to 48/96fps), Stage 5 quality gate (automated character sheet comparison).
PuLID handles face identity, IP-Adapter handles visual style. Separating these allows style to vary without drifting character.