ai-avatar-video
Skill from runcomfy-com/skills
Create AI avatar, talking-head, and lip-sync videos on RunComfy, routing across OmniHuman for full-body audio-driven avatars, Wan 2-7 for mouth sync, HappyHorse for native in-pass audio, and Seedance v2 Pro for multi-modal cinematic generation.
Overview
AI Avatar & Talking Head Video is a Claude Code skill that creates audio-driven avatar, talking-head, and lip-sync videos through the RunComfy CLI. It routes each request to one of five models based on user intent: OmniHuman for full-body audio-driven avatars from a portrait plus an audio file, Wan 2-7 for open-weights lip-sync with full scene control, Wan 2-2 Animate for stylized or illustrated character animation, HappyHorse 1.0 for script-to-video without a pre-recorded audio file, and Seedance v2 Pro for multi-modal cinematic compositions that combine reference images, videos, and audio.
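The routing described above can be pictured as a small decision function. The following is a minimal sketch, not the skill's actual implementation: the `Intent` fields, the `pick_route` name, and above all the order in which the checks run are assumptions made for illustration. Only the five model names and their use cases come from the description.

```python
from dataclasses import dataclass


@dataclass
class Intent:
    """Hypothetical summary of what the user asked for (field names are assumptions)."""
    has_audio_file: bool            # a pre-recorded audio track is supplied
    stylized: bool = False          # illustrated/stylized subject rather than photoreal
    scene_control: bool = False     # user wants to direct the scene around the speaker
    multi_reference: bool = False   # several reference images, videos, or audio tracks


def pick_route(intent: Intent) -> str:
    """Return the model name for an intent. The precedence of the checks is assumed."""
    if intent.multi_reference:
        return "Seedance v2 Pro"    # multi-modal cinematic composition
    if not intent.has_audio_file:
        return "HappyHorse 1.0"     # script only, no audio file
    if intent.stylized:
        return "Wan 2-2 Animate"    # stylized or illustrated characters
    if intent.scene_control:
        return "Wan 2-7"            # lip-sync with scene control
    return "OmniHuman"              # default: portrait plus audio file


if __name__ == "__main__":
    print(pick_route(Intent(has_audio_file=True)))
    print(pick_route(Intent(has_audio_file=False)))
```

In the skill itself the classification is performed by the agent reading the user's request, so a real request may not reduce to four booleans as cleanly as this sketch suggests.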
Key Features
- Five specialized routes: OmniHuman (default, portrait + audio file), Wan 2-7 (scene control + custom audio), Wan 2-2 Animate (stylized/illustrated characters), HappyHorse 1.0 (script-only, no audio file needed), and Seedance v2 Pro (up to 9 reference images, 3 reference videos, 3 reference audio tracks)
- Intent-based model selection: Automatically classifies whether the user has a pre-recorded audio file or only a script, whether the subject is photoreal or stylized, and whether the output needs single-shot simplicity or cinematic composition
- Multi-language dubbing support: Use the same portrait with different audio files per language to create multi-language brand videos with consistent identity across all variants
- Chaining with other skills: Combine with image generation skills to first create a portrait, then animate it as a talking head with OmniHuman or extend the result using video-extend
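The multi-language dubbing feature listed above amounts to holding the portrait fixed and varying only the audio track. The sketch below builds one job description per language. It is illustrative only: the `plan_dubbing_jobs` helper, the dictionary keys, and the output naming scheme are assumptions and do not reflect the RunComfy CLI's real input schema.

```python
from pathlib import Path


def plan_dubbing_jobs(portrait: str, audio_by_language: dict[str, str]) -> list[dict]:
    """Build one hypothetical job spec per language, reusing the same portrait."""
    jobs = []
    for language, audio in sorted(audio_by_language.items()):
        jobs.append({
            "route": "OmniHuman",      # default route for portrait + audio file
            "portrait": portrait,      # identical across all variants
            "audio": audio,            # the only input that changes per language
            "language": language,
            # Assumed naming scheme: <portrait stem>_<language>.mp4
            "output_name": f"{Path(portrait).stem}_{language}.mp4",
        })
    return jobs


if __name__ == "__main__":
    for job in plan_dubbing_jobs("presenter.png", {"en": "vo_en.wav", "de": "vo_de.wav"}):
        print(job["language"], job["audio"], job["output_name"])
```

Keeping the portrait constant in every job is what gives the variants a consistent identity. Each job would then be submitted through the skill one at a time.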
Who is this for?
- Marketing teams creating UGC-style product ads and virtual presenter videos with custom voiceovers
- Content creators building multi-language dubbed videos from a single portrait and swapped audio tracks
- Developers and agencies seeking a programmable alternative to HeyGen or Synthesia for automated avatar video generation
- Animators working with illustrated or stylized characters who need audio-synchronized full-body motion
Same repository
runcomfy-com/skills (30 items)
Installation
npx vibeindex add runcomfy-com/skills --skill ai-avatar-video
npx skills add runcomfy-com/skills --skill ai-avatar-video
~/.claude/skills/ai-avatar-video/SKILL.md
More from this repository (10)
Generate and edit images on RunComfy via a smart router across 11+ AI models including FLUX 2, Nano Banana 2, GPT Image 2, Seedream 5, and Qwen Image. Covers both text-to-image and image-to-image endpoints, automatically selecting the best model for the user's intent.
A Claude Code skill that animates still images into video on RunComfy, routing to HappyHorse 1.0 for general animations, Wan 2.7 for audio-driven lip-sync, or Seedance 2.0 Pro for multi-modal composition based on user intent.
A Claude Code skill for generating and editing images with OpenAI GPT Image 2 (ChatGPT Images 2.0) via the RunComfy API, excelling at embedded text, multilingual typography, logos, and directive precision for layout-critical imagery.
A Claude Code skill that generates custom OpenAI Codex Pets on RunComfy, turning a single reference image into a Codex-compatible spritesheet and pet.json using GPT Image 2 edit plus ImageMagick transforms, requiring only a RunComfy token.
A Claude Code skill for editing existing videos on RunComfy, routing to Wan 2.7 Edit-Video for general restyle and background swap, Kling 2.6 Pro Motion Control for precise motion transfer, or Lucy Edit Restyle for lightweight identity-stable outfit swaps.
A Claude Code skill that generates full songs and instrumental tracks with ElevenLabs Music on RunComfy, producing studio-quality 44.1 kHz stereo audio from 5 seconds to 5 minutes with section-level control for intros, verses, choruses, and bridges.
A Claude Code skill for region-based video editing on RunComfy, enabling object removal, watermark cleanup, and motion-matched region replacement across video frames using Wan 2-7 Edit-Video and other endpoints.
Generate AI videos on RunComfy through a smart router across the full video-model catalog including HappyHorse 1.0, Wan 2-7, Seedance v2, Kling 3.0, Veo 3-1, and Hailuo 2-3. Supports text-to-video, image-to-video, and video-extend modes with automatic model selection based on user intent.
A Claude Code skill for pose-conditioned image and video generation on RunComfy, routing across Kling Motion Control Pro for motion transfer, Wan 2-2 Animate for audio-driven character animation, and Z-Image Turbo for pose-conditioned image generation.
A Claude Code skill for extending or continuing existing video clips using Google Veo 3-1 extend-video endpoints via the RunComfy CLI, enabling shot-by-shot narrative chaining with consistent motion, lighting, and subject identity.