🎯

ai-avatar-video

🎯Skill

from runcomfy-com/skills

VibeIndex|
What it does
|

Create AI avatar, talking-head, and lip-sync videos on RunComfy, routing across OmniHuman for full-body audio-driven avatars, Wan 2-7 for mouth sync, HappyHorse for native in-pass audio, and Seedance v2 Pro for multi-modal cinematic generation.

Overview

AI Avatar & Talking Head Video is a Claude Code skill that creates audio-driven avatar, talking-head, and lip-sync videos through the RunComfy CLI. It intelligently routes across five models based on user intent: OmniHuman for full-body audio-driven avatars from a portrait plus audio file, Wan 2-7 for open-weights lip-sync with full scene control, Wan 2-2 Animate for stylized/illustrated character animation, HappyHorse 1.0 for script-to-video without a pre-recorded audio file, and Seedance v2 Pro for multi-modal cinematic compositions combining reference images, videos, and audio.

Key Features

  • Five specialized routes: OmniHuman (default, portrait + audio file), Wan 2-7 (scene control + custom audio), Wan 2-2 Animate (stylized/illustrated characters), HappyHorse 1.0 (script-only, no audio file needed), and Seedance v2 Pro (up to 9 reference images, 3 reference videos, 3 reference audio tracks)
  • Intent-based model selection: Automatically classifies whether the user has a pre-recorded audio file or only a script, whether the subject is photoreal or stylized, and whether the output needs single-shot simplicity or cinematic composition
  • Multi-language dubbing support: Use the same portrait with different audio files per language to create multi-language brand videos with consistent identity across all variants
  • Chaining with other skills: Combine with image generation skills to first create a portrait, then animate it as a talking head with OmniHuman or extend the result using video-extend

Who is this for?

  • Marketing teams creating UGC-style product ads and virtual presenter videos with custom voiceovers
  • Content creators building multi-language dubbed videos from a single portrait and swapped audio tracks
  • Developers and agencies seeking a programmable alternative to HeyGen or Synthesia for automated avatar video generation
  • Animators working with illustrated or stylized characters who need audio-synchronized full-body motion
📦

Same repository

runcomfy-com/skills(30 items)

ai-avatar-video

Installation

Vibe Index InstallInstalls to .claude/skills/
npx vibeindex add runcomfy-com/skills --skill ai-avatar-video
skills.sh Install⚠ Installs to .agents/skills/
npx skills add runcomfy-com/skills --skill ai-avatar-video
Manual InstallCopy SKILL.md content and save to the path below
~/.claude/skills/ai-avatar-video/SKILL.md

SKILL.md

132,941Installs
-
AddedMay 18, 2026

More from this repository10

🎯
ai-image-generation🎯Skill

Generate and edit images on RunComfy via a smart router across 11+ AI models including FLUX 2, Nano Banana 2, GPT Image 2, Seedream 5, and Qwen Image. Covers both text-to-image and image-to-image endpoints, automatically selecting the best model for the user's intent.

🎯
image-to-video🎯Skill

A Claude Code skill that animates still images into video on RunComfy, routing to HappyHorse 1.0 for general animations, Wan 2.7 for audio-driven lip-sync, or Seedance 2.0 Pro for multi-modal composition based on user intent.

🎯
gpt-image-2🎯Skill

A Claude Code skill for generating and editing images with OpenAI GPT Image 2 (ChatGPT Images 2.0) via the RunComfy API, excelling at embedded text, multilingual typography, logos, and directive precision for layout-critical imagery.

🎯
codex-pet🎯Skill

A Claude Code skill that generates custom OpenAI Codex Pets on RunComfy, turning a single reference image into a Codex-compatible spritesheet and pet.json using GPT Image 2 edit plus ImageMagick transforms, requiring only a RunComfy token.

🎯
video-edit🎯Skill

A Claude Code skill for editing existing videos on RunComfy, routing to Wan 2.7 Edit-Video for general restyle and background swap, Kling 2.6 Pro Motion Control for precise motion transfer, or Lucy Edit Restyle for lightweight identity-stable outfit swaps.

🎯
elevenlabs-music-generation🎯Skill

A Claude Code skill that generates full songs and instrumental tracks with ElevenLabs Music on RunComfy, producing studio-quality 44.1 kHz stereo audio from 5 seconds to 5 minutes with section-level control for intros, verses, choruses, and bridges.

🎯
video-inpainting🎯Skill

A Claude Code skill for region-based video editing on RunComfy, enabling object removal, watermark cleanup, and motion-matched region replacement across video frames using Wan 2-7 Edit-Video and other endpoints.

🎯
ai-video-generation🎯Skill

Generate AI videos on RunComfy through a smart router across the full video-model catalog including HappyHorse 1.0, Wan 2-7, Seedance v2, Kling 3.0, Veo 3-1, and Hailuo 2-3. Supports text-to-video, image-to-video, and video-extend modes with automatic model selection based on user intent.

🎯
controlnet-pose🎯Skill

A Claude Code skill for pose-conditioned image and video generation on RunComfy, routing across Kling Motion Control Pro for motion transfer, Wan 2-2 Animate for audio-driven character animation, and Z-Image Turbo for pose-conditioned image generation.

🎯
video-extend🎯Skill

A Claude Code skill for extending or continuing existing video clips using Google Veo 3-1 extend-video endpoints via the RunComfy CLI, enabling shot-by-shot narrative chaining with consistent motion, lighting, and subject identity.