ai-avatar-video

Overview

AI Avatar & Talking Head Video is a Claude Code skill for creating audio-driven avatar and talking-head videos through the RunComfy CLI. It routes across multiple model endpoints: ByteDance OmniHuman for full-body audio-driven avatars from a single portrait plus audio file, Wan-AI Wan 2-7 for audio-driven mouth sync on portraits, HappyHorse 1.0 for text/image-to-video with in-pass audio, and Seedance v2 Pro for multi-modal cinematic generation with reference audio and subject. The skill classifies user intent (UGC voiceover, virtual presenter, dubbed product demo, lip-synced character, dialog scene) and selects the appropriate model.

Key Features

Intent-based model selection - Automatically routes to the right model based on what you are building: OmniHuman for full-body avatar from portrait + audio, Wan 2-7 for mouth-sync on existing portraits, HappyHorse for text/image-to-video with audio, Seedance v2 Pro for cinematic multi-modal generation.
Portrait-to-video generation - Feed a single portrait image and an audio file to produce a video where the subject speaks, sings, or gestures naturally with full head, mouth, and body movement.
Multiple quality tiers - Choose between premium endpoints for hero-quality output and faster/cheaper tiers for iteration and drafting, with the skill guiding the selection based on your stated use case.
Documented prompting patterns - Each model route ships with its documented prompting format and the exact runcomfy run invocation, removing guesswork from API parameter construction.
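The routing described in the feature list can be pictured as a simple lookup. The sketch below is illustrative only: the function name `route_model` and the goal labels (`full-body-avatar`, `mouth-sync`, `video-with-audio`, `cinematic`) are hypothetical, and the skill's real intent classifier is not shown in the source. It prints the model family named above, not a RunComfy endpoint ID.

```shell
#!/bin/sh
# Hypothetical helper: map a coarse goal label to the model family that the
# feature list associates with it. Labels and function name are assumptions
# made for illustration; they are not part of the skill or the RunComfy CLI.
route_model() {
  case "$1" in
    full-body-avatar) echo "OmniHuman" ;;        # portrait + audio, full-body motion
    mouth-sync)       echo "Wan 2-7" ;;          # mouth sync on an existing portrait
    video-with-audio) echo "HappyHorse 1.0" ;;   # text/image-to-video with in-pass audio
    cinematic)        echo "Seedance v2 Pro" ;;  # multi-modal cinematic generation
    *)
      echo "unknown goal: $1" >&2
      return 1
      ;;
  esac
}
```

Calling `route_model mouth-sync` prints the model family for that goal; an unrecognized label returns a non-zero status so a calling script can fall back to asking the user.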

Who is this for?

Marketing teams and content creators producing UGC-style voiceover videos, virtual presenter content, or dubbed product demos at scale
Developers building automated video generation pipelines that need audio-driven talking-head output from static portrait images
Educators and communicators who want to generate speaking avatar videos from scripts and portrait photos without filming

Installation

Vibe Index Install: installs to .claude/skills/

npx vibeindex add agentspace-so/runcomfy-agent-skills --skill ai-avatar-video

skills.sh Install: ⚠ installs to .agents/skills/

npx skills add agentspace-so/runcomfy-agent-skills --skill ai-avatar-video

Manual Install: copy the SKILL.md content and save it to the path below

~/.claude/skills/ai-avatar-video/SKILL.md
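The manual step can be scripted. This is a minimal sketch, assuming you have already saved the SKILL.md content to a local file; the function name `install_skill` and its optional second argument (an alternate skills directory) are hypothetical conveniences, not part of any installer.

```shell
#!/bin/sh
# Hypothetical helper: copy a locally saved SKILL.md into the skill directory.
#   $1 = path to the saved SKILL.md
#   $2 = skills base directory (optional, defaults to ~/.claude/skills)
install_skill() {
  src="$1"
  base="${2:-$HOME/.claude/skills}"
  dest="$base/ai-avatar-video"
  # Create the skill directory if it is missing, then copy the file into place.
  mkdir -p "$dest" && cp "$src" "$dest/SKILL.md"
}

# Example usage (assumes ./SKILL.md exists):
#   install_skill ./SKILL.md
```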

# AI Avatar & Talking Head Video

Put words in a face. This skill routes across RunComfy's audio-driven avatar models (OmniHuman, Wan 2-7 with `audio_url`, HappyHorse, Seedance v2), picking the right path for the user's intent and shipping the documented prompts plus the exact `runcomfy run` invocation for each.

[runcomfy.com](https://www.runcomfy.com/?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video) · [Lip-sync feature](https://www.runcomfy.com/models/feature/lip-sync?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video) · [CLI docs](https://docs.runcomfy.com/cli/introduction?utm_source=skills.sh&utm_medium=skill&utm_campaign=ai-avatar-video)
