doany-ai

doany-ai/skills

29 resources in this repository

GitHub
🎯29

🎯Skills29

🎯image-outpainting🎯Skill

A Claude Code skill for image outpainting on RunComfy that extends images beyond their original canvas, supporting aspect ratio changes, uncropping, and canvas expansion by routing across Nano Banana 2 Edit, GPT Image 2 Edit, and FLUX Kontext Pro.

image-outpainting
🎯controlnet-pose🎯Skill

A Claude Code skill for pose-conditioned image and video generation on RunComfy, routing across Kling 2-6 Motion Control (video motion transfer), Wan 2-2 Animate (audio-driven character animation), and Z-Image Turbo ControlNet LoRA (pose-conditioned image generation).

controlnet-pose
🎯ai-avatar-video🎯Skill

A Claude Code skill that creates AI avatar and talking-head videos on RunComfy, intelligently routing across OmniHuman, Wan 2-7, HappyHorse, and Seedance models based on user intent such as UGC voiceover, virtual presenter, or lip-synced character.

ai-avatar-video
🎯nano-banana-2🎯Skill

A Claude Code skill for generating images with Google Nano Banana 2, a Gemini-family flash-tier text-to-image model hosted on RunComfy, optimized for rapid iteration, social thumbnails, and in-image typography rendering.

nano-banana-2
🎯video-outpainting🎯Skill

Extend a video's spatial canvas on RunComfy β€” uncrop, change aspect ratio (e.g., 9:16 to 16:9), or add environment beyond the original frame while preserving the central action. Routes through Wan 2-7 edit-video and dedicated ComfyUI outpaint workflows.

video-outpainting
🎯video-extend🎯Skill

A Claude Code skill that extends existing video clips on RunComfy using Google Veo 3-1's extend-video endpoints, continuing clips past their duration cap or chaining narrative shots while preserving consistent motion, lighting, and subject identity.

video-extend
🎯elevenlabs-music-generation🎯Skill

Generate full songs and instrumental tracks with ElevenLabs Music on RunComfy β€” studio-quality 44.1 kHz stereo audio from 5 seconds to 5 minutes with section-level control (Intro, Verse, Chorus, Bridge), multilingual vocals, and commercial-friendly output.

elevenlabs-music-generation
🎯gpt-image-2🎯Skill

A Claude Code skill for generating and editing images with OpenAI GPT Image 2 (ChatGPT Images 2.0) hosted on RunComfy, with strengths in embedded text, logos, multilingual typography, and precise multi-element prompt following.

gpt-image-2
🎯image-to-video🎯Skill

Animate any still image into video on RunComfy, routing to the best i2v model for each intent β€” HappyHorse 1.0 for general animations with native audio, Wan 2.7 with audio_url for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image plus reference video and audio.

image-to-video
🎯video-edit🎯Skill

A Claude Code skill that acts as a smart router for video editing on RunComfy, automatically selecting the best model -- Wan 2.7 Edit-Video for general restyle, Kling 2.6 Pro for motion transfer, or Lucy Edit Restyle for lightweight outfit/background swaps.

video-edit
🎯ai-image-generation🎯Skill

A Claude Code skill that smart-routes image generation and editing across 11+ AI models on RunComfy (FLUX 2, Nano Banana 2, GPT Image 2, Seedream, Z-Image, and more), picking the best model for each intent and shipping documented prompt patterns.

ai-image-generation
🎯ai-video-generation🎯Skill

Generate AI videos on RunComfy through a smart router across the full video-model catalog including HappyHorse 1.0, Wan 2-7, Seedance v2, Kling 3.0, Veo 3-1, and Hailuo 2-3. Covers text-to-video, image-to-video, and video-extend, automatically selecting the best model for the user's intent.

ai-video-generation
🎯image-edit🎯Skill

A Claude Code skill that acts as a smart router for image editing on RunComfy, automatically selecting the best model (Nano Banana Edit, GPT Image 2 Edit, Flux Kontext Pro, or Z-Image Turbo Inpaint) based on the user's intent.

image-edit
🎯flux-kontext🎯Skill

A Claude Code skill for editing images with Black Forest Labs Flux 1 Kontext Pro on RunComfy, specializing in single-reference precise local edits with strong prompt control and consistent high-fidelity identity preservation.

flux-kontext
🎯video-inpainting🎯Skill

Perform region edits across video frames on RunComfy β€” remove objects, clean up wires or watermarks, and replace regions with matching motion. Routes across Wan 2-7 edit-video for prompt-driven edits, Lucy Edit for identity-stable restyle, and Seedream for frame-by-frame processing.

video-inpainting
🎯runcomfy-cli🎯Skill

A unified CLI for the RunComfy Model API that provides one binary and one authentication to access hundreds of model endpoints including image generation, video generation, lip-sync, face swap, inpainting, outpainting, ControlNet, relighting, upscaling, and LoRA training.

runcomfy-cli
🎯face-swap🎯Skill

A Claude Code skill for face and character swapping in images and videos on RunComfy, routing across Wan 2-2 Animate, GPT Image 2 Edit, Nano Banana Edit, Flux Kontext, and Kling Motion Control based on the target medium and swap type.

face-swap
🎯nano-banana-edit🎯Skill

Edit images with Google Nano Banana 2 on RunComfy β€” preserve subject identity, swap backgrounds, localize edits with spatial language, and perform batch edits on up to 20 images in a single call. Best for identity-preserving edits and consistent multi-image processing.

nano-banana-edit
🎯relight🎯Skill

A Claude Code skill that relights still images on RunComfy, routing to Qwen Edit 2509's dedicated relight LoRA for purpose-built relighting or to identity-preserving edit models for prose-based lighting adjustments like golden hour or studio softbox effects.

relight
🎯image-inpainting🎯Skill

Mask-driven image inpainting on RunComfy β€” remove objects, fill gaps, or replace masked areas. Routes to Z-Image Turbo Inpainting when a mask is available and to instruction-driven edit models like Nano Banana 2 Edit and GPT Image 2 Edit when the region is described in prose.

image-inpainting
🎯happyhorse-1-0🎯Skill

A Claude Code skill for generating text-to-video with HappyHorse 1.0 on RunComfy, currently ranked #1 on Artificial Analysis Video Arena, featuring native 1080p output with in-pass synchronized audio and multi-shot character consistency.

happyhorse-1-0
🎯flux-2-klein🎯Skill

Generate images with Flux 2 Klein, Black Forest Labs' distilled fast variant of Flux 2, on RunComfy. Optimized for sub-second latency and rapid creative iteration with multi-reference brand styling and declarative prompts, available in 9B and 4B variants.

flux-2-klein
🎯lipsync🎯Skill

Lip-sync a face to an audio track using RunComfy, routing across multiple models including OmniHuman for portrait-to-avatar animation, Sync Labs for mouth sync onto existing video, and Kling lipsync for audio-to-video with synced speech.

lipsync
🎯ace-step🎯Skill

A Claude Code skill for generating, inpainting, and outpainting music with StepFun-AI ACE Step on RunComfy, offering tag-driven composition with multilingual lyrics across four CLI endpoints at extremely low per-second pricing.

ace-step
🎯kling-3-0🎯Skill

A Claude Code skill for generating video with Kuaishou Kling 3.0 on RunComfy, covering all six endpoints across three quality tiers (Standard, Pro, 4K) and two modes (text-to-video, image-to-video) with native synchronized audio and character consistency.

kling-3-0
🎯seedance-v2🎯Skill

A Claude Code skill for generating cinematic short-form video with ByteDance Seedance 2.0 Pro via RunComfy, supporting multi-modal references (images, videos, audio) with native lip-synced audio and cinematic motion refinement.

seedance-v2
🎯gpt-image-edit🎯Skill

A Claude Code skill for editing images with OpenAI GPT Image 2 on RunComfy, excelling at identity preservation through targeted edits, multilingual in-image text editing, and multi-reference composition with up to 10 input images.

gpt-image-edit
🎯wan-2-7🎯Skill

A Claude Code skill for generating text-to-video with Wan-AI's Wan 2.7 model on RunComfy, featuring multi-reference conditioning, audio-driven lip-sync via audio_url, and smoother transitions with prompt expansion.

wan-2-7
🎯ai-music🎯Skill

A Claude Code skill that generates AI music on RunComfy, routing between ElevenLabs Music Generation for premium vocal tracks and ACE Step for budget-friendly tag-driven composition, with support for audio inpainting and outpainting.

ai-music