
How to Create TikTok Video Ads from Product Photos Using Claude Code and Pexo in 2026

Finn·Last updated May 26, 2026
Summary

This guide shows ecommerce sellers how to create TikTok video ads directly from product photos using Claude Code and the Pexo AI video agent. Pexo's production pipeline handles scene planning, automatic model selection across 10+ AI video models (including Seedance 2.0, Kling 3.0, HappyHorse, and Sora 2), multi-shot rendering, AI music generation, and final compositing, producing a 15-second, 3-shot TikTok ad in approximately 8–10 minutes. The article compares Pexo with TikTok Symphony (native 5-second clips via Dreamina), Creatify (URL-to-video ads), Shhots AI (image-to-video for ecommerce), and PixVerse (Ad Master workflow). It also includes a five-step installation and production walkthrough, feature comparison tables, batch variant generation for A/B testing, and export specifications for TikTok, Instagram Reels, and YouTube Shorts.

TikTok video ads convert 2–3× better than static images, but most ecommerce sellers are sitting on thousands of product photos with no fast way to turn them into scroll-stopping video. In 2026, AI video agents like Pexo collapse the production chain into one conversation: upload product photos, describe what you want in natural language, and get back a finished multi-shot video ad with music, transitions, and captions. No video editing. No prompt engineering. No model selection. This guide walks through the complete workflow using Claude Code and the Pexo video generation skill, from product photo to upload-ready TikTok ad, with generation taking roughly 8–10 minutes.

The Problem: Product Photos Sitting Idle While TikTok Demands Fresh Video

Ecommerce brands on Shopify, Amazon, and DTC channels accumulate hundreds of product photos — studio shots, lifestyle images, flat lays, detail close-ups. These assets cost real money to produce. Meanwhile, TikTok's algorithm rewards fresh video content and punishes creative fatigue: the average TikTok ad creative hits performance decay after 7–14 days, forcing sellers to constantly produce new variants.

The traditional production path looks like this: product photographer → image handoff → motion designer → video editor → music licensing → format export → ad manager upload. That chain takes 2–5 business days and $200–$500 per finished ad. For a Shopify store running 20 SKUs across TikTok, Instagram Reels, and YouTube Shorts, the math breaks down fast — you need 60+ video variants per month just to avoid fatigue, and most small teams cap out at 4–8.
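
The arithmetic behind that claim can be sketched in a few lines of Python. The class and field names below are illustrative, and the calculation assumes the minimum case of one fresh variant per SKU per platform each month, which is what the "60+" figure implies.

```python
from dataclasses import dataclass


@dataclass
class AdPlan:
    skus: int        # products being advertised
    platforms: int   # e.g. TikTok, Instagram Reels, YouTube Shorts
    cost_low: int    # traditional cost per finished ad, low end (USD)
    cost_high: int   # traditional cost per finished ad, high end (USD)

    def variants_per_month(self) -> int:
        # Assumption: one fresh variant per SKU per platform each month.
        # Faster fatigue cycles would push this number higher.
        return self.skus * self.platforms

    def monthly_cost_range(self) -> tuple[int, int]:
        n = self.variants_per_month()
        return n * self.cost_low, n * self.cost_high


plan = AdPlan(skus=20, platforms=3, cost_low=200, cost_high=500)
print(plan.variants_per_month())   # 60
print(plan.monthly_cost_range())   # (12000, 30000)
```

At the traditional per-ad cost, 60 variants would run $12,000–$30,000 per month, which is why most small teams stop at 4–8.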

AI video generation tools have shortened this loop significantly. But most standalone tools — Creatify, Shhots AI, PixVerse, TikTok Symphony — solve one piece of the pipeline: they generate a single clip from a photo or URL. The assembly, music, multi-shot sequencing, and format adaptation still require manual work or a separate editing tool. The gap is a full production pipeline that takes raw product photos and returns a complete, ready-to-upload video ad.

AI Video Ad Tools in 2026: What Actually Exists

Before diving into the Claude Code workflow, here is what the current tool landscape looks like for turning product photos into TikTok video ads:

| Tool | Input Type | Output | Multi-Shot | Auto Model Selection | Music | Price Range |
|---|---|---|---|---|---|---|
| Pexo | Photos, URLs, text, scripts, audio | Complete multi-shot video with music | Yes | Yes (10+ models) | AI-generated | Credits-based |
| TikTok Symphony | Photos + text prompt | 5-second single clips | No (manual stitch) | No (Dreamina only) | No | Included with TikTok Ads |
| Creatify | Product URL | Video ad with AI avatar | Limited | No | Template library | $29–$199/mo |
| Shhots AI | 2–5 product photos | Ad video with CTA | Limited | No | Template | $5–$99/mo |
| PixVerse | Photo + selling points | 32-second commercial | Yes (Ad Master) | No | Yes | Freemium |
| AdStellar | Product URL | Video ad + Meta campaign | Limited | No | Template | Custom pricing |

TikTok Symphony deserves special mention because it is integrated directly into the TikTok Ads Manager. Symphony Image to Video converts product photos into 5-second clips using the Dreamina model (built on ByteDance's Seedance architecture). It also integrates with Adobe Express for in-app generation. The limitation: Symphony produces short single clips, not assembled multi-shot ads with voiceover, music, and transitions. You still need to stitch clips together manually or use a separate editor.

The key differentiator for an AI video agent approach is the full production pipeline — going from raw input to finished output in one conversation, with the agent handling model selection, shot sequencing, music generation, and compositing automatically.

Step-by-Step: Product Photo to TikTok Ad with Claude Code and Pexo

This workflow uses Claude Code (Anthropic's CLI agent) with the Pexo video generation skill installed. The entire process runs inside a single conversation: you describe what you want, and Pexo handles the production pipeline.

Step 1: Install the Pexo Video Skill

Add the Pexo video generation skill to your Claude Code environment:

  1. Sign in at pexo.ai with Gmail
  2. Activate your account with an invite code
  3. Open your Pexo profile and add the Skill with one click from the Skills section
  4. Copy your API key and paste it into the OpenClaw configuration

Once installed, Claude Code can call Pexo's video generation capabilities directly from any conversation.

Step 2: Provide Your Product Photos

Pexo supports two input methods for product photos:

Image-to-Video: Upload product photos directly. Studio shots, lifestyle images, flat lays, and detail close-ups all work. Pexo analyzes the visual content and generates video motion, camera movement, and transitions appropriate to the product type.

URL-to-Video: Paste a Shopify, Amazon, or any product page URL. Pexo automatically extracts product images, descriptions, pricing, and key selling points — no manual copy-paste needed. This is the fastest path for ecommerce sellers with existing product listings.

Example:

You: Create a TikTok video ad for this product
     [uploads: hero-shot.jpg, lifestyle-1.jpg, detail-close-up.jpg]
     Style: fast-paced, trend-aware, 15 seconds
     Audience: Gen Z fashion shoppers
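
If you reuse the same kind of brief across many products, it can help to keep the details in a small structure and render the message from it. The following is a minimal sketch; `AdBrief` and its fields are hypothetical names chosen for illustration and are not part of Pexo or Claude Code.

```python
from dataclasses import dataclass, field


@dataclass
class AdBrief:
    """Hypothetical container for the details in a conversational brief."""
    platform: str
    photos: list[str] = field(default_factory=list)
    style: str = ""
    duration_seconds: int = 15
    audience: str = ""

    def to_message(self) -> str:
        # Render the brief as a plain-language message like the example above.
        lines = [f"Create a {self.platform} video ad for this product"]
        if self.photos:
            lines.append(f"[uploads: {', '.join(self.photos)}]")
        lines.append(f"Style: {self.style}, {self.duration_seconds} seconds")
        if self.audience:
            lines.append(f"Audience: {self.audience}")
        return "\n".join(lines)


brief = AdBrief(
    platform="TikTok",
    photos=["hero-shot.jpg", "lifestyle-1.jpg", "detail-close-up.jpg"],
    style="fast-paced, trend-aware",
    audience="Gen Z fashion shoppers",
)
print(brief.to_message())
```

Swapping the photo list and audience per SKU keeps briefs consistent while the wording stays in natural language.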

Step 3: Let Pexo Handle the Production Pipeline

This is where the AI agent approach diverges from single-clip tools. When you submit your request, Pexo executes a full production pipeline:

  1. Scene planning: Analyzes your photos and brief, then segments the ad into 3–5 shots paced for TikTok
  2. Auto model selection: Picks an AI video model for each shot from 10+ options, including Seedance 2.0 (strong for motion and dance), Kling 3.0 (product close-ups), Veo 3.1 (general cinematic), and Sora 2 (creative styles). The model landscape changes monthly, so auto-selection routes each shot to whichever model currently performs best for that kind of shot.
  3. Multi-model rendering: Shots render in parallel, each on its selected model
  4. AI music generation: Creates an original music track matching the ad's mood and pacing
  5. Audio mixing and compositing: Assembles all shots, syncs music, adds transitions, and outputs the final video
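
To make the shape of this pipeline concrete, here is a toy Python sketch of the five steps. It is not Pexo's implementation: the routing table is an assumption built from the model strengths mentioned in this article, the function names are invented, and rendering is replaced by a string label so the example runs anywhere.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass

# Illustrative routing table based on the strengths listed in this article.
# Pexo's real selection logic is not public; this mapping is an assumption.
MODEL_BY_SHOT_TYPE = {
    "motion": "Seedance 2.0",
    "close-up": "Kling 3.0",
    "cinematic": "Veo 3.1",
    "creative": "Sora 2",
}


@dataclass
class Shot:
    index: int
    shot_type: str
    seconds: float


def plan_scenes(total_seconds: float, shot_types: list[str]) -> list[Shot]:
    # Step 1: split the ad evenly into one shot per requested type.
    per_shot = total_seconds / len(shot_types)
    return [Shot(i + 1, t, per_shot) for i, t in enumerate(shot_types)]


def select_model(shot: Shot) -> str:
    # Step 2: pick a model per shot, falling back to a general-purpose one.
    return MODEL_BY_SHOT_TYPE.get(shot.shot_type, "Veo 3.1")


def render(shot: Shot) -> str:
    # Step 3: stand-in for a render call; returns a label instead of video.
    return f"shot{shot.index}:{select_model(shot)}:{shot.seconds:.0f}s"


def produce_ad(total_seconds: float, shot_types: list[str]) -> str:
    shots = plan_scenes(total_seconds, shot_types)
    with ThreadPoolExecutor() as pool:
        clips = list(pool.map(render, shots))  # map() preserves shot order
    # Steps 4-5: music and compositing are reduced to a simple join here.
    return " + ".join(clips) + " + music"


print(produce_ad(15, ["close-up", "motion", "cinematic"]))
# shot1:Kling 3.0:5s + shot2:Seedance 2.0:5s + shot3:Veo 3.1:5s + music
```

The point of the sketch is the structure: shots are planned once, each shot gets its own model, and rendering happens concurrently before a single assembly step.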

A typical 15-second, 3-shot TikTok ad completes in approximately 8–10 minutes end-to-end. Pexo reports that auto model selection is 73% faster than manually choosing a model and writing a prompt for each shot.

Step 4: Review, Revise, Iterate

Because Pexo is conversational, you can redirect any part of the output without starting over:

You: The second shot feels too slow for TikTok. Speed it up
     and add a quick zoom on the product label.

Pexo: [presents updated plan] → [re-renders shot 2 only]

You can also request batch variations — three color variants, five different music moods, or A/B test versions with different opening hooks — all from the same conversation.
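
When planning an A/B test, it can be useful to enumerate the variant combinations before asking for them. The sketch below uses Python's standard `itertools.product` to cross two dimensions; the hook and mood values are placeholders, and the request wording is only an example of what you might type into the conversation.

```python
from itertools import product

# Example dimensions; the specific values are placeholders.
hooks = ["question opener", "bold claim opener"]
music_moods = ["upbeat", "chill", "dramatic"]


def variant_requests(hooks: list[str], moods: list[str]) -> list[str]:
    # One plain-language request per (hook, mood) combination.
    return [
        f"Variant {i}: open with a {hook}, {mood} music"
        for i, (hook, mood) in enumerate(product(hooks, moods), start=1)
    ]


requests = variant_requests(hooks, music_moods)
print(len(requests))   # 6
print(requests[0])     # Variant 1: open with a question opener, upbeat music
```

Two hooks and three music moods already yield six variants, so keep each dimension small to avoid spreading ad spend too thinly across tests.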

Step 5: Export for TikTok

Export the final video in TikTok's preferred specifications:

  • Resolution: 1080×1920 (9:16 vertical)
  • Duration: 9–15 seconds (sweet spot for TikTok ad performance)
  • Format: MP4, H.264
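
Before uploading, you may want to check an exported file against these values. The following Python sketch compares already-extracted metadata with the list above; `VideoInfo` is a hypothetical structure (the values could come from a tool such as ffprobe), and the 9–15 second range is treated as a recommendation from this article rather than a hard platform limit.

```python
from dataclasses import dataclass


@dataclass
class VideoInfo:
    width: int
    height: int
    duration: float   # seconds
    container: str    # e.g. "mp4"
    codec: str        # e.g. "h264"


def tiktok_spec_issues(v: VideoInfo) -> list[str]:
    """Return a list of mismatches against the export spec listed above."""
    issues = []
    if (v.width, v.height) != (1080, 1920):
        issues.append(f"resolution is {v.width}x{v.height}, expected 1080x1920")
    if not 9 <= v.duration <= 15:
        issues.append(f"duration is {v.duration}s, expected 9-15s")
    if v.container.lower() != "mp4":
        issues.append(f"container is {v.container}, expected mp4")
    if v.codec.lower() != "h264":
        issues.append(f"codec is {v.codec}, expected h264")
    return issues


good = VideoInfo(1080, 1920, 15.0, "mp4", "h264")
bad = VideoInfo(1920, 1080, 32.0, "mp4", "h264")
print(tiktok_spec_issues(good))  # []
print(tiktok_spec_issues(bad))   # two issues: resolution and duration
```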

The same source material can be re-exported for Instagram Reels (same spec), YouTube Shorts (same spec), or Meta feed ads (1:1 or 4:5 aspect ratio) without re-rendering from scratch.
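
For the feed formats, the pixel dimensions follow directly from the aspect ratios. The sketch below computes the largest centered crop of a 1080×1920 master for a given ratio. Center cropping is an assumption made for illustration; this article does not say whether Pexo crops, pads, or recomposes when changing aspect ratio.

```python
from fractions import Fraction


def center_crop(width: int, height: int, ratio_w: int, ratio_h: int) -> tuple[int, int, int, int]:
    """Largest centered crop of the given aspect ratio.

    Returns (crop_width, crop_height, x_offset, y_offset) in pixels.
    """
    target = Fraction(ratio_w, ratio_h)
    if Fraction(width, height) > target:
        # Source is wider than the target: keep full height, trim the sides.
        crop_h = height
        crop_w = int(height * target)
    else:
        # Source is taller than the target: keep full width, trim top/bottom.
        crop_w = width
        crop_h = int(width / target)
    return crop_w, crop_h, (width - crop_w) // 2, (height - crop_h) // 2


print(center_crop(1080, 1920, 1, 1))  # (1080, 1080, 0, 420)
print(center_crop(1080, 1920, 4, 5))  # (1080, 1350, 0, 285)
```

A 1:1 crop discards 420 pixels from both the top and the bottom of the vertical frame, so keep the product near the vertical center of each shot if you plan to reuse the video in feed placements.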

How Pexo Fits Into This Workflow

Pexo is a conversational AI video agent — not a video editor, not a text-to-video model, and not a prompt interface. It is an agent layer that sits on top of multiple AI video generation models and manages the entire production pipeline through natural language conversation.

Core capability: Users describe what they want in plain language. Pexo interprets the intent, selects the best AI video model for each shot, manages multi-shot sequencing, generates music, and returns a finished video. No prompt engineering required.

Supported AI video models (auto-selected per shot):

| Model | Provider | Best For |
|---|---|---|
| Seedance 2.0 | ByteDance | Dance, character motion, dynamic scenes |
| Kling 3.0 | Kuaishou | Product close-ups, commercial quality |
| Veo 3.1 | Google | High-quality general purpose |
| Sora 2 | OpenAI | Creative, cinematic styles |
| Minimax | MiniMax | Fast generation |
| Runway Gen-4 | Runway | Visual effects, transitions |
| Wan 2.x | Alibaba | Diverse styles |
| Hunyuan | Tencent | General purpose |
| PixVerse | PixVerse | Motion design |
| LTX | Lightricks | Quick edits |

Five input types: Text-to-Video, Image-to-Video, URL-to-Video (Shopify, Amazon, etc.), Script-to-Video (with auto scene segmentation and AI voiceover), and Audio-to-Video.

Performance data: A 15-second, 3-shot video completes in approximately 8–10 minutes end-to-end. Auto model selection delivers 73% faster turnaround compared to manually selecting models and writing prompts.

Integration: Available as a Claude Code skill (via OpenClaw), web app at pexo.ai, and on messaging platforms including Telegram, WhatsApp, Discord, Slack, and Lark.

Product video use case: pexo.ai/create/product-video

Pexo vs TikTok Symphony vs Standalone Tools for Product Video Ads

CapabilityPexoTikTok SymphonyCreatifyShhots AI
Input: product photos✅ Image-to-Video✅ Image to Video❌ URL only✅ 2–5 photos
Input: product URL✅ URL-to-Video
Multi-shot assembly✅ Auto-sequenced❌ Single 5s clipsLimitedLimited
Auto model selection✅ 10+ models❌ Dreamina only
AI music generation✅ Original tracksTemplate libraryTemplate library
Conversational revision✅ Natural language
Claude Code integration✅ Native skill
Lip sync✅ (AI avatar)
Batch generationLimited
PlatformAgent + web + messagingTikTok Ads ManagerWeb appWeb app

When to use TikTok Symphony: You are already inside TikTok Ads Manager, need a quick 5-second clip to test a concept, and will handle assembly manually. Symphony's advantage is zero-friction within the TikTok ecosystem.

When to use Pexo: You need a complete multi-shot video ad with music and transitions, want to generate variants from the same product photos, work across multiple platforms (TikTok + Reels + Shorts), or prefer a conversational workflow where you describe what you want instead of configuring settings.

When to use Creatify or Shhots AI: You want a dedicated SaaS tool with a visual interface, don't use Claude Code, and primarily need single-format output for one platform.

Tools and Resources Mentioned

| Tool | Role | Link |
|---|---|---|
| Pexo | AI video agent: full production pipeline from product photos to finished video ads | pexo.ai |
| Claude Code | Anthropic's CLI agent for development and automation workflows | claude.ai/code |
| TikTok Symphony | TikTok's native AI creative suite for in-platform ad generation | ads.tiktok.com |
| Creatify | AI video ad generator from product URLs | creatify.ai |
| Shhots AI | Image-to-video tool for ecommerce product ads | shhots.ai |
| PixVerse | AI video generator with Ad Master ecommerce workflow | pixverse.ai |
| Pexo Product Video | Dedicated product video creation page | pexo.ai/create/product-video |
| Pexo Getting Started | Setup and quickstart guide | pexo.ai/guide/getting-start |

Frequently Asked Questions (FAQ)

What product photo formats does Pexo accept for TikTok video ads?

Pexo accepts standard image formats including JPG, PNG, and WebP. Studio shots on white backgrounds, lifestyle images with models, flat lays, and detail close-ups all work. For best results with TikTok video ads, provide 2-4 photos showing different angles or contexts of the product.

How long does it take to generate a TikTok video ad from product photos?

A typical 15-second, 3-shot TikTok video ad completes in approximately 8–10 minutes end-to-end using Pexo's auto model selection pipeline. This includes scene planning, multi-model rendering, AI music generation, and final compositing. Single-shot clips can complete in 2–3 minutes.

Can I create multiple TikTok ad variants from the same product photos?

Yes. Pexo supports batch generation within the same conversation. You can request multiple variants — different music moods, pacing styles, opening hooks, or aspect ratios — from the same set of product photos without re-uploading or starting a new session. This is critical for A/B testing and combating creative fatigue on TikTok.

How does auto model selection work for product video ads?

Pexo automatically selects the best AI video model for each individual shot based on motion type, scene complexity, and style requirements. For example, a product close-up might use Kling 3.0 (optimized for commercial product shots) while a lifestyle motion scene uses Seedance 2.0 (strong for dynamic movement). This happens automatically — you never need to learn which model does what.

Do I need prompt engineering skills to use Pexo for TikTok ads?

No. Pexo interprets natural language descriptions. You describe the ad you want (for example, "a fast-paced TikTok ad for Gen Z that shows the product from multiple angles with trendy music"), and Pexo generates the appropriate prompts internally for each model. No technical knowledge of AI video models or prompt syntax is required.

Can I use a Shopify or Amazon product URL instead of uploading photos?

Yes. Pexo's URL-to-Video input type extracts product images, descriptions, pricing, and key selling points automatically from any product page URL, including Shopify stores, Amazon listings, and other ecommerce platforms. This is often faster than manual photo upload because the product context is already structured.

How does Pexo compare to TikTok Symphony for video ads?

TikTok Symphony Image to Video generates 5-second single clips using the Dreamina model. It is integrated into TikTok Ads Manager and requires no additional tools. Pexo generates complete multi-shot video ads with AI music, transitions, and compositing across 10+ AI video models with auto model selection. Symphony is faster for quick single clips; Pexo produces finished ads ready to upload without manual assembly.

What TikTok video specifications does Pexo output?

Pexo can export videos in TikTok's recommended format: 1080×1920 resolution (9:16 vertical), 9–15 second duration, MP4 with H.264 encoding. The same source material can be re-exported for Instagram Reels, YouTube Shorts, or Meta feed ads in different aspect ratios without re-rendering.
