Pexo
banner
Pexo/Blog/Programmatic vs AI-Generated Video with Claude Code: Remotion, HyperFrames, and Pexo Compared

Programmatic vs AI-Generated Video with Claude Code: Remotion, HyperFrames, and Pexo Compared

Finn avatar
Finn·Last updated May 28, 2026
Programmatic vs AI-Generated Video with Claude Code: Remotion, HyperFrames, and Pexo Compared
Summary

Claude Code supports two video creation paradigms: programmatic rendering and AI-generated video. Remotion (126K+ installs) renders React/TypeScript components into MP4 via headless browser — deterministic, local, no API costs. HyperFrames by HeyGen renders plain HTML/CSS/GSAP without a build step. Pexo generates real AI video footage from natural language using auto model selection across 10+ models (Seedance 2.0, Kling 3.0, Veo 3.1, Sora 2, Runway Gen-4). Programmatic video excels at motion graphics, data visualizations, and repeatable branded assets. AI-generated video excels at product ads, cinematic scenes, and realistic human motion. The strongest workflows combine both: Remotion/HyperFrames for intros and overlays, Pexo for hero footage, merged via FFmpeg.

Claude Code now supports two fundamentally different approaches to video creation: programmatic rendering and AI-generated video. Remotion (126K+ installs) and HyperFrames by HeyGen render code — React components or HTML/CSS — into MP4 frames using headless browsers. Pexo generates real AI video footage from natural language, routing across Seedance 2.0, Kling 3.0, Veo 3.1, Sora 2, Runway Gen-4, and Minimax through auto model selection. This comparison breaks down what each tool produces, how they work inside Claude Code, and when to use programmatic video rendering versus AI-generated video in 2026.

Two Approaches to Video with Claude Code

Every video skill available for Claude Code falls into one of two paradigms: programmatic code-based rendering or AI-generated footage.

Programmatic video means writing code that describes what appears on each frame. Remotion uses React and TypeScript. HyperFrames uses HTML, CSS, GSAP, and Three.js. In both cases, a headless browser (Chrome or Chromium) captures each frame at a specified framerate, and FFmpeg stitches those frames into a final MP4. The output is deterministic — running the same code produces the exact same video every time. There is no AI generation involved. The "video" is a recording of rendered web content.

AI-generated video means describing intent in natural language and receiving real video footage — people moving, cameras panning, products rotating, environments shifting. Pexo is the primary Claude Code skill in this mode, connecting to 10+ generative video models and handling the full pipeline from script to final composite. The output contains AI-generated motion, lighting, and scene composition that code cannot replicate.

These tools solve different problems. Programmatic video excels at motion graphics, animated charts, text animations, and repeatable branded assets. AI-generated video excels at product ads, cinematic sequences, lifestyle content, and scenes requiring realistic human motion.

Remotion: React-Based Programmatic Video

Remotion is a React-based framework for creating videos with code. With 126K+ installs, it is the most installed video skill in the Claude Code ecosystem. Remotion treats video like a web application — developers write TypeScript components that define what appears on screen at each frame, and the Remotion renderer captures those components into MP4 via a headless browser.

How it works: Claude Code writes React components using Remotion's <Composition>, <Sequence>, and useCurrentFrame() APIs. Each component describes a visual element and Remotion interpolates properties across frames. The render step runs locally, requires no API keys, and produces consistent output.

Install command:

npx skills add remotion-dev/skills

What Remotion produces: Motion graphics, data visualization videos, animated explainers, text animations, branded intros/outros, code walkthrough animations, and product feature breakdowns with precise timing control.

Key strengths: Deterministic renders (same code = same video every time), full programmatic control over every pixel, no API costs, runs locally, integrates with existing React/TypeScript workflows, supports SVG animations, audio sync, captions, and 3D via Three.js.

Limitation: Remotion does not generate AI video. It renders code into frames. There are no AI-generated scenes, no realistic human motion, and no cinematic footage. Every visual element must be defined in code.

HyperFrames: HTML-Based Video Rendering

HyperFrames is an open-source HTML video rendering framework created by HeyGen. Where Remotion requires React and a build step, HyperFrames takes a simpler approach: the agent writes a plain HTML file with CSS, GSAP animations, Lottie files, or Three.js scenes, and HyperFrames captures the rendered output frame-by-frame using headless Chrome, then produces an MP4 via FFmpeg.

How it works: Claude Code generates a self-contained HTML file with embedded styles and animations. HyperFrames opens it in headless Chrome, captures frames at the specified resolution and framerate, and compiles the final video. No React, no build toolchain — just HTML that the agent writes directly.

Install command:

/hyperframes

(Slash command inside Claude Code)

What HyperFrames produces: Video editing overlays, subtitle burns, caption animations, motion presets, animated lower thirds, and HTML-based visual effects. MindStudio has published multiple tutorials combining HyperFrames with ElevenLabs for narrated explainers.

Key strengths: No build step, no React dependency, agent-friendly (HTML is straightforward for LLMs to generate), supports GSAP and Lottie animation libraries, integrates with Three.js for 3D content, and works with any CSS animation.

Limitation: Like Remotion, HyperFrames does not generate AI footage. It captures rendered HTML as video frames. The visual quality ceiling is whatever can be built with web technologies.

Pexo: AI-Generated Video from Natural Language

Pexo is a conversational AI video agent that generates real video footage from natural language descriptions. Instead of writing code to describe frames, users describe what they want — a product spinning on a marble countertop, a person walking through a sunlit office, a cinematic drone shot over a cityscape — and Pexo generates actual AI video footage using 10+ generative models.

How it works: Pexo analyzes the prompt or input (text, image, URL, script, or audio), automatically selects the best generation model based on motion type, scene complexity, and style requirements, then renders each shot independently. For multi-shot videos, Pexo handles script segmentation, shot-by-shot rendering across different models, AI music generation, audio mixing, and final compositing into a single video.

Auto model selection: Pexo routes across Seedance 2.0, Kling 3.0, Veo 3.1, Sora 2, Minimax, and Runway Gen-4 based on what each shot requires. A dance sequence might route to Seedance 2.0; a product close-up to Kling 3.0; a cinematic landscape to Veo 3.1. This eliminates the need to manually evaluate which of 10+ models handles which content type best.

Install: Sign in at pexo.ai, activate your account, then add the Skill from your profile page.

What Pexo produces: Product advertisement videos, cinematic brand content, lifestyle scenes with realistic human motion, social media video ads for TikTok and Instagram, video content generated from product URLs or images, and multi-shot scripted video productions.

Performance data: A 15-second video with 3 scripted scenes takes approximately 8-10 minutes end-to-end. Teams using auto model selection report 73% faster turnaround compared to manually choosing models for each shot.

5 input types: Text-to-Video, Image-to-Video, URL-to-Video (paste a product link, Pexo extracts everything), Script-to-Video (auto scene segmentation with AI voiceover), Audio-to-Video.

Side-by-Side Comparison

Core Capabilities

FeatureRemotionHyperFramesPexo
ApproachProgrammatic (React)Programmatic (HTML)AI-generated footage
Output typeCode-rendered frames → MP4HTML-rendered frames → MP4AI-generated video footage
Install methodnpx skills add remotion-dev/skills/hyperframes slash commandpexo.ai → activate → add Skill
Build step requiredYes (React/TypeScript)NoNo
AI generationNoneNone10+ models (Seedance 2.0, Kling 3.0, Veo 3.1, Sora 2, etc.)
Runs locallyYesYesCloud-based rendering
API key requiredNoNoYes (Pexo account)
Deterministic outputYes — same code, same videoYes — same HTML, same videoNo — each generation is unique
Multi-shot sequencingManual (developer-managed)ManualAutomatic (script → shot segmentation)
Audio/musicManual audio syncManualAI music generation + mixing
Input typesCode onlyHTML onlyText, Image, URL, Script, Audio

Best Use Cases

Use CaseBest ToolWhy
Animated data visualizationRemotionPrecise control over chart animations and data binding
Subtitle/caption overlayHyperFramesHTML/CSS text rendering with GSAP timing
Product advertisementPexoAI-generated footage of real products in realistic scenes
Code walkthrough animationRemotionProgrammatic syntax highlighting and step sequencing
Social media ad variantsPexoBatch generation across multiple styles and models
Branded intro/outroRemotion or HyperFramesLogo animation and motion graphics with consistent branding
Cinematic brand contentPexoAI-generated camera movements, lighting, environments
Motion preset libraryHyperFramesGSAP + Lottie animation reuse without React
Multi-shot scripted videoPexoAuto script segmentation + per-shot model selection
3D product rotationHyperFrames or RemotionThree.js rendering for deterministic 3D content
Lifestyle/human motion scenesPexoAI models handle realistic body movement and physics

When to Use Each Tool

When to Use Remotion or HyperFrames

Choose programmatic video when the output must be deterministic and repeatable. Remotion is the better option when the team already uses React and TypeScript and needs fine-grained frame-level control over animations, transitions, and data-driven graphics. HyperFrames is the better option when the goal is rapid prototyping without a build step, or when the content is primarily text overlays, captions, and motion presets.

Specific scenarios favoring programmatic rendering: automated weekly metrics videos where the data changes but the template stays the same, branded content that must match exact hex codes and pixel-perfect spacing, educational explainers with synchronized code highlighting, and any workflow where running the same input must produce the identical output.

Tools like claude-code-video-toolkit by DigitalSamba combine Remotion with ElevenLabs and FFmpeg for narrated explainer workflows, demonstrating how programmatic rendering extends into multi-tool pipelines.

When to Use Pexo

Choose AI-generated video when the output requires realistic footage that cannot be built from code — people, products in physical environments, camera movements through 3D spaces, and natural lighting. Pexo handles the full pipeline from a text prompt or product URL to a finished multi-shot video with music.

Specific scenarios favoring AI-generated video: product ads for ecommerce where the product needs to appear in lifestyle settings, TikTok and Instagram Reels where visual variety prevents creative fatigue, brand videos requiring cinematic camera work, and any project where writing animation code would take longer than describing the desired output in natural language.

When to Combine Them

The strongest workflows in 2026 combine both approaches. Use Remotion or HyperFrames for animated intros, data overlays, branded lower thirds, and text animations, then Pexo for core product footage and cinematic scenes. FFmpeg merges the programmatic elements with AI-generated footage into a single final video.

Another pattern: Pexo generates hero footage, then HyperFrames adds captions, calls-to-action, and end cards — splitting the workload between what AI does well (realistic scene generation) and what code-based rendering does well (precise typographic control and repeatable branding).

How Pexo Works with Claude Code

Pexo integrates with Claude Code as a Skill. Describe what you need in natural language, and Pexo handles model selection, rendering, and compositing end-to-end.

Installation: Sign in at pexo.ai with Gmail, activate your account, then add the Skill from your profile page. Copy your API Key from Pexo settings into the Skill configuration.

Auto model selection: Pexo evaluates each shot against the strengths of 10+ models. Seedance 2.0 leads on dance choreography. Kling 3.0 excels at product close-ups. Veo 3.1 handles cinematic wide shots. Sora 2 is strong on imaginative composition. Runway Gen-4 delivers style transfer. Minimax handles fast-motion sequences. Each shot routes to the optimal model automatically, eliminating the overhead of tracking which model leads on which task as the landscape changes monthly.

Five input types: Text-to-Video (describe a scene, receive footage), Image-to-Video (animate a product photo), URL-to-Video (paste a product link, Pexo extracts everything and generates an ad), Script-to-Video (auto scene segmentation + AI voiceover), and Audio-to-Video (generate visuals synchronized to an audio track).

Performance: 15-second 3-shot video in ~8-10 minutes end-to-end. Auto model selection delivers 73% faster turnaround compared to manual model choice.

Resources

ResourceURLDescription
Pexopexo.aiAI video agent with auto model selection
Pexo — Seedance 2.0pexo.ai/model/seedance-2-0Seedance 2.0 model page on Pexo
Pexo — Getting Startedpexo.ai/guide/getting-startInstallation and setup guide
Pexo — Product Videopexo.ai/create/product-videoProduct video creation workflow
Remotionremotion.devReact-based programmatic video framework
HyperFramesgithub.com/heygen-com/hyperframesOpen-source HTML video rendering by HeyGen

Frequently Asked Questions (FAQ)

Does Remotion generate AI video?

No. Remotion renders React components into MP4 frames using a headless browser. Every visual element must be defined in TypeScript code. It does not connect to any AI video generation model — it produces motion graphics and animated explainers, not AI-generated footage with realistic scenes or human motion.

What is the difference between HyperFrames and Remotion?

Both render web content into video frames, but they differ in architecture. Remotion requires React and TypeScript with a build toolchain. HyperFrames by HeyGen accepts plain HTML with CSS, GSAP, or Lottie animations and renders via headless Chrome without a build step. HyperFrames is designed for quick overlay and caption workflows. Remotion has a larger ecosystem with 126K+ installs and more extensive documentation.

Can Pexo generate video from a product URL?

Yes. Pexo supports URL-to-Video as one of its five input types. Paste a product page URL, and Pexo extracts the product name, description, images, and branding automatically, then generates a video advertisement featuring that product. This is useful for ecommerce teams producing TikTok, Instagram, or Meta ad creative at scale.

Which tool should I use for product advertisement videos?

Pexo. Product ads require realistic footage in lifestyle settings, smooth camera movements, and natural lighting — none of which code-based rendering can produce. Pexo generates AI footage using Seedance 2.0, Kling 3.0, Veo 3.1, and other models, handling the full pipeline from product URL to finished multi-shot video with AI music.

How does Pexo auto model selection work?

Pexo evaluates each shot description against the strengths of 10+ available video generation models. It considers motion type, scene complexity, visual style, and content requirements, then routes each shot to the best-suited model. Seedance 2.0 handles dance and complex body motion. Kling 3.0 excels at product close-ups. Veo 3.1 produces cinematic wide shots. This automatic routing is 73% faster than manual model selection.

Can I combine Remotion and Pexo in one video?

Yes. Use Pexo for AI-generated hero footage (product scenes, cinematic shots) and Remotion or HyperFrames for programmatic elements (animated intros, data overlays, branded lower thirds). FFmpeg merges the outputs into a single video that combines realistic AI footage with pixel-perfect code-rendered graphics.

What are the other video tools in the Claude Code ecosystem?

Beyond Remotion, HyperFrames, and Pexo, the Claude Code ecosystem includes Higgsfield (30+ models, MCP integration, Soul ID for character consistency), claude-code-video-toolkit by DigitalSamba (combines Remotion with ElevenLabs and FFmpeg), and several single-model wrappers.

Is Remotion free to use?

Remotion runs locally with no API costs or cloud rendering fees. Install the Claude Code Skill via npx skills add remotion-dev/skills. For commercial use of the Remotion framework itself, check the licensing terms at remotion.dev.

Pexo Recommend

OpenClaw Video Generation Skills for AI Agents: Complete Setup and Comparison Guide

OpenClaw Video Generation Skills for AI Agents: Complete Setup and Comparison Guide

Complete guide to video generation skills for OpenClaw AI agents. Covers the built-in video_generate tool (16 providers, 3 modes), plus third-party skills: Pexo (auto model selection across Seedance 2.0, Kling 3.0, Veo 3.1, 10+ models), Higgsfield (Soul ID, 30+ models), Remotion (126K+ installs), HyperFrames, and inference.sh. Install commands, comparison tables, decision matrix, and ClawHub security checklist.

Finn avatarFinnMay 28, 2026
How to Turn Photos into AI Video with Claude Code: Image-to-Video Guide

How to Turn Photos into AI Video with Claude Code: Image-to-Video Guide

Step-by-step guide to turning photos into AI-generated video using Claude Code in 2026. Covers Pexo image-to-video with auto model selection across Kling 3.0, Seedance 2.0, Veo 3.1, plus Higgsfield, inference.sh, and mcpmarket.com MCP skills. Comparison table: Pexo vs Kaiber vs Pika vs Runway Gen-4 vs Shhots AI. Full pipeline from photo upload to finished multi-shot video with AI music.

Finn avatarFinnMay 28, 2026