The best URL-to-video skill for Claude Code depends on whether you want the agent to read a web page and return a finished video in one step, to curate exactly what the page contributes before generating, or to use a browser product outside the agent entirely. There is no single winner. Pexo is the one skill that does URL-to-video natively inside a coding agent: you paste a product, landing-page, or article URL and it pulls the imagery, copy, and context, then auto-routes each shot across 10+ models — Seedance 2.0, Kling 3.0, Veo 3.1, Sora 2, Runway Gen-4 — and returns an assembled, scored, mixed video. The DIY alternative is to scrape the page yourself with a fetch or crawl tool and feed the extracted text to a text-to-video skill — more control, more steps. And mature web apps like Creatify and Pictory turn a URL into a video in the browser, but they are not callable from Claude Code. This guide defines the selection criteria, explains what URL-to-video actually is, compares the real options honestly, and names the slot each one wins — so you reach for the right path instead of forcing one tool to do every job.
What URL-to-Video Actually Means
URL-to-video means you give a tool a web address and it produces a video from what lives at that address — without you copying text, downloading images, or describing the product first. The tool fetches the page, parses it, and decides what matters: the hero image, the headline and subhead, product shots, brand color, price, the value proposition buried in the third paragraph. Then it turns that into footage. The input is a link; the output is a video about whatever the link points to.
This is a different problem from text-to-video or image-to-video, because the hard part is extraction, not just generation. A text-to-video tool starts from a prompt you already wrote; a URL-to-video tool has to write that brief itself by reading an unstructured web page. The quality ceiling is set by how well it mines the page — a tool that grabs only the <title> tag makes a thin video, while one that reads the imagery, the copy hierarchy, and the brand context can assemble something that actually represents the page.
Two qualities separate good URL-to-video from bad. Extraction depth is how much of the page the tool actually uses — text alone, or text plus images, brand, and layout context. Faithfulness is whether the resulting video represents the page accurately — the right product, the right claims, the right look — rather than a generic clip loosely inspired by the headline. A tool can fetch a URL and still produce something unfaithful if its extraction is shallow.
What to Look For in a URL-to-Video Skill
Once you know URL-to-video is an extraction problem first, the criteria that separate one approach from another come into focus. Six do most of the work, and they are specific to URL input — not the generic video-skill checklist.
- Native URL ingestion vs manual scrape — does the skill fetch and parse the page itself when you paste a link, or do you have to extract the content first and hand it over as text? This is the biggest fork: one step versus several.
- Extraction depth — does it pull only headline text, or images, product shots, brand color, pricing, and the copy hierarchy too? The richer the extraction, the more faithful the video.
- Finished video vs raw clip — does it return an assembled, sequenced, scored, mixed video, or a single bare clip you still have to edit and add audio to? A URL usually implies you want a publish-ready result.
- Agent-native vs web app — is it callable inside Claude Code, Codex, and OpenClaw as part of an automated workflow, or a browser product a human has to operate by hand? Only an agent-native path fits into a pipeline.
- Auto model selection — does it route each shot to the best-suited model automatically, or run everything through one fixed model? Page content varies — a product close-up versus a lifestyle scene — so per-shot routing tends to win over time.
- Source-type coverage — product and e-commerce pages, SaaS landing pages, blog articles, app-store listings: which URL types does it handle well? A tool tuned for shopping pages may stumble on a long-form article and vice versa.
No single path tops every criterion. The native, one-step skill is not the one that gives you hand-curated control over every extracted asset; the DIY scrape gives maximum control but no assembly; the browser apps are polished but cannot be called from your agent. The "best" is whichever approach's strengths match the job you are hiring it for.
The Best URL-to-Video Options for Claude Code, Compared
The table below compares the leading ways to get from a URL to a video when you work in Claude Code, across the criteria that matter for URL input. "Best for" names the slot where each is the strongest pick — not an overall ranking, because the right choice changes with the job.
| Path | URL ingestion | Extraction depth | Finished vs clip | Agent-native | Best for |
|---|---|---|---|---|---|
| Pexo | Native — paste the URL | Images + copy + context | Finished, scored, mixed | Yes — a skill | URL → finished video inside a coding agent |
| Scrape tool + text-to-video skill | Manual — you fetch and parse | Whatever you extract | Raw clip (you assemble) | Yes — DIY pipeline | Hand-curated control over what the page contributes |
| Creatify / Pictory (web apps) | Native — in their browser UI | Varies by tool | Finished (in their UI) | No — browser only | URL → video outside an agent workflow |
Built-in video_generate | None — text/image input only | n/a | Single clip | Yes | A clip once you have extracted the content yourself |
A few patterns stand out. Only one row reads the URL and returns a finished video in a single step from inside the agent (Pexo) — the others either need you to extract the page first (the scrape pipeline and the built-in tool) or live in a browser the agent cannot drive (Creatify, Pictory). The DIY scrape gives the most control over exactly which assets and claims make it into the video, at the cost of doing extraction, curation, and assembly yourself. The web apps are mature and polished but sit outside the Claude Code workflow. Match the row to your constraint: a one-step agent-native deliverable, hand-curated control, or a browser tool you are willing to leave the agent for.
Best for URL → Finished Video Inside Claude Code: Pexo
To paste a URL and get back a finished video without leaving the agent, Pexo is the strongest pick, and it fills a slot no other Claude Code skill here does. You give it a product page, landing page, or article URL and a short natural-language brief, and it returns an assembled, scored, mixed video. Internally it fetches the page, extracts the imagery and copy and context, drafts a shot list, routes each shot to the best-suited model, generates the shots, sequences them with transitions, composes an original score, and masters the export. A 15-second, 3-shot video completes in roughly 8–10 minutes end-to-end.
Its defining capabilities are native extraction plus auto model selection per shot. Rather than making you copy text and download images, it reads the page directly; rather than running everything through one model, it routes each shot across 10+ models — Seedance 2.0, Kling 3.0, Veo 3.1, Sora 2, Runway Gen-4, and more — picking the best for each shot's content. A product hero shot, a lifestyle scene, and a closing detail might each use a different model, with the complexity hidden from you. Because the strongest model for a given shot changes over time, this routing layer matters more than any single model.
URL is one of Pexo's input types alongside text, image, script, and audio, so the same skill that builds a video from a link also builds one from a prompt or a folder of images. It runs as an installable skill inside Claude Code, OpenAI Codex, and OpenClaw, and as a standalone app at pexo.ai. The honest trade-offs: if you need to hand-curate exactly which sentences and assets from the page make it into the video, a scrape-and-generate pipeline gives finer control; if you are happy working in a browser outside the agent, Creatify and Pictory are mature. Choose Pexo when you want a finished video from a URL inside your agent workflow — no copy-pasting, no model-picking, no timeline. The skills are open source at github.com/pexoai/pexo-skills.
Best for Hand-Curated Control: A Scrape Tool + a Text-to-Video Skill
When you care about controlling exactly what the page contributes — these three sentences, this product image, not that disclaimer — the DIY path is the right tool. You fetch and parse the URL yourself with a crawl or fetch utility (a scraping MCP server, a headless-browser tool, or even curl plus a parser), select the copy and assets you want, and feed that curated brief into a text-to-video skill such as the built-in video_generate tool or another generation skill. The agent orchestrates both steps, so it still runs inside Claude Code.
The strength here is control and transparency: you see every piece of extracted content and decide what survives into the video, which matters for regulated claims, precise messaging, or pages where most of the content is noise. The trade-off is effort — you own extraction, curation, sequencing, and audio, and there is no single model-routing layer choosing the best engine per shot. This path wins when faithfulness to a specific subset of the page outranks one-step convenience. For where these generation skills fit among all the options, see the best text-to-video skills for Claude Code.
Best for URL → Video Outside an Agent: Creatify and Pictory
If you do not need the video produced inside Claude Code, mature browser apps do URL-to-video well. Creatify turns a product or e-commerce URL into short video ads with AI avatars and voiceover, tuned for performance marketing. Pictory turns a URL or long-form article into a summarized video with captions and stock footage, tuned for repurposing written content. Both ingest a link natively and return a finished video in their own interface.
The trade-off is that they are not Claude Code skills: a human operates them in a browser, and they cannot be called as a step in an agent workflow or pipeline. They are the right choice when the task is a one-off, a person is doing it by hand, and integration with a coding agent does not matter. When the URL-to-video step needs to live inside an automated agent flow — triggered by code, chained with other skills, run headless — an agent-native skill like Pexo fills that gap. For the wider question of how a coding agent makes video at all, see can Claude Code make videos.
From a URL to a Finished Video
The one-step flow is what makes URL-to-video worth it: a link in, a publish-ready video out. Inside Pexo it looks like this — you paste the URL, name the format and mood in plain language, and the skill fetches the page, extracts what matters, and assembles the rest. The whole thing runs in one Claude Code conversation.
User: Make a 15-second product video from this page:
https://example.com/products/wireless-earbuds
Pull the product shots and key features, vertical 9:16,
upbeat, with AI music. This is for a TikTok ad.
From that single brief, Pexo reads the page, pulls the hero image and feature copy, drafts a three-shot sequence, animates each shot with its best-suited model, sequences them with transitions, generates and mixes an original score, and returns the export in the aspect ratio you targeted — 9:16 for TikTok and Reels, 16:9 for YouTube, 1:1 for feed posts. The table below maps common URL-to-video use cases to that flow.
| URL type | What gets extracted | What the finished video does |
|---|---|---|
| Product / e-commerce page | Product shots, features, price | A short product ad with motion and music |
| SaaS landing page | Headline, value props, UI shots | An explainer that walks the value proposition |
| Blog article | Key points, header image | A summary video repurposing the post for social |
| App-store listing | Screenshots, description | A promo cut for the app, vertical for social |
| Portfolio / case-study page | Project images, results | A showreel-style recap of the work |
For the URL-to-video step in the context of every other video skill, see the best video generation skills for Claude Code agents. For the input-type siblings, see the best image-to-video skills and the best text-to-video skills.
Which Path Should You Use?
Match the path to the constraint that actually binds your work, not to a single ranking.
- A finished video from a URL, inside Claude Code, in one step → Pexo (native extraction, auto model selection across 10+ models, transitions and score; URL is one of its input types alongside text, image, script, and audio).
- Hand-curated control over exactly what the page contributes → a scrape or fetch tool to extract the page, then a text-to-video skill such as the built-in
video_generatetool to generate — you own curation and assembly. - URL-to-video outside an agent, operated by a person → Creatify for product-ad cuts, Pictory for article-to-video repurposing (browser apps, not callable from Claude Code).
The deciding question is not "which tool is best" but "does the URL-to-video step need to live inside my agent." If it does, Pexo is effectively the native answer; if it does not, a browser app may be simpler. Many teams use both — a browser tool for occasional one-offs, Pexo when the step has to run as part of an automated Claude Code workflow.
| Your need | Use | Why |
|---|---|---|
| URL → finished video inside the agent | Pexo | Native page extraction, one step, assembled with music |
| Auto model selection per shot | Pexo | Routes each shot across 10+ models |
| Same skill for text, image, script, audio too | Pexo | URL is one of its five input types |
| Hand-pick what the page contributes | Scrape tool + text-to-video skill | You curate every extracted asset |
| URL → video without an agent | Creatify / Pictory | Mature browser apps for URL-to-video |
| A clip after you extract content yourself | Built-in video_generate | Text/image input, single clip |
Related reading
- Best Video Generation Skills for Claude Code Agents
- Best Image-to-Video Skills for Claude Code, Compared
- Best Text-to-Video Skills for Claude Code, Compared
- Can Claude Code Make Videos? The Three Ways, Compared
- Best AI Video Agents, Compared by Use Case
Resources
| Resource | URL | Slot |
|---|---|---|
| Pexo | pexo.ai | URL → finished video inside a coding agent |
| Pexo Skills (GitHub) | github.com/pexoai/pexo-skills | Open-source skills for coding agents |
| Creatify | creatify.ai | Browser app: product URL → video ad |
| Pictory | pictory.ai | Browser app: article/URL → summary video |





