Getting started with faceless videos
Schedulin's faceless video generator turns a written script into a finished short video — voiceover, captions, and visuals included — and hands it off to the scheduler so you can publish across TikTok, Instagram Reels, YouTube Shorts, and more.
This article walks through your first render end-to-end.
Before you start
You need:
- An active Schedulin account (a free 7-day trial is enough).
- A script between 20 and 1,200 characters. This usually translates to a 15-90 second video.
- At least one connected social account if you want to schedule directly. You can also download the rendered video.
Step 1 — Open the faceless editor
From your Schedulin dashboard, click Create → Faceless video. This opens the editor with the script field, voice picker, engine selector, and caption style.
Step 2 — Paste your script
Paste the full narration in the script field. Schedulin uses this exact text for both the voiceover and the on-screen karaoke captions, so write it as you'd want it spoken.
A few rules:
- Keep it conversational. AI voices read literally — they'll say "U.S." as "you ess" if you abbreviate.
- One sentence per line is fine. Pauses come from punctuation.
- Avoid emoji and markdown — they'll be read aloud.
Step 3 — Pick a voice
Schedulin ships with six ElevenLabs voices out of the box (Rachel, Sarah, Domi, Antoni, Arnold, Adam). Click Preview next to any voice to hear a sample.
If you've cloned your own voice in ElevenLabs, paste the voice ID into the custom voice field.
Step 4 — Choose an engine
You'll see three engine options. The right one depends on your niche — see Choosing an engine for the full breakdown.
Quick rules of thumb:
- Stock footage ($0.30/video) — best for daily uploads, motivation, fun facts, finance.
- AI stills ($0.50/video) — best for storytelling niches like horror or history.
- Veo3 ($1/second) — premium hero clips only.
Step 5 — Pick captions and aspect ratio
Three caption styles are available:
- Bold yellow — high-contrast, classic motivation/finance style.
- TikTok white — clean white text with a soft drop shadow.
- Minimal — small, unobtrusive, for cinematic content.
Aspect ratio defaults to 9:16 (TikTok / Shorts / Reels). Switch to 1:1 for square Instagram feed posts or 16:9 for YouTube long-form.
Step 6 — Render
Click Generate. Schedulin runs a four-stage pipeline: voiceover → footage selection → scene composition → final render. Stock footage renders in about 90 seconds; AI stills take ~2 minutes; Veo3 takes ~3 minutes.
You'll see a progress bar. You can leave the editor and come back — the render continues in the background.
Step 7 — Schedule or download
When the render completes, you can:
- Send to scheduler — opens the post composer with the video pre-attached. Pick the platforms and a publish time.
- Download — grab the MP4 to use elsewhere.
That's it. Your first faceless video is live (or queued).