Synthesia for sales outreach scales personalization where live-record video cannot. Done right, response rates beat plain-text cold email by 40-80%.
Who this is for
Sales teams running cold outreach at volume. Founders doing personalized outbound to mid-market accounts. Agencies running prospecting for clients who want video without the live-record burden.
What you'll need
Step 1
Synthesia supports personalization variables. Pick 2-3 high-impact variables before scripting.
Variables: first name, company name, role, recent triggering event (funding, hire, launch).
Two variables is the sweet spot. One feels minimal; four feels manipulative.
Best pairs: first name + company name. Or first name + recent event.
Document the variables in your CRM. Synthesia pulls them from a CSV or API connection.
Step 2
45-60 second scripts beat 90+ second scripts for cold outreach. Tight, specific, one CTA.
Opening (5-10s): "Hey {first_name}, I made this quick video specifically for you and {company_name}."
Hook (10-20s): one specific observation about their company or industry. The more specific, the higher the reply rate.
Value (20-40s): one sentence about what you do and why it might matter to them.
CTA (40-60s): one specific ask. "15-minute call next Tuesday?" or "Reply with 'send it' for the deck." Not "let me know if interested."
Total length: 45-60 seconds. Longer videos get skipped past the hook.
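A minimal sketch of checking a script against the 45-60 second target before rendering. The speaking rate of 150 words per minute is an assumption, not a Synthesia figure, and the bracketed sections in the template are placeholders for your own copy. The function names are illustrative.

```python
import string

# Assumption: ~150 spoken words per minute. Measure your avatar's real pace.
WORDS_PER_MINUTE = 150

# Illustrative script using the two variables from Step 1.
SCRIPT_TEMPLATE = (
    "Hey {first_name}, I made this quick video specifically for you "
    "and {company_name}. "
    "[Hook: one specific observation.] "
    "[Value: one sentence on what you do.] "
    "[CTA: one specific ask.]"
)


def template_variables(template: str) -> set[str]:
    """Return the placeholder names used in a str.format-style template."""
    return {name for _, name, _, _ in string.Formatter().parse(template) if name}


def render_script(template: str, values: dict[str, str]) -> str:
    """Fill the template, failing loudly if any variable is missing."""
    missing = template_variables(template) - set(values)
    if missing:
        raise ValueError(f"missing variables: {sorted(missing)}")
    return template.format(**values)


def estimate_seconds(script: str, wpm: int = WORDS_PER_MINUTE) -> float:
    """Estimate spoken duration from word count."""
    return len(script.split()) / wpm * 60


def fits_cold_outreach(script: str, low: float = 45, high: float = 60) -> bool:
    """True if the estimated duration lands in the 45-60 second window."""
    return low <= estimate_seconds(script) <= high
```

Run the check on the rendered script for a prospect with a long company name, not on the raw template, since the filled variables change the word count.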
Step 3
The avatar should match what your ideal customer profile (ICP) expects the sender of this video to look like. Age, dress, and tone all signal credibility.
B2B SaaS sellers: business casual avatar, friendly but professional.
Enterprise sales: more formal, confident, age-aligned with senior buyers.
Startup-to-startup: more casual, energetic, founder-like.
Test 3-5 avatars with the same script on a small batch (10 prospects each). Reply rate variance can be 2-3x between avatars.
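The avatar test above can be tallied with a few lines. This is a sketch that assumes you export one (avatar, replied) pair per send from your sequence tool. The avatar names in the usage note are made up.

```python
from collections import Counter


def reply_rates(results: list[tuple[str, bool]]) -> dict[str, float]:
    """Map avatar name -> reply rate, given one (avatar, replied) pair per send."""
    sends: Counter[str] = Counter()
    replies: Counter[str] = Counter()
    for avatar, replied in results:
        sends[avatar] += 1
        replies[avatar] += int(replied)
    return {avatar: replies[avatar] / sends[avatar] for avatar in sends}


def best_avatar(results: list[tuple[str, bool]]) -> str:
    """Return the avatar with the highest reply rate."""
    rates = reply_rates(results)
    return max(rates, key=rates.get)
```

For example, `reply_rates([("formal", True), ("formal", False), ("casual", False), ("casual", False)])` gives a rate per avatar. With 10 sends per avatar, a single reply moves the rate by 10 points, so treat the ranking as a rough signal.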
Step 4
Synthesia supports CSV uploads with personalization variables. Render 20-50 videos from one script template in 30-45 minutes.
Prepare a CSV: columns for first_name, company_name, any other variables. 20-50 rows per batch.
In Synthesia, go to Video → "Create from data", or use the API for higher volume.
Upload CSV. Synthesia generates one video per row, with variables filled.
Rendering 50 personalized videos takes 30-60 minutes depending on plan tier.
Each video gets a unique URL. Add the URL to your CRM or sequence tool (Apollo, Outreach, Lemlist) for sending.
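A minimal sketch of preparing the upload files, using only the Python standard library. It splits a prospect list into batches of up to 50 rows and rejects rows with empty variables, since an empty cell would render as a gap in the script. The column names follow the step above. The upload itself is not shown, and nothing here calls Synthesia.

```python
import csv
import io
from typing import Iterator

FIELDNAMES = ["first_name", "company_name"]  # columns named in the step above
BATCH_SIZE = 50  # upper end of the 20-50 rows per batch suggested above


def batches(
    rows: list[dict[str, str]], size: int = BATCH_SIZE
) -> Iterator[list[dict[str, str]]]:
    """Yield consecutive slices of at most `size` rows."""
    for start in range(0, len(rows), size):
        yield rows[start:start + size]


def to_csv(rows: list[dict[str, str]]) -> str:
    """Serialize one batch to CSV text, refusing rows with empty variables."""
    for i, row in enumerate(rows):
        empty = [f for f in FIELDNAMES if not row.get(f, "").strip()]
        if empty:
            raise ValueError(f"row {i} has empty fields: {empty}")
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=FIELDNAMES, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Write each batch to its own file, for example `batch_01.csv`, so a failed render only costs you one batch.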
Step 5
Email body: minimal text + video thumbnail link. The thumbnail should show the avatar with the prospect's name overlay.
Email body structure: 1-2 sentence opener mentioning the video → thumbnail with play button → 1-sentence CTA below the thumbnail.
Thumbnail: Synthesia auto-generates one. Customize to show the avatar with text overlay "Hey {first_name}" or similar.
Subject line: short, specific, intriguing. "Quick video for {company_name}" or "{first_name}, 45 seconds on {their_priority}."
Track: video view rate, video completion rate, reply rate. Treat them as separate funnels.
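The email structure above (opener, linked thumbnail, one-line CTA) can be sketched as a small builder. The URLs are hypothetical inputs you would take from your CRM, and the opener and CTA wording are examples. Most sequence tools have their own template syntax, so treat this as an illustration of the structure, not a drop-in integration.

```python
from html import escape


def build_email(
    first_name: str, company_name: str, video_url: str, thumbnail_url: str
) -> dict[str, str]:
    """Return a subject line and a minimal HTML body: opener, thumbnail, CTA."""
    subject = f"Quick video for {company_name}"
    opener = f"Hi {first_name}, I made a short video for you and {company_name}."
    cta = "Open to a 15-minute call next Tuesday?"
    alt_text = f"Video for {first_name}"
    # escape() keeps names like "Acme & Co" from breaking the markup.
    html_body = (
        f"<p>{escape(opener)}</p>\n"
        f'<a href="{escape(video_url)}">'
        f'<img src="{escape(thumbnail_url)}" alt="{escape(alt_text)}"></a>\n'
        f"<p>{escape(cta)}</p>"
    )
    return {"subject": subject, "html": html_body}
```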
Step 6
Don't optimize the wrong metric. View rate is interesting; reply rate is the goal.
Run each script template for 50-100 sends before judging.
Track: send → open → video view → video complete → reply → meeting booked.
View rate without reply rate means the video is interesting but not actionable. Tighten the CTA.
Reply rate without meeting rate means the CTA is good but the offer is wrong. Refine the offer.
Iterate the script monthly based on the funnel. Avatars and visuals tend to stay constant; scripts evolve.
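The funnel above can be read as step-to-step conversion rates, which makes the weak stage easy to spot. A minimal sketch follows. The stage keys are illustrative names, and the counts come from whatever your sequence tool and video analytics report.

```python
# Stage names mirror the funnel above; the keys themselves are illustrative.
STAGES = ["send", "open", "video_view", "video_complete", "reply", "meeting_booked"]


def step_rates(counts: dict[str, int]) -> dict[str, float]:
    """Conversion rate from each funnel stage to the next one."""
    rates: dict[str, float] = {}
    for prev, cur in zip(STAGES, STAGES[1:]):
        rates[f"{prev}->{cur}"] = counts[cur] / counts[prev] if counts[prev] else 0.0
    return rates


def weakest_step(counts: dict[str, int]) -> str:
    """Name of the step with the lowest conversion rate."""
    rates = step_rates(counts)
    return min(rates, key=rates.get)
```

A low `video_complete->reply` rate points at the CTA, and a low `reply->meeting_booked` rate points at the offer, matching the two diagnoses above.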
Common mistakes
90-second outreach videos
What goes wrong: Average view time is 25-35 seconds. Anything past that is unwatched. Your CTA at second 75 is invisible to 90% of prospects.
How to avoid: 45-60 second scripts. CTA before second 50. Long video for nurture, short for cold.
Surface-level personalization
What goes wrong: "Hey {first_name}" without anything else specific feels mail-merge. Reply rate barely exceeds plain text.
How to avoid: Add one specific observation per video. Even a shared variable plus light specificity ("Hey Sarah, congrats on the Series B last month") works.
Generic CTAs
What goes wrong: "Let me know if interested" earns near-zero replies. Prospect has no clear next step.
How to avoid: Specific CTA with date and action. "15-minute call next Tuesday?" "Reply with 'send the deck' if curious."
Sending in massive batches without testing
What goes wrong: First batch is 500 videos at once. Reply rate is 0.5%. You burn through your prospecting list before realizing the script does not work.
How to avoid: Test 50 sends with each script variant before scaling. 50 sends gives enough signal to know if the script converts.
No follow-up sequence
What goes wrong: Video sent → no reply → prospect forgotten. 80% of replies come from emails 2-4 in a sequence. Single-send video outreach misses most of the available conversion.
How to avoid: Video as send 1 or 2. Plain-text follow-ups as sends 3-5. Final break-up email as send 6. Sequences outperform single sends 4-7x.
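The six-send sequence described above can be written down as data before you build it in a sequence tool. This is a sketch under one assumption the text does not settle: any send before send 6 that is not the video is treated as plain text. Spacing between sends is left out because the text gives none.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    number: int
    kind: str  # "video", "plain_text", or "break_up"


def build_sequence(video_position: int = 1) -> list[Step]:
    """Six sends: video at position 1 or 2, break-up last, plain text otherwise."""
    if video_position not in (1, 2):
        raise ValueError("video should be send 1 or 2")
    steps = []
    for number in range(1, 7):
        if number == video_position:
            kind = "video"
        elif number == 6:
            kind = "break_up"
        else:
            kind = "plain_text"  # assumption for any non-video send before 6
        steps.append(Step(number, kind))
    return steps
```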
Recap
Hand it off
Sales outreach video at volume is exactly where a video specialist + your SDR team produces compounding returns. EverestX video specialists run weekly outreach video production for $400-800/mo, typically lifting reply rates 30-80% versus plain-text outreach.
Industry benchmark for cold email is 1-3% reply. Synthesia video outreach typically lifts to 3-6%, sometimes 8-10% with strong personalization and ICP-fit. Anything below 2% means the script or targeting is off.
Most prospects do not mind an AI avatar if the video provides value. Some prefer it because it feels less invasive than a 'real' video. Be transparent in the script: "I made this quick video for you", and let them assume what they want.
You can use your own face on the Enterprise plan, where Synthesia offers Personal Avatar. For volume outreach where personality matters, this can lift reply rates further. See the voice-cloning tutorial.
BombBomb and Loom require you to record each video personally. Synthesia lets you generate 50 personalized videos from one script template in roughly 30-60 minutes. For volume, Synthesia wins. For warm follow-ups, live recording still wins.