← Back to feature backlog

πŸ€– AI Avatar β€” for the camera-shy realtor

Upload one headshot. Get an AI avatar of yourself. Generate videos of that avatar speaking your scripts β€” in your voice (cloned via ElevenLabs). Realtors who freeze on camera become content machines. Two separate credit pools (creation + per-video) make this a serious upsell layer on top of subscription.

Wave 2 Β· 🟒 Later Heygen v1 β†’ Higgsfield v2 F-070 Β· F-071 Β· F-072 Β· F-073
The goal

Unlock the segment of realtors who refuse to film themselves.

Roughly 40% of realtors say "I'd post more video, but I hate being on camera." That's a massive segment we lose to camera anxiety. AI avatars solve it β€” same realtor, same voice, no actual recording. Script Generator writes the words β†’ ElevenLabs clones the voice β†’ Avatar speaks them. Done in 90 seconds. Realtor doesn't even open the camera app.

+ 40% segment
The "won't be on camera" cohort becomes addressable. Big TAM unlock.
2 credit pools
Avatar create (one-time per avatar) + Avatar video (per generation). Separate buckets, separate top-ups.
+ Script funnel
Funnels straight from Script Generator β†’ Avatar speaks it. Already-built pipeline.
+ Tripod kit alt
For realtors who don't even want to use the AI Tripod, this is the alternative content engine.
Tech recommendation Β· F-070

Heygen API for v1 (ship in weeks). Higgsfield + NanoBanana for v2 (own the margin).

Don't build avatars from scratch in v1 β€” ship fast on Heygen's API while we validate demand. Once we have signal (volume of avatar credits being purchased), evaluate moving to Higgsfield + NanoBanana for in-house generation. The math: Heygen takes ~$5–15 per avatar minute; in-house can do it for $1–3 once we own the pipeline. Phase the build to validate first, optimize after.

Higgsfield + NanoBanana
v2 Β· Own the margin
  • Pros: Per-minute cost drops 5–10Γ—. Differentiated quality if we tune the model. Long-term moat.
  • Cons: Months of engineering. Quality risk during ramp.

Migrate to in-house once Heygen volume justifies the engineering investment. Run them in parallel for 60 days; cut over once quality matches.

Voice cloning (both v1 + v2)
ElevenLabs Β· paid tier
Same voice clone whether the avatar is rendered by Heygen or in-house. Lock in voice provider; swap out the rendering provider underneath. ~$5/mo per cloned voice.
Avatar creation flow Β· F-070 Β· F-071

3 steps: upload β†’ generate β†’ ready

First-time setup. Realtor uploads 1 headshot + records a 30-second voice sample (so we can clone). System processes for ~10 minutes. Done β€” they have a working avatar attached to their account. Burns 1 Avatar Create credit.

1

πŸ“· Upload headshot

1 photo from Brand Profile. We use the best one we have.

β†’
2

πŸŽ™ 30-sec voice sample

Read our short paragraph. ElevenLabs clones the voice.

β†’
3

⏱ ~10 min processing

Heygen renders the avatar + ElevenLabs trains the voice.

β†’
βœ“

βœ… Avatar ready

Live in your account. Generate videos anytime.

Avatar video generation Β· F-072 Β· F-073

Pick a script β†’ click Generate β†’ 60 seconds later, video

The everyday flow. Realtor opens AI Avatar, picks a script (or generates one with Script Generator), clicks Generate. 60 seconds later, the video is rendered. Burns 1 Avatar Video credit. Then 4 destinations: download, schedule directly, send to Creative Studio for AI editing (captions, B-roll), or send to us as a project.

app.socialrealtr.com/ai-avatar
πŸ€– AI Avatar
πŸ€– 1 avatar create πŸŽ₯ 5 avatar video
SK
YOUR AVATAR
SK
Sarah Β· default voice
VARIATION
SK
Sarah Β· enthusiastic
+
Create another
Burns 1 create credit

πŸ“œ Script for the avatar to speak

SCRIPT Β· 60 SEC Β· TOM FERRY VOICE
Most agents lead with the granite countertops. Wrong move. When I tour a buyer through a $1.4M listing in West Surrey, I don't start with the kitchen. I start with the school catchment, the commute, and the fact that this neighborhood appreciated 12% last year…

Background

Length

~60 seconds to render. We'll notify you when it's ready.

πŸ“Ί Latest video preview

PREVIEW
0:58
SK
β–Ά
"Most agents lead with the granite countertops. Wrong move..."
1
2
3
4
5
1Credit balances in top bar. Two pools visible β€” Avatar Create (rare, 1 per realtor usually) + Avatar Video (each gen burns 1). Same pattern as Studio's credit bar.
2Multiple avatar variations. Same person, different "moods" or wardrobes. Each variation is its own avatar create burn. Power users will want a "default" + "enthusiastic" + "casual" set.
3Script slot funnels from Script Generator. Three sources: open Script Generator (with the funnel handoff from F-066), paste own, pick from saved. Same UX as the recorder's script intake.
4Generate button shows credit cost upfront. Burn-confirmation modal fires on click (per the credits mockup pattern). Never burn silently.
5Inline preview after generation. Watermarked PREVIEW until they pick a destination. Regenerate is free if they hate it within 60 seconds (1 free retry per generation β€” Heygen's policy).
Output destinations Β· F-073

Four paths after the avatar speaks

Same destination model as Creative Studio + Recorder output. One mental model across the platform β€” what you do with a finished asset is always the same four options.

β–’
Send as project
Our editor adds B-roll Β· burns 1 project credit
✨
Send to Studio
AI editor adds captions Β· burns 1 studio video credit
↓
Download
Use it however you want
Brand Profile integration

Headshot pool feeds avatar variations (F-052)

If the realtor has 5 headshots in Brand Profile, they can spin up 5 different avatars from the same person β€” different moods, outfits, settings. F-052 (headshot expansion) generates additional expression variations from each base headshot. Result: a library of "Sarah" personas the realtor can pick from per video.

This is why Brand Profile depth (F-021–F-030) and AI Avatar are linked. The richer the Brand Profile, the better the avatars feel.

Open questions for Trent