Synced from references/shortform-virality.md in content-extraction skill on 2026-05-18. Edit upstream in the skill; this file is overwritten on next sync.

Short-Form Virality Canon: Founder-Monologue Clip Extraction

Programmatic knowledge base for identifying, scoring, and extracting ship-worthy clips from founder transcripts. Built on 2025–2026 authoritative research, sharpened by real-review feedback (see refinement-log.md). Each rule is testable; each signal a script can apply.

Substance over Hook-Aggression — the meta rule

Before any hook-archetype matching, before any Tier-1/2/3 scoring, this rule wins:

A clip exists to deliver substance. The hook is the doorway, not the room.

Three sub-rules, all encoded from real review failures (see refinement-log.md 2026-05-01):

Rule M1 — Hook without setup is a dead hook

A specific-number hook (“Approximately 14 hours.”) sitting at clip-position-0 with no preceding context is not a hook — it’s a riddle. The viewer has no scaffold to receive the number. They swipe.

A hook archetype must be received against context the viewer can ground. Either:

The archetype is self-supplying (a contrarian categorical reframe like “We are not merely building toys” carries its own context). OK to lead with.
Or the hook needs a prior segment supplying the topic. Number-only hooks are the most setup-dependent — never lead with a bare number unless an immediate visual overlay supplies the unit AND the topic.

When in doubt: add the setup segment. A 5-second context-prefix is cheaper than a swipe.

Rule M2 — Substance over runtime

Aggressive reduction (2 segments, 17 seconds, hook-and-close) optimizes for the swipe boundary and sacrifices what makes founder content actually pull: depth, layered reasoning, the texture of a real thought. A 60-second clip with substance pulls harder than a 17-second clip without.

Default target_duration_seconds is 30–90 seconds, not 15–60. Reduce only when reduction sharpens substance — never when it just shortens runtime. YouTube Shorts caps at 60s; TikTok and Reels accept 60s+. Use the headroom.

Rule M3 — Compose, don’t just cut

A clip is a story arc — setup → hook → body → reframe → close — not “find the hook, find the close, concat.” The cut plan’s job is composition, not selection.

Move sentences across distance in the source if it builds a tighter arc. Pull a Q&A answer forward of its question. Glue a reflection at minute 5 to a vision-statement at minute 12 if they’re thematically the same beat. Plan the arc, then pick segments to fill it. Not the other way around.

Valid arc shapes (pick the one that fits the substance, not a default template):

Arc shape	When to use
Cold-open	Hook self-supplies context (contrarian, named-product); body deepens; close promises
Setup-then-hook	Hook needs context; build curiosity in setup; deliver hook mid-clip
Question-then-answer	Q&A material; pose then resolve in 2-3 segments
Number-then-context	Number hook with overlay support; immediately ground in topic
Vision-then-vulnerability-then-promise	Long-form (60-90s); emotional pull; rare but powerful
Vulnerability-then-recovery	Tier-1 Pattern A; hardest pull; deepest stake

Hook Archetypes

Six hook types that work reliably in 2026 short-form, with concrete failure modes a script can detect.

1. Product / Outcome Showcase

Definition: Lead with the finished result, no setup. “You lead with the thing — the product in action, the before-and-after.”

Pattern: Segment opens with a concrete outcome, demo result, or “here’s what we shipped” statement. Zero preamble before the reveal.

Performance: Highest-performing hook type across 34,635 analyzed clips (avg 6,037 views).

Scriptable signal: First sentence contains a tangible result + present/near-past tense + visual proof potential.

Fails when: Setup precedes outcome, or “outcome” is abstract (“I learned something valuable”). Requires tangible proof in first ~2 seconds.

2. Expert Explainer Setup

Definition: Establish authority → tease insight → promise payoff.

Pattern: “I’ve worked in [role] for 12 years. Here’s what nobody tells you about [topic].” Specific number + credential + contrarian claim.

Scriptable signal: Opening contains: role/credential keyword (built / shipped / years / founded) AND a contrarian framing (“nobody tells you,” “the real reason,” “what actually matters”) within first 3 seconds.

Fails when: Credential vague (“I work in tech”) or claim lacks punch. Weak contrarian = audience assumes generic advice.

3. Contrarian Take

Definition: Reject widely-held belief in opening sentence. Cognitive dissonance — brain can’t leave contradictions unresolved.

Pattern: “Everyone says [common advice]. It’s wrong.” Or “Stop doing [popular practice].”

Performance: 156% more For-You-page appearances when framed as a pattern interrupt.

Scriptable signal: Opening contains negation frame (don’t / stop / wrong / myth) + specific belief or action. Must deliver substance within 5 seconds or credibility collapses.

Fails when: False contrarian without supporting evidence. Once trained that a creator makes empty contrarian claims, audiences ignore — and the failure is cumulative.

4. Specific Number Hook

Definition: Odd, specific numbers signal authenticity. “23 pounds in 67 days” beats generic claims.

Pattern: “I wrote 3,247 cold emails. Here are the 4 subject lines that worked.” Or “We grew from 0 to $1.2M ARR in 14 months because of [one insight].”

Scriptable signal: Transcript contains a concrete, non-round number paired with a claim or result. Examples: revenue, user counts, days elapsed, dollars spent, components iterated, errors fixed.

Fails when: Number lacks context, or isn’t paired with insight. Random statistics feel like bragging; paired statistics feel like proof.

5. Curiosity Gap Hook

Definition: Incomplete information that creates a desire to discover the answer.

Pattern: “I tested this one weird trick…” “The real reason most founders fail isn’t what you think…” Or end opening sentence with an unanswered question.

Performance: 234% more completion, 178% higher shares vs direct-statement openings.

Scriptable signal: Opening sentence leaves a void — open question, teased fact, or promised reveal not yet delivered. Must deliver answer by 8s or retention collapses.

Fails when: Gap too wide (payoff feels distant), payoff underwhelming, or viewer guesses the answer in first 3s.

6. Vulnerability Beat (Personal Stakes)

Definition: Founder admits struggle, failure, or counterintuitive experience. Creates emotional permission for honesty.

Pattern: “I almost shut down the company.” “I’ve made this mistake 4 times.” “I was terrified to [X] but here’s what happened.”

Performance: 89% retention improvement when vulnerability is specific and time-stamped (“in 2021,” “after our Series A”).

Scriptable signal: First-person past tense about failure, near-miss, fear, or surprise. Phrases: “I almost,” “I remember thinking,” “I was wrong about,” “I was scared,” “we almost didn’t.”

Fails when: Vulnerability without lesson. Generic struggle (“starting a company was hard”) without specific detail. Or perceived as fishing for sympathy without action.

Retention Mechanics: Second-by-Second Rules

What must happen across the arc for TikTok / Reels / Shorts to hold attention. Algorithmic thresholds, not artistic goals.

0–3s — The Swipe Boundary

Target retention: 80% min (YouTube Shorts benchmark). TikTok algorithm makes its first distribution decision at 1.5s.
Required: Hook communicates in 1.5s (text + visual + spoken word aligned). Pattern interrupt OR immediate value promise.
Signal: Does the segment open with one of the 6 hook archetypes above?
Failure: Slow intro / preamble / context-setting before hook. Every second of pre-hook ≈ 15% retention loss.

3–8s — Hook Delivery & First Context

Target retention: 70%+ (maintenance phase).
Required: Hook promise clarified (question answered, claim explained, direction revealed). Transition feels inevitable, not forced.
Signal: By 8s, viewer should know the clip’s thesis. Ambiguity here kills retention.
Failure: Hook delivered, then silence/delay. Pacing collapse.

8–15s — Proof or Story Escalation

Target retention: 60%+ (midpoint).
Required: Specific evidence (number, screenshot, testimonial quote, before/after) OR narrative escalation (stakes rise, contradiction deepens). Pacing tightens, not plateaus.
Signal: Concrete fact, metric, quote, or plot turn — not filler.
Failure: Repetition of hook without new info. Talking head without visual support. Common retention cliff at 10–12s.

15–30s — Value Delivery / Climax

Target retention: 55%+ (most viewers leave before completion; this is the acceptable-loss phase).
Required: Core lesson, insight, or emotional beat delivered. Setup for ending: resolution or CTA.
Signal: The “why” for watching is satisfied. Viewers now have a reason to complete or share.
Failure: Anticlimactic payoff, generic advice, trailing off.

30–60s — Conclusion / Loop / Finish Strong

Target retention: 70%+ average percentage viewed (short-form-overall benchmark; completion is secondary to APV).
Required: Rewatch hook (loopability) OR clear CTA OR micro-ending (closed-loop, punchline, emotional landing).
Signal: Last 3s feel intentional.
Failure: Rambling into silence, no closure, unclear takeaway.

Founder-Content Levers: What Makes Startup Content Pull

Founder monologues differ from lifestyle / narrative / comedy content. These specific angles make founders go viral in 2026.

L1 — Named Numbers + Founder Attribution

Specificity signals authenticity. “We grew a lot” reads as marketing; “we hit $1.2M ARR because of the 3-email cold outreach sequence” reads as earned data.

Signal: Precise revenue / user / growth / ROI figure + paired with a specific founder action (“we did X, got Y”) + ideally non-round number (23 vs 25, 1.2M vs 1M).

Fails: Numbers without context, or generic attribution.

L2 — Technical Detail as Credibility Proof

When a founder explains implementation detail (API choice, algorithm, schema pattern), it signals they actually built something. AI-startup founders especially: “we prompt-chained X to Y” reads as expertise.

Signal: Brief technical term, tool name, or build step that only an actual builder would know. Examples: “we switched from GPT-4 to Claude for latency reasons,” “the bottleneck was inference cost, so we quantized the model.”

Fails: Buzzword soup without specificity. Technical detail that doesn’t connect to business outcome.

L3 — Building-in-Public Moment

Live decisions, launches, milestones trigger FOMO + credibility simultaneously.

Signal: Present tense or near-past (“we just shipped,” “we’re about to,” “as of yesterday”) + concrete artifact (new feature, user testimonial, metrics movement).

Fails: Vague future tense (“we’re planning to”) without shipped proof.

L4 — Vulnerability + Recovery Pattern

Admitting mistakes or near-deaths (and what was learned) earns trust.

Signal: Specific failure or crisis with timeframe (“in month 4,” “before we pivoted”) + decision in response + outcome.

Fails: Humble-brag without real stakes. “We learned a lot from our mistakes.”

A testimonial with a specific use case or first-name reads as proof.

Signal: Direct quote or paraphrase from a named customer, partner, or user, with a specific use case.

Fails: Attributed quotes with no context. Generic praise.

L6 — Contrarian Bet Explained

Founders explaining why they chose the unpopular path signal conviction.

Signal: Bold claim (“we decided to go vertical instead of horizontal”) + reasoning in 2–3 sentences. High-velocity, not lengthy.

Fails: Contrarian without reasoning.

Platform Deltas: Same Clip, Three Platforms

Signal	TikTok	Instagram Reels	YouTube Shorts
Algorithm priority	Watch-through in first 1.5s; completion rate	Saves & shares > likes; aesthetic consistency	Watch time + search keywords (title/description)
Native style	Raw, trend-aware, quick cuts	Polished, aesthetic, visual consistency	Educational, clear metadata, “how-to” friendly
Optimal length	15–60s (avg swipe at 41s)	15–30s for highest completion	30–60s; keyword in title matters more
Hook requirement	Critical; lands in 1–2s	Important but secondary to visual polish	Important but searchable title compensates
Audio	Trending sounds boost discovery; original audio now performs 156% better than before	Can use trending audio, but watermarked TikTok reposts get punished	Title/thumbnail > audio
Captions	In-app overlays; on-screen text +30% retention	Visual-overlay consistency; on-screen text expected	Highly important; many watch muted
Repurposing cost	Highest; TikTok-watermarked Reels get punished	High; IG detects and deprioritizes TikTok watermarks	Low; can upload clean vertical video
Failure mode	Hook misses first 2s → cold-start, no distribution	Production-value + weak hook = siloed to followers	Poor metadata (generic title) → no search traffic

Per-platform adapt:

TikTok version: Use trending sound when relevant, max contrast + quick cuts in first 3s.
Reels version: Polish visual consistency, strip watermarks, lean save/share CTAs.
Shorts version: Optimize title + description with keywords (founder/builder/startup topic), lean educational (“How we X’d our Y by Z%”).

Brand-Voice Tension: When to Bend, When to Hold

Founder monologues carry brand voice — but platforms reward platform-native voice. Tradeoff: earned authenticity vs algorithm score.

Hold brand voice when:

Founder has accumulated audience trust (100K+ followers, >1% engagement across 20+ clips).
Contrarian position already established.
Niche audience expects long-form depth (B2B SaaS, deep-tech, academic — more patient).

Bend toward platform when:

Clip is first/early in a series (audience doesn’t yet know the founder’s voice).
Goal is discovery/reach over brand tone.
Founder’s brand is itself about being authentic-to-platform.

Hybrid (recommended for A Friend / Daniel):

Hook (0–3s): Use platform-native archetypal hook (pattern interrupt, curiosity gap, specific number).
Body (3–30s): Founder’s voice — authentic vocabulary, pacing, examples. Daniel: keep German/English code-switching here.
Ending (30–60s): Reinforce brand position or learning, not a virality hack.

For A Friend specifically: brand voice rules from _brand/voice-preservation.md are non-negotiable in body + ending. Hook can be slightly bent toward platform-native if it makes the clip land — but the bend must not violate banned vocabulary or break the two-beat cadence.

Segment-Selection Heuristics: Scriptable “Ship-Worthy” Signals

Given a transcript, these signals identify segments to clip.

Tier 1 — High Confidence (script SHOULD flag)

Combinations nearly guaranteed to work.

Pattern A — Vulnerability + Specific Number + Outcome

Contains: “I was [scared/wrong/nearly]” + timeframe/dollar/count + “so we [did X]” + result.
Example: “I almost shut the company down in month 4 when we were burning $40 K / m o n t h . W e p i v o t e d t h ee n t i re GTM s t r a t e g y . T h ree m o n t h s l a t er w e hi t$ 500K ARR.”
Recommendation: SHIP.

Pattern B — Bold Contrarian + Expert Credential + Specific Example

Contains: “Everyone says [belief]” / “common advice is wrong” + credential + concrete example.
Example: “Everyone says you need VC funding to scale. I’ve bootstrapped 4 companies. We grew this one to $2M ARR without a single dollar of venture capital.”
Recommendation: SHIP.

Pattern C — Shipped Artifact + Named Metric

Contains: “We just launched [X]” / “as of yesterday” + early-user metric or % delta.
Example: “We shipped the new vector search this morning. Early beta users say it reduced their inference latency by 60%.”
Recommendation: SHIP.

Pattern D — Technical Detail + Business Outcome

Contains: tool/architecture/API choice + reason + outcome.
Example: “We switched from GPT-4 to Claude 3.5 Sonnet because we needed faster outputs for our chat interface. Turns out our users prefer the response style anyway.”
Recommendation: SHIP.

Tier 2 — Medium Confidence (script SHOULD CONSIDER)

One or two signals; likely to work but not guaranteed.

Pattern E — Specific Number Alone: single claim with precise non-round number. Medium-high — needs narrative arc or contrast to be great.

Pattern F — Contrarian Claim Alone: bold opening, proof in second sentence. Medium — risky if explanation fails.

Pattern G — Customer Moment: quoted/described customer experience. Medium — engaging but generic without name + use case + revenue impact.

Tier 3 — Lower Confidence (script MAY skip or mark CANDIDATE)

Pattern H — Lesson/Advice Standalone: pure teaching without personal story or proof. Low-medium — depends on existing credibility.

Pattern I — Vague Aspiration / Future Tense: “we’re planning to,” “our goal is” without shipped proof. Low.

Pattern J — Generic Gratitude / Filler: thank-you, conversational filler. Skip.

Scoring Rubric

Signals present	Recommendation
3+ Tier-1 signals	SHIP — high confidence
2 Tier-1 signals	SHIP — strong clip
1 Tier-1 + 2+ Tier-2	SHIP — good clip
2+ Tier-2 only	CONSIDER — needs review
1 Tier-2 only	HOLD — re-evaluate in context
Tier-3 only	SKIP unless niche-specific

Mapping to `hook_strength` (1–5) in cut-plan.json

5 = Tier-1 Pattern A/B/C/D AND opens with a clear hook archetype (1.5s rule met).
4 = Tier-1 pattern present but hook is in the body, requires reorder to lead.
3 = Tier-2 pattern with narrative arc; needs supporting B-roll to hold retention.
2 = Tier-2 alone, no clear hook; CONSIDER but expect lower pull.
1 = Tier-3; skip unless contextual reason to ship.

Topics for deeper research as the skill matures:

Niche-specific hook variance — does founder/builder content respond differently to curiosity-gap vs contrarian vs vulnerability hooks across niches (fintech / climate / deep-tech)? Early signal: vulnerability beats outperform in deep-tech. Needs 5+ niche validation.
Cross-platform serialization — when a founder posts the same clip TikTok → Reels → Shorts, does sequence order matter for audience journey? (Reels performs better if they’ve seen TikTok first?) Compounding effect unexplored.
AI-startup content premium — does AI/LLM/agentic-systems content have higher retention floors? Sample data shows 8.2% engagement vs 2.8% TikTok baseline. Needs control for founder-credibility confound.
A Friend brand-voice eval — once 20+ clips are shipped from this skill, build a feedback loop scoring which clips Daniel approved unchanged vs which he edited. The edit pattern reveals where the skill’s brand-voice match is weakest.
German-language short-form deltas — most virality data is English-centric. Daniel’s clips will be German/English mixed. Open question: do the same hook archetypes pull on German-speaking TikTok / Reels audiences? Needs separate research pass.

A Friend — Distribution

Explorer

references__shortform-virality

Short-Form Virality Canon: Founder-Monologue Clip Extraction

Substance over Hook-Aggression — the meta rule

Rule M1 — Hook without setup is a dead hook

Rule M2 — Substance over runtime

Rule M3 — Compose, don’t just cut

Hook Archetypes

1. Product / Outcome Showcase

2. Expert Explainer Setup

3. Contrarian Take

4. Specific Number Hook

5. Curiosity Gap Hook

6. Vulnerability Beat (Personal Stakes)

Retention Mechanics: Second-by-Second Rules

0–3s — The Swipe Boundary

3–8s — Hook Delivery & First Context

8–15s — Proof or Story Escalation

15–30s — Value Delivery / Climax

30–60s — Conclusion / Loop / Finish Strong

Founder-Content Levers: What Makes Startup Content Pull

L1 — Named Numbers + Founder Attribution

L2 — Technical Detail as Credibility Proof

L3 — Building-in-Public Moment

L4 — Vulnerability + Recovery Pattern

L6 — Contrarian Bet Explained

Platform Deltas: Same Clip, Three Platforms

Brand-Voice Tension: When to Bend, When to Hold

Segment-Selection Heuristics: Scriptable “Ship-Worthy” Signals

Tier 1 — High Confidence (script SHOULD flag)

Tier 2 — Medium Confidence (script SHOULD CONSIDER)

Tier 3 — Lower Confidence (script MAY skip or mark CANDIDATE)

Scoring Rubric

Mapping to `hook_strength` (1–5) in cut-plan.json

Refinement Backlog

Sources

Graph View

Table of Contents

Backlinks

A Friend — Distribution

Explorer

references__shortform-virality

Short-Form Virality Canon: Founder-Monologue Clip Extraction

Substance over Hook-Aggression — the meta rule

Rule M1 — Hook without setup is a dead hook

Rule M2 — Substance over runtime

Rule M3 — Compose, don’t just cut

Hook Archetypes

1. Product / Outcome Showcase

2. Expert Explainer Setup

3. Contrarian Take

4. Specific Number Hook

5. Curiosity Gap Hook

6. Vulnerability Beat (Personal Stakes)

Retention Mechanics: Second-by-Second Rules

0–3s — The Swipe Boundary

3–8s — Hook Delivery & First Context

8–15s — Proof or Story Escalation

15–30s — Value Delivery / Climax

30–60s — Conclusion / Loop / Finish Strong

Founder-Content Levers: What Makes Startup Content Pull

L1 — Named Numbers + Founder Attribution

L2 — Technical Detail as Credibility Proof

L3 — Building-in-Public Moment

L4 — Vulnerability + Recovery Pattern

L5 — Customer Quote or Social Proof (With Names)

L6 — Contrarian Bet Explained

Platform Deltas: Same Clip, Three Platforms

Brand-Voice Tension: When to Bend, When to Hold

Segment-Selection Heuristics: Scriptable “Ship-Worthy” Signals

Tier 1 — High Confidence (script SHOULD flag)

Tier 2 — Medium Confidence (script SHOULD CONSIDER)

Tier 3 — Lower Confidence (script MAY skip or mark CANDIDATE)

Scoring Rubric

Mapping to hook_strength (1–5) in cut-plan.json

Refinement Backlog

Sources

Graph View

Table of Contents

Backlinks

Mapping to `hook_strength` (1–5) in cut-plan.json