Synced from
references/shortform-virality.mdincontent-extractionskill on 2026-05-18. Edit upstream in the skill; this file is overwritten on next sync.
Short-Form Virality Canon: Founder-Monologue Clip Extraction
Programmatic knowledge base for identifying, scoring, and extracting ship-worthy clips from founder transcripts. Built on 2025–2026 authoritative research, sharpened by real-review feedback (see
refinement-log.md). Each rule is testable; each signal a script can apply.
Substance over Hook-Aggression — the meta rule
Before any hook-archetype matching, before any Tier-1/2/3 scoring, this rule wins:
A clip exists to deliver substance. The hook is the doorway, not the room.
Three sub-rules, all encoded from real review failures (see refinement-log.md 2026-05-01):
Rule M1 — Hook without setup is a dead hook
A specific-number hook (“Approximately 14 hours.”) sitting at clip-position-0 with no preceding context is not a hook — it’s a riddle. The viewer has no scaffold to receive the number. They swipe.
A hook archetype must be received against context the viewer can ground. Either:
- The archetype is self-supplying (a contrarian categorical reframe like “We are not merely building toys” carries its own context). OK to lead with.
- Or the hook needs a prior segment supplying the topic. Number-only hooks are the most setup-dependent — never lead with a bare number unless an immediate visual overlay supplies the unit AND the topic.
When in doubt: add the setup segment. A 5-second context-prefix is cheaper than a swipe.
Rule M2 — Substance over runtime
Aggressive reduction (2 segments, 17 seconds, hook-and-close) optimizes for the swipe boundary and sacrifices what makes founder content actually pull: depth, layered reasoning, the texture of a real thought. A 60-second clip with substance pulls harder than a 17-second clip without.
Default target_duration_seconds is 30–90 seconds, not 15–60. Reduce only when reduction sharpens substance — never when it just shortens runtime. YouTube Shorts caps at 60s; TikTok and Reels accept 60s+. Use the headroom.
Rule M3 — Compose, don’t just cut
A clip is a story arc — setup → hook → body → reframe → close — not “find the hook, find the close, concat.” The cut plan’s job is composition, not selection.
Move sentences across distance in the source if it builds a tighter arc. Pull a Q&A answer forward of its question. Glue a reflection at minute 5 to a vision-statement at minute 12 if they’re thematically the same beat. Plan the arc, then pick segments to fill it. Not the other way around.
Valid arc shapes (pick the one that fits the substance, not a default template):
| Arc shape | When to use |
|---|---|
| Cold-open | Hook self-supplies context (contrarian, named-product); body deepens; close promises |
| Setup-then-hook | Hook needs context; build curiosity in setup; deliver hook mid-clip |
| Question-then-answer | Q&A material; pose then resolve in 2-3 segments |
| Number-then-context | Number hook with overlay support; immediately ground in topic |
| Vision-then-vulnerability-then-promise | Long-form (60-90s); emotional pull; rare but powerful |
| Vulnerability-then-recovery | Tier-1 Pattern A; hardest pull; deepest stake |
Hook Archetypes
Six hook types that work reliably in 2026 short-form, with concrete failure modes a script can detect.
1. Product / Outcome Showcase
Definition: Lead with the finished result, no setup. “You lead with the thing — the product in action, the before-and-after.”
Pattern: Segment opens with a concrete outcome, demo result, or “here’s what we shipped” statement. Zero preamble before the reveal.
Performance: Highest-performing hook type across 34,635 analyzed clips (avg 6,037 views).
Scriptable signal: First sentence contains a tangible result + present/near-past tense + visual proof potential.
Fails when: Setup precedes outcome, or “outcome” is abstract (“I learned something valuable”). Requires tangible proof in first ~2 seconds.
2. Expert Explainer Setup
Definition: Establish authority → tease insight → promise payoff.
Pattern: “I’ve worked in [role] for 12 years. Here’s what nobody tells you about [topic].” Specific number + credential + contrarian claim.
Scriptable signal: Opening contains: role/credential keyword (built / shipped / years / founded) AND a contrarian framing (“nobody tells you,” “the real reason,” “what actually matters”) within first 3 seconds.
Fails when: Credential vague (“I work in tech”) or claim lacks punch. Weak contrarian = audience assumes generic advice.
3. Contrarian Take
Definition: Reject widely-held belief in opening sentence. Cognitive dissonance — brain can’t leave contradictions unresolved.
Pattern: “Everyone says [common advice]. It’s wrong.” Or “Stop doing [popular practice].”
Performance: 156% more For-You-page appearances when framed as a pattern interrupt.
Scriptable signal: Opening contains negation frame (don’t / stop / wrong / myth) + specific belief or action. Must deliver substance within 5 seconds or credibility collapses.
Fails when: False contrarian without supporting evidence. Once trained that a creator makes empty contrarian claims, audiences ignore — and the failure is cumulative.
4. Specific Number Hook
Definition: Odd, specific numbers signal authenticity. “23 pounds in 67 days” beats generic claims.
Pattern: “I wrote 3,247 cold emails. Here are the 4 subject lines that worked.” Or “We grew from 0 to $1.2M ARR in 14 months because of [one insight].”
Scriptable signal: Transcript contains a concrete, non-round number paired with a claim or result. Examples: revenue, user counts, days elapsed, dollars spent, components iterated, errors fixed.
Fails when: Number lacks context, or isn’t paired with insight. Random statistics feel like bragging; paired statistics feel like proof.
5. Curiosity Gap Hook
Definition: Incomplete information that creates a desire to discover the answer.
Pattern: “I tested this one weird trick…” “The real reason most founders fail isn’t what you think…” Or end opening sentence with an unanswered question.
Performance: 234% more completion, 178% higher shares vs direct-statement openings.
Scriptable signal: Opening sentence leaves a void — open question, teased fact, or promised reveal not yet delivered. Must deliver answer by 8s or retention collapses.
Fails when: Gap too wide (payoff feels distant), payoff underwhelming, or viewer guesses the answer in first 3s.
6. Vulnerability Beat (Personal Stakes)
Definition: Founder admits struggle, failure, or counterintuitive experience. Creates emotional permission for honesty.
Pattern: “I almost shut down the company.” “I’ve made this mistake 4 times.” “I was terrified to [X] but here’s what happened.”
Performance: 89% retention improvement when vulnerability is specific and time-stamped (“in 2021,” “after our Series A”).
Scriptable signal: First-person past tense about failure, near-miss, fear, or surprise. Phrases: “I almost,” “I remember thinking,” “I was wrong about,” “I was scared,” “we almost didn’t.”
Fails when: Vulnerability without lesson. Generic struggle (“starting a company was hard”) without specific detail. Or perceived as fishing for sympathy without action.
Retention Mechanics: Second-by-Second Rules
What must happen across the arc for TikTok / Reels / Shorts to hold attention. Algorithmic thresholds, not artistic goals.
0–3s — The Swipe Boundary
- Target retention: 80% min (YouTube Shorts benchmark). TikTok algorithm makes its first distribution decision at 1.5s.
- Required: Hook communicates in 1.5s (text + visual + spoken word aligned). Pattern interrupt OR immediate value promise.
- Signal: Does the segment open with one of the 6 hook archetypes above?
- Failure: Slow intro / preamble / context-setting before hook. Every second of pre-hook ≈ 15% retention loss.
3–8s — Hook Delivery & First Context
- Target retention: 70%+ (maintenance phase).
- Required: Hook promise clarified (question answered, claim explained, direction revealed). Transition feels inevitable, not forced.
- Signal: By 8s, viewer should know the clip’s thesis. Ambiguity here kills retention.
- Failure: Hook delivered, then silence/delay. Pacing collapse.
8–15s — Proof or Story Escalation
- Target retention: 60%+ (midpoint).
- Required: Specific evidence (number, screenshot, testimonial quote, before/after) OR narrative escalation (stakes rise, contradiction deepens). Pacing tightens, not plateaus.
- Signal: Concrete fact, metric, quote, or plot turn — not filler.
- Failure: Repetition of hook without new info. Talking head without visual support. Common retention cliff at 10–12s.
15–30s — Value Delivery / Climax
- Target retention: 55%+ (most viewers leave before completion; this is the acceptable-loss phase).
- Required: Core lesson, insight, or emotional beat delivered. Setup for ending: resolution or CTA.
- Signal: The “why” for watching is satisfied. Viewers now have a reason to complete or share.
- Failure: Anticlimactic payoff, generic advice, trailing off.
30–60s — Conclusion / Loop / Finish Strong
- Target retention: 70%+ average percentage viewed (short-form-overall benchmark; completion is secondary to APV).
- Required: Rewatch hook (loopability) OR clear CTA OR micro-ending (closed-loop, punchline, emotional landing).
- Signal: Last 3s feel intentional.
- Failure: Rambling into silence, no closure, unclear takeaway.
Founder-Content Levers: What Makes Startup Content Pull
Founder monologues differ from lifestyle / narrative / comedy content. These specific angles make founders go viral in 2026.
L1 — Named Numbers + Founder Attribution
Specificity signals authenticity. “We grew a lot” reads as marketing; “we hit $1.2M ARR because of the 3-email cold outreach sequence” reads as earned data.
Signal: Precise revenue / user / growth / ROI figure + paired with a specific founder action (“we did X, got Y”) + ideally non-round number (23 vs 25, 1.2M vs 1M).
Fails: Numbers without context, or generic attribution.
L2 — Technical Detail as Credibility Proof
When a founder explains implementation detail (API choice, algorithm, schema pattern), it signals they actually built something. AI-startup founders especially: “we prompt-chained X to Y” reads as expertise.
Signal: Brief technical term, tool name, or build step that only an actual builder would know. Examples: “we switched from GPT-4 to Claude for latency reasons,” “the bottleneck was inference cost, so we quantized the model.”
Fails: Buzzword soup without specificity. Technical detail that doesn’t connect to business outcome.
L3 — Building-in-Public Moment
Live decisions, launches, milestones trigger FOMO + credibility simultaneously.
Signal: Present tense or near-past (“we just shipped,” “we’re about to,” “as of yesterday”) + concrete artifact (new feature, user testimonial, metrics movement).
Fails: Vague future tense (“we’re planning to”) without shipped proof.
L4 — Vulnerability + Recovery Pattern
Admitting mistakes or near-deaths (and what was learned) earns trust.
Signal: Specific failure or crisis with timeframe (“in month 4,” “before we pivoted”) + decision in response + outcome.
Fails: Humble-brag without real stakes. “We learned a lot from our mistakes.”
L5 — Customer Quote or Social Proof (With Names)
A testimonial with a specific use case or first-name reads as proof.
Signal: Direct quote or paraphrase from a named customer, partner, or user, with a specific use case.
Fails: Attributed quotes with no context. Generic praise.
L6 — Contrarian Bet Explained
Founders explaining why they chose the unpopular path signal conviction.
Signal: Bold claim (“we decided to go vertical instead of horizontal”) + reasoning in 2–3 sentences. High-velocity, not lengthy.
Fails: Contrarian without reasoning.
Platform Deltas: Same Clip, Three Platforms
| Signal | TikTok | Instagram Reels | YouTube Shorts |
|---|---|---|---|
| Algorithm priority | Watch-through in first 1.5s; completion rate | Saves & shares > likes; aesthetic consistency | Watch time + search keywords (title/description) |
| Native style | Raw, trend-aware, quick cuts | Polished, aesthetic, visual consistency | Educational, clear metadata, “how-to” friendly |
| Optimal length | 15–60s (avg swipe at 41s) | 15–30s for highest completion | 30–60s; keyword in title matters more |
| Hook requirement | Critical; lands in 1–2s | Important but secondary to visual polish | Important but searchable title compensates |
| Audio | Trending sounds boost discovery; original audio now performs 156% better than before | Can use trending audio, but watermarked TikTok reposts get punished | Title/thumbnail > audio |
| Captions | In-app overlays; on-screen text +30% retention | Visual-overlay consistency; on-screen text expected | Highly important; many watch muted |
| Repurposing cost | Highest; TikTok-watermarked Reels get punished | High; IG detects and deprioritizes TikTok watermarks | Low; can upload clean vertical video |
| Failure mode | Hook misses first 2s → cold-start, no distribution | Production-value + weak hook = siloed to followers | Poor metadata (generic title) → no search traffic |
Per-platform adapt:
- TikTok version: Use trending sound when relevant, max contrast + quick cuts in first 3s.
- Reels version: Polish visual consistency, strip watermarks, lean save/share CTAs.
- Shorts version: Optimize title + description with keywords (founder/builder/startup topic), lean educational (“How we X’d our Y by Z%”).
Brand-Voice Tension: When to Bend, When to Hold
Founder monologues carry brand voice — but platforms reward platform-native voice. Tradeoff: earned authenticity vs algorithm score.
Hold brand voice when:
- Founder has accumulated audience trust (100K+ followers, >1% engagement across 20+ clips).
- Contrarian position already established.
- Niche audience expects long-form depth (B2B SaaS, deep-tech, academic — more patient).
Bend toward platform when:
- Clip is first/early in a series (audience doesn’t yet know the founder’s voice).
- Goal is discovery/reach over brand tone.
- Founder’s brand is itself about being authentic-to-platform.
Hybrid (recommended for A Friend / Daniel):
- Hook (0–3s): Use platform-native archetypal hook (pattern interrupt, curiosity gap, specific number).
- Body (3–30s): Founder’s voice — authentic vocabulary, pacing, examples. Daniel: keep German/English code-switching here.
- Ending (30–60s): Reinforce brand position or learning, not a virality hack.
For A Friend specifically: brand voice rules from _brand/voice-preservation.md are non-negotiable in body + ending. Hook can be slightly bent toward platform-native if it makes the clip land — but the bend must not violate banned vocabulary or break the two-beat cadence.
Segment-Selection Heuristics: Scriptable “Ship-Worthy” Signals
Given a transcript, these signals identify segments to clip.
Tier 1 — High Confidence (script SHOULD flag)
Combinations nearly guaranteed to work.
Pattern A — Vulnerability + Specific Number + Outcome
- Contains: “I was [scared/wrong/nearly]” + timeframe/dollar/count + “so we [did X]” + result.
- Example: “I almost shut the company down in month 4 when we were burning 500K ARR.”
- Recommendation: SHIP.
Pattern B — Bold Contrarian + Expert Credential + Specific Example
- Contains: “Everyone says [belief]” / “common advice is wrong” + credential + concrete example.
- Example: “Everyone says you need VC funding to scale. I’ve bootstrapped 4 companies. We grew this one to $2M ARR without a single dollar of venture capital.”
- Recommendation: SHIP.
Pattern C — Shipped Artifact + Named Metric
- Contains: “We just launched [X]” / “as of yesterday” + early-user metric or % delta.
- Example: “We shipped the new vector search this morning. Early beta users say it reduced their inference latency by 60%.”
- Recommendation: SHIP.
Pattern D — Technical Detail + Business Outcome
- Contains: tool/architecture/API choice + reason + outcome.
- Example: “We switched from GPT-4 to Claude 3.5 Sonnet because we needed faster outputs for our chat interface. Turns out our users prefer the response style anyway.”
- Recommendation: SHIP.
Tier 2 — Medium Confidence (script SHOULD CONSIDER)
One or two signals; likely to work but not guaranteed.
Pattern E — Specific Number Alone: single claim with precise non-round number. Medium-high — needs narrative arc or contrast to be great.
Pattern F — Contrarian Claim Alone: bold opening, proof in second sentence. Medium — risky if explanation fails.
Pattern G — Customer Moment: quoted/described customer experience. Medium — engaging but generic without name + use case + revenue impact.
Tier 3 — Lower Confidence (script MAY skip or mark CANDIDATE)
Pattern H — Lesson/Advice Standalone: pure teaching without personal story or proof. Low-medium — depends on existing credibility.
Pattern I — Vague Aspiration / Future Tense: “we’re planning to,” “our goal is” without shipped proof. Low.
Pattern J — Generic Gratitude / Filler: thank-you, conversational filler. Skip.
Scoring Rubric
| Signals present | Recommendation |
|---|---|
| 3+ Tier-1 signals | SHIP — high confidence |
| 2 Tier-1 signals | SHIP — strong clip |
| 1 Tier-1 + 2+ Tier-2 | SHIP — good clip |
| 2+ Tier-2 only | CONSIDER — needs review |
| 1 Tier-2 only | HOLD — re-evaluate in context |
| Tier-3 only | SKIP unless niche-specific |
Mapping to hook_strength (1–5) in cut-plan.json
- 5 = Tier-1 Pattern A/B/C/D AND opens with a clear hook archetype (1.5s rule met).
- 4 = Tier-1 pattern present but hook is in the body, requires reorder to lead.
- 3 = Tier-2 pattern with narrative arc; needs supporting B-roll to hold retention.
- 2 = Tier-2 alone, no clear hook; CONSIDER but expect lower pull.
- 1 = Tier-3; skip unless contextual reason to ship.
Refinement Backlog
Topics for deeper research as the skill matures:
- Niche-specific hook variance — does founder/builder content respond differently to curiosity-gap vs contrarian vs vulnerability hooks across niches (fintech / climate / deep-tech)? Early signal: vulnerability beats outperform in deep-tech. Needs 5+ niche validation.
- Cross-platform serialization — when a founder posts the same clip TikTok → Reels → Shorts, does sequence order matter for audience journey? (Reels performs better if they’ve seen TikTok first?) Compounding effect unexplored.
- AI-startup content premium — does AI/LLM/agentic-systems content have higher retention floors? Sample data shows 8.2% engagement vs 2.8% TikTok baseline. Needs control for founder-credibility confound.
- A Friend brand-voice eval — once 20+ clips are shipped from this skill, build a feedback loop scoring which clips Daniel approved unchanged vs which he edited. The edit pattern reveals where the skill’s brand-voice match is weakest.
- German-language short-form deltas — most virality data is English-centric. Daniel’s clips will be German/English mixed. Open question: do the same hook archetypes pull on German-speaking TikTok / Reels audiences? Needs separate research pass.
Sources
- The 5 TikTok Hook Types That Actually Go Viral in 2026 — OpusClip
- TikTok Algorithm 2025: Complete Marketer’s Guide — DataSlayer
- 10 Viral Video Hooks That Work Every Time in 2025 — Reezo AI
- YouTube Shorts Benchmarks 2026 — Humble&Brag
- Top 10 Viral TikTok Hooks Examples That Still Work in 2025 — SMMNut
- How the TikTok Algorithm Works in 2025 — Hootsuite
- YouTube Shorts vs TikTok vs Instagram Reels: Which Platform Should You Focus On in 2026 — Heropost
- Short-Form Video Trends in 2025 — Flying V Group
- The Only Short-Form Strategy You Need In 2026 — Nathan Barry