Synced from
references/essence-extraction-framework.mdincontent-extractionskill on 2026-05-18. Edit upstream in the skill; this file is overwritten on next sync.
Essence-Extraction Framework — Schärfung / Trim / Essenz
The operational framework the virality sub-agent applies to Daniel’s source. Owned by
virality/SKILL.md. The German triple — Schärfung (sharpening), Trim, Essenz-Extraktion (essence-extraction) — is Daniel’s framing (verbatim 2026-05-03). This file makes those three actions concrete.
The premise: a raw 2-3 minute monologue contains 30-60s of actually-good content. The sub-agent’s job is to find that 30-60s, cut everything else, and surface what specifically pulls. Length is not a virtue. A 50s clip with substance pulls harder than a 115s clip with substance + fluff.
Three operations, applied in order:
- Schärfung — find the strongest punches in source; pull them to clip-second 0
- Trim — drop weak material adjacent to the punches; cut redundancy WITHIN strong segments
- Essenz — distill the kernel of the clip’s argument; verify the I-believe-that landed
This file defines the heuristics each operation applies. The sub-agent runs each operation against the draft cut-plan arcs (post-silence-removal, pre-cut) and proposes specific changes in virality-pass.md.
Operation 1: Schärfung — find the strongest punch, lead with it
Premise: A 2-3min Daniel monologue typically has 1-3 Tier-1 self-supplying scroll-stoppers. They’re rarely at the start of source. They’re usually in the LAST third (Daniel’s cadence — he warms up, then lands the kernel).
Where to look (in order of yield):
-
Last 30s of source. Daniel’s strongest reframes statistically land here. His “warm-up → land” arc means the last beat IS often the kernel. Already encoded in parent SKILL.md Rule A “Tail-of-source as opener” — sub-agent enforces explicitly.
-
Mid-source vulnerability beats. When Daniel says “I almost…” / “I was wrong about…” / “this can’t be it” — those are usually unfiltered. Specificity earned through emotion.
-
Tool-naming declaratives. “Pulled up all my agents” / “I’m using Claude + Figma + the skill” → Pillar 7 punches. Land hard because they’re concrete + named.
-
Two-beat cadence in source. Brand’s signature rhythm. Daniel’s: “Tools end conversations. Companions hold them.” These are auto-Tier-1 — pull forward whenever found.
-
Categorical reframes. “We are not merely building toys” / “this constant overwhelm of having multiple windows of agents” — reframes existing assumptions. Self-supplying.
Where NOT to look:
- Source intro. Series-anchor language (“welcome to episode N” / “today I want to talk about”) almost never qualifies. Functional, not punchy.
- Restart-pause openers. “Yeah, so…” / “Okay, basically…” → filler.
- Generic vulnerability. “Building a startup is hard” without specificity → fails So-What test.
Schärfung outputs (per draft clip):
### Schärfung
- Current opener: "<verbatim>" (segN, source HH:MM:SS)
- **Proposed opener:** "<verbatim from elsewhere in source>" (segM, source HH:MM:SS)
- Why this lands harder: <name the technique — categorical reframe / vulnerability / tool-naming / two-beat / tail-distillation>
- Source-tail searched: ✓ (sub-agent MUST verify last 30s explicitly)If the proposed opener is literally the original opener — that’s a valid output. But the sub-agent must explicitly state the tail was searched and rejected (don’t skip the work).
Operation 2: Trim — kill weak material, internal + adjacent
Once the punch is identified, every adjacent second must earn its place. Three sub-operations:
Trim 2a: Adjacent-weakness elimination
A weak segment between two strong ones gets cut. The viewer’s retention is determined by the WEAKEST adjacent beat, not the average. A clip with [punch — filler — punch] reads worse than [punch — punch] at half the runtime.
What counts as “weak adjacent”:
- Restating the previous beat in different words
- Conjunction-led continuations (“So that’s why…”, “And that’s how…”) without new substance
- Repetition of theme without new angle
- Fluffy reflection that doesn’t add specificity (“I think about this a lot”)
- Topic drift that disrupts the arc
Heuristic: for each segment N, ask “does this segment introduce a NEW substance beat? (named thing / specific number / vivid image / pivot / cliffhanger)” — if no, propose drop.
Trim 2b: Internal redundancy (Rule K extension)
A single segment can have internal redundancy. Daniel’s natural cadence sometimes restates inside a sentence — “I got two more days and I said to myself, two more days, then it’s done.” The repetition + connective tissue (“and I said to myself”) can usually be cut to land tighter.
When to apply:
- A phrase repeats inside a 30s window (often natural-speech redundancy).
- Connective tissue (“and I said to myself”, “you know”, “so basically”) sits between two punches.
- A long sentence has a tight kernel buried in qualifying clauses.
When to NOT apply:
- The repetition IS the rhetoric (deliberate restatement for emphasis — Daniel’s “two more days. Two more days.” cadence).
- Connective tissue carries voice (cutting kills it).
- Sub-cut would break audio flow into a glitch.
Path: word-level data tells you when each word starts/ends. Find redundant span in/out points, encode as sub-segment cut. Validate audibly.
Trim 2b outputs:
### Trim (word-level cuts inside quotes)
- segN quote: "<original>"
- **Drop:** "<phrase>" (word-level: X.XXs–Y.YYs)
- Audio result: tighter [recursive-shrink reveal / punchline landing / cadence]Trim 2c: Weak-tail elimination
Outro/CTA segments are almost always trimmable. Daniel’s “so let’s get on this journey…” closes have value as series anchors — but for shipped short-form they’re sells-y and break the loopability rule.
Default action: propose dropping outro segments unless they contain a promise-close or two-beat cadence. The clip should END on the climax beat, not the wave-goodbye.
Operation 3: Essenz — what is this clip’s ONE sentence?
Every clip has exactly one I-believe-that line that completes its argument. If the clip can’t be reduced to one sentence Daniel would own, the clip isn’t ready.
The kernel test:
“If this monologue had to fit in one sentence Daniel would still own, which sentence would it be?”
That sentence is the Essenz. It’s:
- A verbatim line from source (NOT synthesized)
- Specific enough that someone could quote it back to him and he’d recognize it
- Actually about ONE thing (not a list)
- Stake-bearing (passes So-What test)
The kernel comes BEFORE the cut, not after. The sub-agent identifies the Essenz first, then asks: “what segments support this Essenz? what segments distract from it?”
Segments that don’t support the Essenz get cut. Even if they’re well-said. Even if Daniel meant them. They belong in a DIFFERENT clip with a DIFFERENT Essenz.
This is how a 115s master becomes a 50s clip: most of the master’s segments don’t all support the same Essenz. They support multiple Essenz lines that need separate clips.
Essenz outputs:
### Essenz (one-sentence kernel)
> "<the I-believe-that line that completes the clip's argument>"
**Source location:** segN, HH:MM:SS
**Pillar fit:** <pillar from _brand/pillars.md>
**Schwartzberg pass:**
- I-believe-that: ✓ "<expanded sentence>"
- So-what: <Yes — who pushes back / No — too obvious>
- Why-test: <weighted vs filler analysis>
**Segments supporting this Essenz:** seg2, seg4, seg7
**Segments NOT supporting this Essenz (propose drop):** seg1, seg3, seg5, seg6If a clip’s source contains TWO viable Essenz lines, the sub-agent proposes splitting into TWO clips. Don’t try to ship a clip that argues two things.
The Compression Math
How much should a clip get cut from raw source? Heuristic:
| Source duration | Typical shipped duration | Compression ratio |
|---|---|---|
| 30s raw | 20-25s | 80% kept |
| 60s raw | 30-40s | 50-65% kept |
| 90s raw | 35-50s | 40-55% kept |
| 120s raw | 40-60s | 35-50% kept |
| 180s raw | 45-70s | 25-40% kept |
These ratios are rough — substance density varies. But they’re the bias: a 3-minute monologue should yield a sub-1-minute clip in the typical case.
If silence-removal alone (Phase 1) produces a 1.5min cut from a 3min source, the sub-agent’s job is to recommend cutting it FURTHER to 50-60s. The path is: identify Essenz → keep supporting segments → drop the rest, even if the rest is nice.
Anti-pattern: treating Phase 1’s silence-removed output as the final cut. That’s the equivalent of “I removed the dead-air, ship it.” Dead-air removal is necessary but not sufficient. The substance-density check (Essenz alignment) is what shortens the clip from 1.5min to 50s.
The Schärfung-Trim-Essenz integration loop
Per draft clip, sub-agent runs:
- Schärfung pass → propose hook reorder
- Essenz pass → identify the kernel
- Trim pass → propose adjacent + internal cuts that don’t support the Essenz
- Recompute target duration → cut should be 30-60s for short-form publish, ≤30s if Essenz is tight, ≤90s only if Essenz justifies depth
- Re-run attention-preservation scorecard (
references/attention-preservation.md) → verify second-hook lands at 12-18s, pacing-shifts present, loopability achievable
If after all passes the proposed clip still exceeds 60s OR has a flat retention curve, the sub-agent’s verdict is REVIEW or REJECT — Daniel decides. Don’t ship a clip the sub-agent itself flagged as marginal.
What the framework does NOT do
- Does NOT synthesize new lines. Every proposed opener / kept segment / trim recommendation is verbatim from
transcript.txt. - Does NOT auto-merge into
cut-plan.json. Outputsvirality-pass.mdfor Daniel review. - Does NOT enforce hard runtime caps. Recommends a target; Daniel decides.
- Does NOT preserve Daniel’s intended order. Reorder is on the table — anything verbatim can move.
- Does NOT score against general short-form virality. Scores against A Friend register (anti-slick — see
references/aesthetic-register-anti-slick.md).
Brand non-negotiables (hard gates)
The sub-agent’s output MUST pass these checks before being surfaced to Daniel:
- Every proposed opener / kept segment / trim is verbatim substring of
transcript.txt(whitespace-tolerant). - No banned vocab in any sub-agent surface text (titles, framing, commentary). Banned: revolutionize, seamless, AI-powered, game-changer, leverage, exclamation points anywhere.
- Two-beat cadence preserved in any proposed re-arc.
- Code-switching preserved (German/English mix).
- Tool-naming check (Pillar 7 only) — surfaced in
virality-pass.md. - Trim proposals ≥0.4s tagged “VALIDATE BY EAR” — Daniel listens before merge.
Failures auto-inject BRAND-NONNEGOTIABLE-VIOLATION tag inside virality-pass.md rather than silent emit.
Sources & canon basis
references/shortform-virality.md— parent rubric (Tier-1/2/3, hook archetypes, Schwartzberg).references/attention-preservation.md— temporal mechanics (second-hook window, loopability, retention floors)._brand/voice-preservation.md— verbatim contract, banned vocab, two-beat, code-switching, “feels like me” gate._brand/pillars.md— 7 pillars, anti-pattern detection.- Joel Schwartzberg Get to the Point — I-believe-that / So-what / Why-test framework.
- William Strunk Jr. & E. B. White The Elements of Style — omission discipline (“Omit needless words”).
- Nathan Barry “compression as care” essay — substance-density per second as the right metric.
- A Friend campaign-state.md — what’s been shipped, which Essenz lines exist already, anti-duplication.
The framework cites THIS file when documenting a recommendation. Not generic content-creator advice.