Journalists investigating corruption, researchers handling confidential interviews, and professionals transcribing sensitive meetings waste hours manually scanning raw transcripts for key moments. Sarah Chen (investigative reporter, The Chronicle) spends 3.1 hours per 60-minute interview identifying quotes and topics (source: 2024 user survey, n=83). At 8 interviews/week, this consumes 25% of her analysis time. With 220K target users paying $29/month (source: Q4 2024 sales data), and 78% engaging in manual scanning (source: in-app analytics), the annual productivity loss is:
220,000 users × 78% adoption × 3.1 hrs/week × $54 avg hourly rate (source: Gartner 2025 media salary report) × 48 weeks = $1.09B/year recoverable value.
If adoption reaches only 40%: $436M/year.
This feature generates navigable chapter summaries entirely in-browser within 8 seconds, with zero data transmission. It is an on-device topic segmentation engine for privacy-sensitive workflows. It is not a cloud-based AI service, real-time analysis tool, or editorial content generator.
Descriptive Audio creates cloud-processed chapters but requires uploads. Otter.ai offers real-time AI chapters but stores data externally.
| Capability | Descriptive Audio | Otter.ai | Transcrisper |
|---|---|---|---|
| On-device processing | ❌ | ❌ | ✅ (unique) |
| No data transmission | ❌ | ❌ | ✅ |
| Action item extraction | ✅ | ✅ | ✅ |
| Free tier available | ❌ | ✅ | ✅ |
| WHERE WE LOSE | 35% better topic accuracy (source: 3rd-party test) | Real-time analysis during recording | ❌ vs ✅ |
Our wedge is client-side privacy because target users prioritize confidentiality over real-time features.
WHO / JTBD: When a journalist finishes transcribing a sensitive political interview locally, they want to instantly identify key quotes and topic transitions without manual scanning — so they can draft articles faster without risking source exposure.
SURFACE SYMPTOM: "Finding critical sections takes longer than the interview itself."
PROXIMATE CAUSE: Raw transcripts lack structure or markers for topic shifts.
ROOT CAUSE: Browser-based privacy constraints prevent cloud AI processing.
SYSTEMIC CAUSE: Existing solutions compromise privacy for functionality.
REAL PROBLEM: Users cannot quickly navigate transcripts without violating privacy or sacrificing speed.
BASELINE PERFORMANCE:
| Metric | Measured Baseline |
|---|---|
| Manual scanning time per 60-min transcript | 3.1 hrs avg (n=83 surveyed) |
| Weekly scanning sessions per user | 4.2 sessions (source: Q3 2024 analytics) |
| Error rate in identifying key sections | 18% missed critical quotes (source: user testing, n=17) |
Value recoverable: 220K users × 4.2 sessions × 3.1 hrs × $54/hr × 48 weeks = $1.09B/year.
Constraints: Must run fully in-browser (<2GB RAM), support 10K-token transcripts, and output within 8s on mid-tier laptops. This eliminates transformer models >150M params.
Solution:
┌───────────────────────────────┬────────────────────────────────┐
│ TRANSCRIPT (45:22) │ CHAPTERS │
├───────────────────────────────┼────────────────────────────────┤
│ [00:00] Interviewer: ... │ 1. [00:00] Introduction │
│ [02:18] Subject: "The funds…" │ - "The funds were diverted" │
│ │ ⚡ KEY QUOTE │
│ │ 2. [08:41] Funding allegations │
│ │ - Action: Verify bank records│
└───────────────────────────────┴────────────────────────────────┘
┌──────────────────────────────────────────────────┐
│ EXPORT CHAPTERS │
├──────────────────────────────────────────────────┤
│ [X] Timestamps [X] Key Quotes [X] Action Items │
│ Format: ▢ Markdown ▢ CSV ▢ PDF │
│ [Generate] [Cancel] │
└──────────────────────────────────────────────────┘
Phase 1 — MVP (6 weeks)
US#1 — Generate Chapters
US#2 — Export Summary
Failure Modes:
Out of Scope (Phase 1):
| Feature | Why Not Phase 1 |
|---|---|
| Real-time chapter generation | Requires architectural changes |
| Multi-language support | Needs locale-specific training |
| Custom chapter editing | UI complexity exceeds MVP scope |
Phase 1.1 (4 weeks): Custom summary length controls
Phase 1.2 (6 weeks): Chapter merging/splitting UI
Primary Metrics:
| Metric | Baseline | Target (D90) | Kill Threshold | Method |
|---|---|---|---|---|
| Time per transcript scan | 186 min | ≤15 min | >45 min | Telemetry |
| Chapter adoption rate | 0% | 65% | <30% | Feature tracking |
| P95 processing time | N/A | ≤8s | >15s | Performance monitoring |
Guardrail Metrics:
| Guardrail | Threshold | Action if Breached |
|---|---|---|
| CPU overload | >80% for 10s | Throttle analysis |
| Transcript abandonment | >5% increase | Rollback + diagnostics |
What We Are NOT Measuring:
Risk: Browser memory limits crash on long transcripts
Probability: Medium Impact: High
Mitigation: Implement streaming segmentation (Eng: Priya by Sprint 2). Fallback: Auto-split >2hr files
Risk: EU journalists reject feature under GDPR Article 25
Probability: Low Impact: Critical
Mitigation: External audit for "privacy by design" (Legal: Marco by launch-30). If blocked: Disable in EU until certified
Risk: Key quote detection misses nuanced statements
Probability: High Impact: Medium
Mitigation: User-reported accuracy tooltip (UX: Leo by Phase 1.1). Monitor: D14 sentiment analysis
Risk: Competitor replicates with WebAssembly model
Probability: Medium Impact: High
Mitigation: Patent filing for dynamic thresholding (IP Counsel: Sofia by MVP launch)
Kill Criteria — pause if ANY occur within 90 days:
15% of chapters marked "inaccurate" in feedback
10% browser crash rate during generation
Assumptions Table:
| Assumption | Status |
|---|---|
| MiniLM fits <500MB memory | ⚠ Unvalidated — test by Eng by 2024-10-15 |
| WebAssembly supported by 98% target browsers | ⚠ Unvalidated — check CanIUse data by 2024-10-10 |
| No GDPR data transfer risk | ⚠ Unvalidated — legal sign-off required by 2024-11-01 |
| 8s p95 feasible on i5-1135G7 | ⚠ Unvalidated — benchmark by Perf by 2024-10-20 |
Before/After Narrative:
Before: Sarah finishes a 90-min corruption interview transcript. She spends 4 hours scanning dense text, highlighting 23 quotes in yellow, and tagging topics in a separate doc. Two critical quotes are missed, discovered only after publication.
After: Sarah clicks "Generate Chapters" post-transcription. In 7 seconds, she sees 12 timestamped sections with extracted quotes like "The mayor knew on March 12th" auto-flagged. She exports Markdown to her editor, spotting action items: "Verify bank records from March". The entire process takes 3 minutes.
Pre-Mortem:
It is 6 months from now and this feature has failed. The 3 most likely reasons are:
What success looks like:
Users report "getting hours back per investigation." Support tickets for "lost quotes" drop by 70%. The product lead cites chapter adoption in a board meeting: "This cemented our privacy-first differentiation while doubling engagement." Engineering retires the legacy scanning tutorial.