Demo · test idea — Babelio is an exploratory concept, not a real product.
b
Babelio Playbook Lesson 08 / 08
2026-05-16
Lesson 08 · AI Thesis

AI Thesis: AI as the foundation, not the feature.

"AI-powered" is a marketing word. "AI-impossible" is a product fact. Your job in this lesson is to state, in one paragraph, the thing your product needs that did not exist 18 months ago — and then to be honest about which parts of that stack are commodity and which parts are yours.

Duration
16 min read
Format
read + checklist
Goal
Thesis / Flywheel / Moat
Outcome
A one-paragraph AI thesis and a model-routing plan

What this lesson does / does not do.

Does
  • Define "AI-impossible" as a sharper bar than "AI-improved".
  • Explain the data flywheel — usage to data to model to product.
  • Pick a primary model + fallback per stage with cost numbers.
  • Name AI-specific risks and the mitigations that ship in v1.
Does not
  • Pretend you will train a foundation model. You will not.
  • Promise a research moat that doesn't exist on commodity APIs.
  • Replace the audio-routing engineering work (that's the real moat).
  • Cover go-to-market — that was Lesson 06.
01.
Concept 01 · AI-impossible vs AI-improved

The product fact that pre-dates the marketing.

4 minread

"AI-powered" is the laziest phrase on the internet. The useful question is sharper: without the AI capability of the last 18 months, is your product impossible — or merely improved?

An AI-improved product gets a faster autocomplete, a better summary, a smarter search ranker. It would still exist without the model — just slightly worse. An AI-impossible product cannot exist at all if you remove the model: the core experience requires a capability that did not have a price-per-call eighteen months ago. Investors fund the second category. The first is feature-list dressing.

The test is simple. Delete the AI from your description. Can you still write a one-sentence product? If yes, you are AI-improved. If the sentence collapses into nonsense, you are AI-impossible — and your job is to identify which capability curve crossed which threshold in which quarter. That date is your "why now".

02.
Concept 02 · The data flywheel

Usage to data to model to product.

4 minread

A flywheel is not a metaphor for "more users = more better". It is a specific four-arrow loop where each turn lowers the cost of the next turn and a competitor without the loop cannot catch up.

The classical loop is: product creates usage, usage emits a signal, the signal improves the model, the improved model differentiates the product. The honest version names what kind of signal: clicks for ranking, corrections for translation, completions for code, outcomes for diagnosis. Without naming the signal you have a slogan, not a flywheel.

The second honest test: does the signal accumulate where you sit, or where the foundation-model vendor sits? If OpenAI sees the corrections before you do, your flywheel powers their factory. If the corrections live in your stack as glossaries, prompt overrides, routing rules, eval sets — then the asset is yours, and a competitor with the same APIs starts from zero.

03.
Concept 03 · Model strategy + routing

Cheap by default, smart on demand.

5 minread

A serious AI product is not one model behind one prompt. It is a pipeline of stages with a primary and a fallback per stage, routed by signal: confidence, cost ceiling, latency tail, user tier.

The cheap-to-smart pattern works because most requests are easy and a small percentage need the heavyweight. Route by default to the cheapest model that meets the latency and quality bar; reroute the segment to a more expensive model when a confidence signal trips a threshold. This pattern is how you keep gross margin alive while still delivering the best result on the long tail.

Two architectural rules. First, every model goes behind an adapter — never call a vendor SDK directly from product code, or you lock yourself into their roadmap and their price list. Second, every stage has a local-only fallback, even if degraded. If your only option is "cloud or nothing", a price hike or an outage is a company-ending event.

Stage
Primary → Fallback
VAD
Silero (local, always)
STT
Deepgram Nova-3 → WhisperKit local → GPT-4o-transcribe
MT
Gemini 2.5 Flash → Claude Haiku 4.5 (idiom) → GPT-4o-mini (cost floor)
TTS
Cartesia Sonic (free) → ElevenLabs Flash v2.5 (paid) → Piper (offline)
04.
Concept 04 · Risks & mitigations

Failure modes name themselves first.

3 minread

An AI product fails in patterned, predictable ways. The mature founder enumerates the failure modes before the user does, and ships the mitigation in v1 — not "on the roadmap".

Four risk families come up in every conversational-AI product: vendor lock-in (your supply chain), cost spikes (your unit economics), hallucination (your trust budget), and privacy (your regulatory surface). Each has a known structural mitigation. Each must be visible in the architecture diagram, not buried in the FAQ.

The discipline is to convert each risk into a concrete shipping artefact: an adapter interface, a per-user quota, a confidence gate, a retention policy. If a risk has no artefact, it is a wish. Investors and serious customers can tell the difference in under a minute.

Don't claim a model-quality moat in your pitch deck

You use the same Deepgram, Gemini, Claude and ElevenLabs APIs every YC batch does. Sophisticated investors read "AI moat" as either dishonest or naïve. Lead with OS integration depth, latency-tuned UX, and the data-flywheel corpus — those are real and yours.

Don't promise on-device translation parity

WhisperKit is great for STT. Local MT (NLLB-200) is meaningfully worse than Gemini Flash on conversational, idiomatic, code-switched speech. Be honest: "private mode is degraded by design" beats "fully on-device" lying badly to a customer once.

Checklist for this week.

Five concrete actions. By Friday you should be able to read your one-paragraph AI thesis out loud without flinching, and your routing table should fit on one page.

lesson mantra

«AI is the floor, not the ceiling.»

— back to the cover
Next · 00

Playbook: back to the cover.