Record your home. Keep its language. Nothing leaves the house.

Voicebook turns the conversations in your home — ad-hoc or ongoing — into a private, searchable archive of how your family actually talks, all processed on your own machine. From that one pipeline come the things you'll actually open: a child's language journal, a searchable family memory, a Sunday digest for the grandparents.

See how it works Book a conversation →

Language Journal · entryClaire · age 3

妈妈，我们能读恐龙书吗？

"Mama, can we read the dinosaur book?"

Asked in Mandarin, then repeated "dinosaur book!" in English — a clean mid-sentence code-switch, caught and tagged automatically.

2026-06-08bedtimebookscode-switch

The pipeline

Record once. The pipeline does the rest — on your own machine.

Leave it running in the background or capture a moment on the spot. Either way, one local pipeline turns raw sound into a structured record of who said what, in which language, over time.

Record

Ad-hoc or always-on. A memo, a dinner, a bedtime story — captured in the room, not the cloud.

household audio

→

Process locally

whisper.cpp and local models on your Mac transcribe, separate speakers, and detect language — no upload, ever.

whisper.cpp · on-device

→

Private archive

A searchable record of your household's language — every speaker, every code-switch, every moment, kept and yours.

local · searchable

One capture layer, tracking how your family really talks — and none of it leaves the house.

The applications

One archive. Many ways to use it.

The pipeline is the product; these are what it becomes once your home's voice is captured and indexed.

where it started

语

Flagship use case

Language Journal

An auto-generated bilingual diary of a child's language — code-switching and all. Watch Claire's Mandarin and English grow month by month, in her own words, without transcribing a thing by hand.

探

Everyday

Family Search

Ask "what did Claire say about the museum last Sunday?" and jump straight to the moment — audio queued, in context.

藏

Ongoing

Weekly Digest

A one-page Sunday summary — bilingual, side by side — for grandparents overseas who miss the kid between visits.

Under the hood

Built for how families actually talk — not an API call.

Off-the-shelf transcription falls apart on overlapping speakers, code-switching, and a three-year-old in a noisy kitchen. So the hard parts were rebuilt around that.

01Local & private by default

whisper.cpp and local models run on your own Mac. Your family's voice is never uploaded, brokered, or turned into someone's training data.

02Finds the kid's voice

A pitch-analysis layer tags who's speaking by voice, catching a child even in a noisy room — the moments generic transcription would miss.

03Bilingual by design

Tuned for mid-sentence Mandarin–English code-switching and your household's own vocabulary — nicknames, inside jokes, character names.

04It never invents a memory

Cleanup only classifies lines that were actually said — the model is never allowed to write new text, so it can't fabricate a moment that never happened. See the pipeline →

Yours, on your machine

The moments in your home are worth keeping — privately.

Open source, local-first, and built around one family's real problem before it was ever a product. If that's your household too, let's talk.

Book a conversation