Private · on-device · open source

Record your home. Keep its language. Nothing leaves the house.

Voicebook turns the conversations in your home — ad-hoc or ongoing — into a private, searchable archive of how your family actually talks, all processed on your own machine. From that one pipeline come the things you'll actually open: a child's language journal, a searchable family memory, a Sunday digest for the grandparents.

Language Journal · entry · Claire · age 3
妈妈,我们能读恐龙书吗?
"Mama, can we read the dinosaur book?"
Asked in Mandarin, then repeated "dinosaur book!" in English: a clean code-switch between utterances, caught and tagged automatically.
2026-06-08 · bedtime · books · code-switch
The pipeline

Record once. The pipeline does the rest — on your own machine.

Leave it running in the background or capture a moment on the spot. Either way, one local pipeline turns raw sound into a structured record of who said what, in which language, over time.

Record
Ad-hoc or always-on. A memo, a dinner, a bedtime story — captured in the room, not the cloud.
household audio
Process locally
whisper.cpp and local models on your Mac transcribe, separate speakers, and detect language — no upload, ever.
whisper.cpp · on-device
Private archive
A searchable record of your household's language — every speaker, every code-switch, every moment, kept and yours.
local · searchable
One capture layer, tracking how your family really talks — and none of it leaves the house.
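
To make "who said what, in which language, over time" concrete, here is a minimal sketch of what one archive record might look like. The `Utterance` fields, the `by_speaker` helper, and the clock times are illustrative assumptions, not Voicebook's actual schema or data.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Utterance:
    """One transcribed line. Every field name here is hypothetical."""
    speaker: str              # label from speaker separation, e.g. "Claire"
    language: str             # detected language code, e.g. "zh" or "en"
    text: str                 # the line exactly as it was said
    started_at: datetime      # when the line was spoken
    tags: tuple[str, ...] = ()


def by_speaker(archive, speaker):
    """Return one speaker's lines, oldest first."""
    return sorted(
        (u for u in archive if u.speaker == speaker),
        key=lambda u: u.started_at,
    )


# Example entries; the times of day are made up for illustration.
archive = [
    Utterance("Claire", "en", "dinosaur book!",
              datetime(2026, 6, 8, 19, 31), ("bedtime", "books", "code-switch")),
    Utterance("Claire", "zh", "妈妈,我们能读恐龙书吗?",
              datetime(2026, 6, 8, 19, 30), ("bedtime", "books")),
]
```

Keeping the speaker, language, and timestamp on every line is what would let the journal, search, and digest all read from the same archive.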
The applications

One archive. Many ways to use it.

The pipeline is the product; these are what it becomes once your home's voice is captured and indexed.

where it started
Flagship use case

Language Journal

An auto-generated bilingual diary of a child's language — code-switching and all. Watch Claire's Mandarin and English grow month by month, in her own words, without transcribing a thing by hand.

Everyday

Family Search

Ask "what did Claire say about the museum last Sunday?" and jump straight to the moment — audio queued, in context.

Ongoing

Weekly Digest

A one-page Sunday summary — bilingual, side by side — for grandparents overseas who miss the kid between visits.

Under the hood

Built for how families actually talk — not an API call.

Off-the-shelf transcription falls apart on overlapping speakers, code-switching, and a three-year-old in a noisy kitchen. So the hard parts were rebuilt around that.

01 · Local & private by default

whisper.cpp and local models run on your own Mac. Your family's voice is never uploaded, brokered, or turned into someone's training data.

02 · Finds the kid's voice

A pitch-analysis layer tags who's speaking by voice, catching a child even in a noisy room — the moments generic transcription would miss.
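
As a rough illustration of the idea, the sketch below estimates the fundamental frequency of a short audio frame with plain autocorrelation and flags high-pitched frames as likely child speech. This is not Voicebook's actual layer: the function names, the 16 kHz sample rate, and the 250 Hz threshold are assumptions chosen for the example, and real rooms need far more than a clean synthetic tone.

```python
import math

SAMPLE_RATE = 16_000  # Hz; assumed capture rate for this sketch


def tone(freq_hz, seconds=0.1, sample_rate=SAMPLE_RATE):
    """Synthetic sine wave standing in for a voiced audio frame."""
    n = int(seconds * sample_rate)
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n)]


def estimate_pitch_hz(samples, sample_rate=SAMPLE_RATE, fmin=80.0, fmax=500.0):
    """Estimate pitch as the lag with the strongest autocorrelation."""
    min_lag = int(sample_rate / fmax)
    max_lag = int(sample_rate / fmin)
    best_lag, best_score = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        score = sum(
            samples[i] * samples[i + lag] for i in range(len(samples) - lag)
        )
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else 0.0


def is_likely_child(pitch_hz, threshold_hz=250.0):
    """Hypothetical rule: treat pitch above the threshold as a child's voice."""
    return pitch_hz > threshold_hz


child_pitch = estimate_pitch_hz(tone(300.0))  # stand-in for a child's voice
adult_pitch = estimate_pitch_hz(tone(120.0))  # stand-in for an adult's voice
```

A single threshold would misfire on higher-pitched adult voices, so treat it as one signal among several rather than a speaker label on its own.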

03 · Bilingual by design

Tuned for mid-sentence Mandarin–English code-switching and your household's own vocabulary — nicknames, inside jokes, character names.
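
One simple way to picture code-switch tagging is to split a transcribed line into runs by writing system. The sketch below does only that, on text rather than audio, and is an assumption for illustration: `script_of`, `language_runs`, and `is_code_switch` are hypothetical helpers, and the mixed sentence is a made-up example.

```python
def script_of(ch):
    """Classify one character as Mandarin ("zh"), English ("en"), or neutral."""
    if "\u4e00" <= ch <= "\u9fff":       # CJK Unified Ideographs block
        return "zh"
    if ch.isascii() and ch.isalpha():
        return "en"
    return None                           # spaces, digits, punctuation


def language_runs(text):
    """Split a line into consecutive (language, chunk) runs."""
    runs = []  # list of [language, chunk]
    for ch in text:
        lang = script_of(ch)
        if lang is None:
            if runs:
                runs[-1][1] += ch         # keep spacing with the current run
            continue
        if runs and runs[-1][0] == lang:
            runs[-1][1] += ch
        else:
            runs.append([lang, ch])
    return [(lang, chunk.strip()) for lang, chunk in runs]


def is_code_switch(text):
    """True when a single line contains both languages."""
    return len({lang for lang, _ in language_runs(text)}) > 1


mixed = "我们读 dinosaur book 好吗"      # made-up mid-sentence switch
runs = language_runs(mixed)
```

A script check like this cannot handle pinyin, loanwords, or names, which is part of why household vocabulary has to be tuned for separately.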

04 · It never invents a memory

Cleanup only classifies lines that were actually said. The model is never allowed to write new text, so it can't fabricate a moment that never happened.
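
The classify-only rule can be pictured as a filter whose only model input is one label per line. The sketch below is an assumption about how such a guard might look, not the project's code: the label set and `apply_cleanup` are hypothetical, and the transcript lines are illustrative.

```python
ALLOWED_LABELS = {"keep", "noise", "duplicate"}  # hypothetical label set


def apply_cleanup(lines, labels):
    """Filter transcript lines by model-assigned labels; never rewrite text."""
    if len(labels) != len(lines):
        raise ValueError("expected exactly one label per line")
    unknown = set(labels) - ALLOWED_LABELS
    if unknown:
        # Anything outside the label set, including free text, is rejected.
        raise ValueError(f"unexpected labels: {sorted(unknown)}")
    # Output is built only from the original strings, so it is always
    # a subset of what was actually said.
    return [line for line, label in zip(lines, labels) if label == "keep"]


lines = ["Mama, can we read the dinosaur book?", "[clatter]", "dinosaur book!"]
kept = apply_cleanup(lines, ["keep", "noise", "keep"])
```

Because the model's output never reaches the archive as text, a hallucinated sentence can at worst fail validation; it cannot become an entry.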

Yours, on your machine

The moments in your home are worth keeping — privately.

Open source, local-first, and built around one family's real problem before it was ever a product. If that's your household too, let's talk.

Book a conversation