Expressive TTS
700 hours, full of feeling
Around 8 dedicated speakers record 700 hours — not flat studio reading, but speech carrying real emotion: laughter, lament, lullaby. Enough for a voice that actually sounds Kashmiri.
Flagship effort
Zoon means moon. It is also the largest open speech corpus ever attempted for Kashmiri — built so the language can live inside the voice technology the rest of the world takes for granted.
The idea
How Zoon is built · scroll
Expressive TTS
Around 8 dedicated speakers record 700 hours — not flat studio reading, but speech carrying real emotion: laughter, lament, lullaby. Enough for a voice that actually sounds Kashmiri.
Speech-to-text
500 hours gathered from about 500 everyday speakers — every district, dialect, age and accent — so recognition works for the whole valley, not just newsreaders.
Curate & align
Every clip is transcribed, timestamped and quality-checked — turning raw recordings into data a model can actually learn from.
Release openly
The finished corpus is released openly for researchers, developers and dreamers — anyone who wants to build Kashmiri into their tools.
Your part
The STT half needs hundreds of ordinary speakers. Five minutes of reading aloud is a real contribution.
By the numbers
Lend your voice
No studio, no signup fuss — just your phone and your voice. Every dialect you add makes Kashmiri speech tech work for more people.