Brazilian Portuguese,
model-ready.
The direct alternative to enterprise voice marketplaces. Pick the voices from our exclusive Brazilian artist roster, brief the corpus, receive a custom voice pack recorded by a Grammy-credited engineer — model-ready, 10 hours to 1,000+, at up to 75% lower cost than the major voice marketplaces.
License · commercial · cleared
Format · WAV / FLAC · 48 kHz
250 million plays. ~33 million hours of voice listened to.
Our voices have been heard ~3,800 years' worth of continuous play. The same speakers, the same booth — now licensed for your training pipeline.
A specialist studio. Not a marketplace.
We don't license stock libraries and we don't broker freelancers. We produce voice data — to your model's spec, in the volume you need.
An exclusive roster. Direct.
We don't broker — we represent a curated Brazilian artist network and intermediate every booking ourselves. No bidding pools, no marketplace lottery. You pick from a real list of real speakers, and we lock the cast in 48h.
Grammy-grade audio engineering.
Every voice pack is captured and supervised by our chief engineer — Grammy-credited, ten years on broadcast-grade Brazilian productions. The signal your model trains on is the signal that wins records.
Turnkey, any volume.
Casting, direction, recording, capture chain, licensing — we run the production side end-to-end. 10 hours for a voice clone, 100 for fine-tuning, 1,000+ for foundation training. You receive clean audio, ready to feed into your pipeline.
Up to 75% lower than the major marketplaces.
The large voice-talent platforms carry platform markup, per-clip licensing, escrow, and weeks of intake bureaucracy. Direct-to-studio means same quality, faster delivery, and a price that makes large datasets actually fit a training budget.
Pick from the roster. Female and male, every age and tone.
Real speakers, not stock. Browse the profiles, shortlist what fits your model, and we lock the cast within 48h. New voices and timbres added every quarter.
Sung leads & character voices.
Sopranos and child male voices, classically trained. Reliable across sung leads, dialog, character work, and emotional range. The corpus that powered 200+ children's productions.
Conversational & expressive.
Young-adult tenors and altos with natural conversational texture — interjections, recoveries, turn-taking. Built for voice agents and teen-targeted TTS.
Broadcast neutral & character ranges.
Adult sopranos, altos, tenors, baritones. Neutral broadcast register plus character work — calm, urgent, comedic, dramatic. The most flexible block of the catalog.
Narration & warm storytelling.
Documentary-grade baritones and warm female voices. Slow prosody, full dynamic range, ideal for narration, audiobooks, and reflective conversational agents.
◆ Custom castings on request — bilingual, regional accents, specific timbres. New profiles added each quarter.
Three steps. From brief to voice pack.
No studio bookings, no talent contracts, no casting calls. You stay on your model — we handle the production side end-to-end.
- 01 YOU
Pick the voices.
Browse the roster — female & male, children to mature, neutral broadcast to character work. Shortlist the profiles that fit your model. We send samples on the same day; lock the cast within 48h.
Output → cast confirmed - 02 YOU · or · US
Brief the corpus.
Send your script and target volume — or hand us the training objective and we'll write a phonetically-balanced corpus tuned to it. We confirm tone, register, pace, and edge cases (laughter, interjections, multilingual tokens) in one round.
Output → corpus + production plan - 03 US
Receive your voice pack.
We record, run capture-chain QA, and license-clear. Delivery to your cloud bucket — 10h, 100h, or more — clean WAV files with manifest, checksums, and a commercial training license. One invoice, one vendor. Your pipeline takes it from there.
Output → clean audio, your spec
What arrives in your bucket.
We deliver one thing — clean voice audio for AI training. Defaults below match common training norms, and every parameter adapts to your spec on intake: sample rate, file format, noise floor, license scope. You set the bar, we capture to it. Nothing else, no upsell.
- Format
- WAV · 24-bit · 48 kHz · mono · unprocessed RAW SPEC.01
- Noise floor
- ≤ -60 dBFS · captured at source · no de-noising, no plugins, no post-processing SPEC.02
- Capture
- Treated booth · large-diaphragm condenser + shotgun · breaths, clicks, mouth noise naturally minimized at source SPEC.03
- Licensing
- Perpetual · scope-defined per pack (product line · internal R&D · full buyout) · talent release on file SPEC.04
- Delivery
- Cloud bucket (your provider) · direct download — manifest + SHA-256 checksums SPEC.05
- Language
- Brazilian Portuguese (PT-BR) · neutral broadcast and regional registers · child-safe corpus separable SPEC.06
Inside the booth. What happens before delivery.
The three studio stages we run on every voice pack.
- 01
Casting & direction
Speakers cast from our exclusive Brazilian roster — directed for phonetic coverage, register variety, and emotional range. Same human, multiple takes, controlled prosody — not crowdsourced fragments from a marketplace.
Exclusive roster - 02
Studio capture
Treated booth · large-diaphragm condenser + shotgun · Apogee front-end · captured and supervised by our Grammy-credited chief engineer. Unprocessed RAW · ≤ -60 dBFS noise floor · no de-noising, no compression, no EQ — what the model trains on is what the mic heard.
Grammy-grade · RAW - 03
QA & delivery
Automated QA on noise floor, clipping, level consistency. Each take checked against the spec — yours or our defaults — before it leaves the studio. License signed at contract; files dropped straight into your bucket.
Custom-spec · audit-ready
Built for the pipelines you already run.
Diverse speakers, multiple registers, child-safe split.
Phonetically balanced audio suitable for fine-tuning open TTS models or pretraining at scale. Child-safe content delivered as a separate audio bundle.
Per-speaker audio packs — 30 min to 1,000+ hours.
Single-speaker recordings with explicit consent and likeness rights. Compatible with zero-shot, few-shot, and full fine-tune voice clone architectures.
Natural prosody, dialog turns, real disfluencies.
Conversation-grade takes captured with interjections, recoveries, and back-channels. Train voice agents that don't sound like ad reads.
Source/target paired captures.
Paired source/target voice recordings, captured to identical specs. Reference audio for training automated localization stacks targeting PT-BR.
Tell us what you're training.
Tell us your model and the volume you need. We send a free audio sample — voices across age ranges, raw WAV files, license preview — within 48 hours, plus a quote for the full voice pack.