AI VOICE TRAINING / PT-BR · MODEL-READY

Brazilian Portuguese,
model-ready.

The direct alternative to enterprise voice marketplaces. Pick the voices from our exclusive Brazilian artist roster, brief the corpus, receive a custom voice pack recorded by a Grammy-credited engineer — model-ready, 10 hours to 1,000+, at up to 75% lower cost than the major voice marketplaces.

Request a sample pack → View dataset spec

PT-BR sample · WAV · 48 kHz · 24-bit · -23 LUFS

Talent · agreements on file
License · commercial · cleared
Format · WAV / FLAC · 48 kHz

[ 00 / The numbers ]

250 million plays. ~33 million hours of voice listened to.

Our voices have been heard ~3,800 years' worth of continuous play. The same speakers, the same booth — now licensed for your training pipeline.

250M⁺

YouTube plays · across our productions

2B⁺

Audio minutes streamed · est. retention

33Mh

Of voice consumed · ≈ 3,800 years

700⁺

Productions shipped · 10+ years working

[ 01 / Why DMH ]

A specialist studio. Not a marketplace.

We don't license stock libraries and we don't broker freelancers. We produce voice data — to your model's spec, in the volume you need.

An exclusive roster. Direct.

We don't broker — we represent a curated Brazilian artist network and intermediate every booking ourselves. No bidding pools, no marketplace lottery. You pick from a real list of real speakers, and we lock the cast in 48h.

Grammy-grade audio engineering.

Every voice pack is captured and supervised by our chief engineer — Grammy-credited, ten years on broadcast-grade Brazilian productions. The signal your model trains on is the signal that wins records.

Turnkey, any volume.

Casting, direction, recording, capture chain, licensing — we run the production side end-to-end. 10 hours for a voice clone, 100 for fine-tuning, 1,000+ for foundation training. You receive clean audio, ready to feed into your pipeline.

Up to 75% lower than the major marketplaces.

The large voice-talent platforms carry platform markup, per-clip licensing, escrow, and weeks of intake bureaucracy. Direct-to-studio means same quality, faster delivery, and a price that makes large datasets actually fit a training budget.

[ 02 / Voices ]

Pick from the roster. Female and male, every age and tone.

Real speakers, not stock. Browse the profiles, shortlist what fits your model, and we lock the cast within 48h. New voices and timbres added every quarter.

CHILDREN0 – 12

F · M

Sung leads & character voices.

Sopranos and child male voices, classically trained. Reliable across sung leads, dialog, character work, and emotional range. The corpus that powered 200+ children's productions.

Sung
Spoken
Character
Emotive

TEEN13 – 24

F · M

Conversational & expressive.

Young-adult tenors and altos with natural conversational texture — interjections, recoveries, turn-taking. Built for voice agents and teen-targeted TTS.

Conversational
Expressive
Agent-grade

ADULT25 – 49

F · M

Broadcast neutral & character ranges.

Adult sopranos, altos, tenors, baritones. Neutral broadcast register plus character work — calm, urgent, comedic, dramatic. The most flexible block of the catalog.

Broadcast
Character
Dialog
Documentary

MATURE50+

F · M

Narration & warm storytelling.

Documentary-grade baritones and warm female voices. Slow prosody, full dynamic range, ideal for narration, audiobooks, and reflective conversational agents.

Narration
Storytelling
Calm
Documentary

◆ Custom castings on request — bilingual, regional accents, specific timbres. New profiles added each quarter.

[ 03 / How it works ]

Three steps. From brief to voice pack.

No studio bookings, no talent contracts, no casting calls. You stay on your model — we handle the production side end-to-end.

01 YOU

Pick the voices.

Browse the roster — female & male, children to mature, neutral broadcast to character work. Shortlist the profiles that fit your model. We send samples on the same day; lock the cast within 48h.
Output → cast confirmed
02 YOU · or · US

Brief the corpus.

Send your script and target volume — or hand us the training objective and we'll write a phonetically-balanced corpus tuned to it. We confirm tone, register, pace, and edge cases (laughter, interjections, multilingual tokens) in one round.
Output → corpus + production plan
03 US

Receive your voice pack.

We record, run capture-chain QA, and license-clear. Delivery to your cloud bucket — 10h, 100h, or more — clean WAV files with manifest, checksums, and a commercial training license. One invoice, one vendor. Your pipeline takes it from there.
Output → clean audio, your spec

[ 04 / Spec ]

What arrives in your bucket.

We deliver one thing — clean voice audio for AI training. Defaults below match common training norms, and every parameter adapts to your spec on intake: sample rate, file format, noise floor, license scope. You set the bar, we capture to it. Nothing else, no upsell.

Format: WAV · 24-bit · 48 kHz · mono · unprocessed RAW
Noise floor: ≤ -60 dBFS · captured at source · no de-noising, no plugins, no post-processing
Capture: Treated booth · large-diaphragm condenser + shotgun · breaths, clicks, mouth noise naturally minimized at source
Licensing: Perpetual · scope-defined per pack (product line · internal R&D · full buyout) · talent release on file
Delivery: Cloud bucket (your provider) · direct download — manifest + SHA-256 checksums
Language: Brazilian Portuguese (PT-BR) · neutral broadcast and regional registers · child-safe corpus separable

[ 05 / Studio QA ]

Inside the booth. What happens before delivery.

The three studio stages we run on every voice pack.

01
Casting & direction

Speakers cast from our exclusive Brazilian roster — directed for phonetic coverage, register variety, and emotional range. Same human, multiple takes, controlled prosody — not crowdsourced fragments from a marketplace.
Exclusive roster
02
Studio capture

Treated booth · large-diaphragm condenser + shotgun · Apogee front-end · captured and supervised by our Grammy-credited chief engineer. Unprocessed RAW · ≤ -60 dBFS noise floor · no de-noising, no compression, no EQ — what the model trains on is what the mic heard.
Grammy-grade · RAW
03
QA & delivery

Automated QA on noise floor, clipping, level consistency. Each take checked against the spec — yours or our defaults — before it leaves the studio. License signed at contract; files dropped straight into your bucket.
Custom-spec · audit-ready

[ 06 / Use cases ]

Built for the pipelines you already run.

A. TTS foundation models

Diverse speakers, multiple registers, child-safe split.

Phonetically balanced audio suitable for fine-tuning open TTS models or pretraining at scale. Child-safe content delivered as a separate audio bundle.

B. Voice cloning

Per-speaker audio packs — 30 min to 1,000+ hours.

Single-speaker recordings with explicit consent and likeness rights. Compatible with zero-shot, few-shot, and full fine-tune voice clone architectures.

C. Conversational agents

Natural prosody, dialog turns, real disfluencies.

Conversation-grade takes captured with interjections, recoveries, and back-channels. Train voice agents that don't sound like ad reads.

D. Dubbing automation

Source/target paired captures.

Paired source/target voice recordings, captured to identical specs. Reference audio for training automated localization stacks targeting PT-BR.

[ 07 / Contact ]

Tell us what you're training.

Tell us your model and the volume you need. We send a free audio sample — voices across age ranges, raw WAV files, license preview — within 48 hours, plus a quote for the full voice pack.

Email dmhvoices@gmail.com

Studio São Paulo · BR

Response window < 48h

Brazilian Portuguese,
model-ready.

250 million plays. ~33 million hours of voice listened to.

A specialist studio. Not a marketplace.

An exclusive roster. Direct.

Grammy-grade audio engineering.

Turnkey, any volume.

Up to 75% lower than the major marketplaces.

Pick from the roster. Female and male, every age and tone.

Sung leads & character voices.

Conversational & expressive.

Broadcast neutral & character ranges.

Narration & warm storytelling.

Three steps. From brief to voice pack.

Pick the voices.

Brief the corpus.

Receive your voice pack.

What arrives in your bucket.

Inside the booth. What happens before delivery.

Casting & direction

Studio capture

QA & delivery

Built for the pipelines you already run.

Diverse speakers, multiple registers, child-safe split.

Per-speaker audio packs — 30 min to 1,000+ hours.

Natural prosody, dialog turns, real disfluencies.

Source/target paired captures.

Tell us what you're training.

Brazilian Portuguese, model-ready.

250 million plays. ~33 million hours of voice listened to.

A specialist studio. Not a marketplace.

An exclusive roster. Direct.

Grammy-grade audio engineering.

Turnkey, any volume.

Up to 75% lower than the major marketplaces.

Pick from the roster. Female and male, every age and tone.

Sung leads & character voices.

Conversational & expressive.

Broadcast neutral & character ranges.

Narration & warm storytelling.

Three steps. From brief to voice pack.

Pick the voices.

Brief the corpus.

Receive your voice pack.

What arrives in your bucket.

Inside the booth. What happens before delivery.

Casting & direction

Studio capture

QA & delivery

Built for the pipelines you already run.

Diverse speakers, multiple registers, child-safe split.

Per-speaker audio packs — 30 min to 1,000+ hours.

Natural prosody, dialog turns, real disfluencies.

Source/target paired captures.

Tell us what you're training.

Brazilian Portuguese,
model-ready.