Language: Mandarin
DB Name: MAC_ASR001
Product Type: Telephony
Environment: Mixed
Speaker: 2,000
Prompts/spkr: 98
Utterances: 2,00,000
Audio Hrs: 323.3
kHz: 8
Channels: 1
Description

  • This is a 2,000 speaker Mandarin mobile telephony speech data collection
  • The database comprises 2,000 Mandarin speakers recorded on location in China
  • 2,000 speakers recorded in China
  • 50% male, 50% female
  • 100% Mobile Telephony
  • Broad distribution of age groups (16-60 years) Language Materials
  • 98 prompts per speaker, including:
  • Digits
  • Natural Numbers
  • Personal, place, and business names
  • Confirmation items (yes, no + fuzzy)
  • Generic Command and Control items
  • Phonetically rich Sentences and Words
  • Transcriptions
  • Fully transcribed to SpeechDAT type conventions
  • Lexicon
  • Database is accompanied by a pronunciation lexicon [SAMPA] containing all transcribedwords
Source: Appen