Language: Hindi
DB Name: HIN_ASR001
Product Type: Telephony
Environment: Low background noise
Speaker: 1,920
Prompts/spkr: 50
Utterances: 96,000
Audio Hrs: 224
kHz: 8
Channels: 1
Description

  • This is a 1,920 speaker Hindi mobile telephony speech database. The database comprises1,920 speakers who speak Hindi as a second language (i.e. native speakers of Telugu,Gujarati, etc who use Hindi as a second language) recorded on location in India
  • Database Type
  • 1,920 speakers recorded in India
  • 50% male, 50% female
  • Broad distribution of age groups (16-60 years) and dialects
  • Roughly even distribution of speakers whose native language is: Urdu, Tamil, Gujarati, Telugu, Malayalam, Kannada, Bengali, Assamese, Punjabi, Marathi
  • 100% mobile telephones
  • Medium Level background noise - in-car, home/office, roadside and other publicplace type environments
  • Language Materials
  • 50 prompts per speaker, including Digits; Natural Numbers; Personal, Place and
  • Business names; Confirmation items (yes, no + fuzzy); Generic Command andControl items; Phonetically rich Sentences and Words; and Web addresses
  • Transcriptions
  • Fully transcribed to SpeechDAT type conventions
  • Lexicon
  • Database is accompanied by a pronunciation lexicon [SAMPA] containing alltranscribed words
  • Lexicon - 9,853 unique headwords
  • Total audio length - Approximately 224 hours
Source: Appen