Language: English (Indian)
DB Name: ENI_ASR001
Product Type: Telephony
Environment: Mixed
Speakers: 2,358
Prompts/spkr: 49
Utterances: 117,900
Audio Hrs: 217
kHz: 8
Channels: 1
Description

  • This is a 2,358-speaker Indian English mobile telephony speech database recorded on location in India
Database Type
  • Medium-level background noise: in-car, home/office, roadside and other public-place environments
  • Total audio length: approximately 217 hours
Demographics
  • 2,358 speakers recorded in India
  • 50% male, 50% female
  • Broad distribution of age groups (16-60 years) and dialects
  • Roughly even distribution of native speakers of Hindi/Urdu, Tamil, Gujarati, Telugu, Malayalam, Kannada, Bengali, Assamese, Punjabi and Marathi
  • 100% mobile telephones
Language Materials
  • 49 prompts per speaker, including Digits; Natural Numbers; Personal, Place, and Business Names; Confirmation items (yes, no + fuzzy); Generic Command and Control items; and Phonetically rich Sentences and Words
Transcription and Lexicon
  • Fully transcribed to SpeechDAT type conventions.
  • The database is accompanied by a pronunciation lexicon [SAMPA] containing all transcribed words.
  • Lexicon: 10,128 unique headwords
Source: Appen
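
The headline figures above can be combined into a few derived numbers that are handy when planning storage or training time. The following is a rough sketch only: the speaker count, utterance count, audio hours, sample rate and channel count are taken from this listing, while the 16-bit linear PCM sample width is an assumption (the listing does not state the encoding, and telephony corpora are sometimes delivered as 8-bit A-law or mu-law instead). The function name `summarize` is illustrative.

```python
# Figures copied from the listing above.
SPEAKERS = 2358
UTTERANCES = 117_900
AUDIO_HOURS = 217          # stated as approximate
SAMPLE_RATE_HZ = 8_000
CHANNELS = 1

# Assumption, not stated in the listing: 16-bit linear PCM (2 bytes per sample).
BYTES_PER_SAMPLE = 2


def summarize():
    """Return derived statistics computed from the listed figures."""
    total_seconds = AUDIO_HOURS * 3600
    raw_bytes = total_seconds * SAMPLE_RATE_HZ * CHANNELS * BYTES_PER_SAMPLE
    return {
        "avg_utterance_seconds": total_seconds / UTTERANCES,
        "avg_minutes_per_speaker": total_seconds / 60 / SPEAKERS,
        "approx_raw_size_gb": raw_bytes / 1e9,
    }


if __name__ == "__main__":
    for name, value in summarize().items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions the average utterance is roughly 6.6 seconds, each speaker contributes about 5.5 minutes of audio, and the uncompressed audio occupies on the order of 12.5 GB. With an 8-bit encoding the size estimate would halve.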