Language: Marathi
DB Name: MAR_ASR001
Product Type: Conversational Telephony
Environment: Mixed
Speaker: 1,000
Prompts/spkr:
Utterances:
Audio Hrs: 108
kHz: 8
Channels: 2
Description

  • This is a 1,000 speaker conversational telephony database
  • Approximately 54 hours of conversation data (equivalent to 108 hours of single channel audio).
  • Portion of the database are transcribed and time stamped. Full transcripts can be made available.
  • 50% male, 50% female
  • Broad distribution of age groups (16-60 years)
  • Roughly even distribution of speakers from the following dialect region groups: Khandeshi, Nagpuri, Puneri
  • Approximately 16% landline/84% mobile
  • Medium Level background noise - in-car, home/office, roadside and other public place type environments
  • Speakers speak on a range of generic topics
  • Database is accompanied by a pronunciation lexicon containing all transcribed words
Source: Appen