Gain immediate access to a complete speech and language database to accelerate your product development efforts

Use case
Whether you are working on a text-to-speech system, a voice recognition system or another solution that relies on speech recognition databases, high-quality licensed speech data allows you to go to market faster and reach more potential customers.

Our approach
Our high-quality licensable materials cover:
- fully transcribed speech recognition databases for broadcast, call center, in-car and telephony applications
- pronunciation lexicons, both general and domain-specific (e.g. names, places, natural numbers)
- POS-tagged lexicons and thesauri
- speech corpora annotated for POS, morphological information and named entities

Why Appen?
Appen has an extensive catalog of off-the-shelf, licensable speech databases ready to ship. We even cover low-resource languages, including dialects from West and North Asia, the Middle East and Africa.

Additional resources
Got Data? The Importance of High-Quality Data for Building Effective Machine Learning-Based Solutions [AI Trends Webinar]
Watch this webinar for key insights on how to collect data for machine learning, including the pros, cons and trade-offs of different approaches. For annotating data for academic purposes, there are specific industry standards in common use. However, when it comes to the commercial sector, building a solution that relies on machine learning …