Gain immediate access to a complete speech and language database to accelerate your product development efforts
Whether you are working on a text-to-speech system, a voice recognition system or another solution that relies on speech recognition databases, high quality licensed speech data allows you go to market faster, and reach more potential customers.
Our high-quality licensable materials cover:
- fully transcribed speech recognition databases for broadcast, call center, in-car and telephony applications
- pronunciation lexicons, both general and domain specific (e.g. names, places, natural numbers)
- POS-tagged lexicons and thesauri
- speech corpora annotated for POS, morphological information and named entities
Appen has an extensive catalog of off-the-shelf, licensable speech databases ready to ship. We even cover low-resource languages, including dialects from West and North Asia, the Middle East and Africa.
Got Data? The Importance of High-Quality Data for Building Effective Machine Learning-Based Solutions [AI Trends Webinar]
Watch this webinar for key insights on how to collect data for machine learning, including pros and cons and trade offs that come with different approaches. When it comes to annotating data for academic purposes, there are specific industry standards that are commonly used. However, when it comes to the commercial sector, building a solution that relies on machine learning …
AI Requires a Human Touch: How Appen Recruits Crowds to Improve Technology
Kerri Reynolds, Sr. VP of Human Resources and Crowdsourcing, shares insights into crowdsourcing and crowd recruiting.
Appen Gears Up for Big Presence at Interspeech 2017
Appen to present research on speech recognition, speaker comparisons, and Pashto speech data at this year’s Interspeech conference.