Limited time offer: European speech recognition datasets

50% discount on up to 500 hours of data when ordered before February 28th, 2023!

Speech datasets

We’re proud to introduce the largest set of European speech recognition datasets on the market consisting of speech captured from hundreds of thousands unique speakers across Europe, ready for all speech recognition use cases.

We’re offering the first 500 hours of speech at -50% if you order before February 28th, 2023!

Languages 

Our ASR and IVR datasets will cover over 25 European languages, fully validated and transcribed by other humans. Click Order now to see the list of languages.

Specifications

The datasets contain transcribed spontaneous monologues of people talking about various topics of up to 15 minutes of speech per person. The speech is captured using mobile phones. Recordings vary in length up to 30 seconds each and are classified by background noise level, age groups, gender, and region.

License

Perpetual, reusable for commercial use. Not for reselling.

Sample

Spontaneous speech monologue sample in Swedish on the topic "Is artificial intelligence something that will affect you in the future?" (Female, 24 years)

Transcribed speech starting from

109€ / hour
218€ / hour
Order now
REQUEST MORE INFO

Pricing

Hours of free-speech monologue (Unscripted)Transcripted priceTranscripted price per hourPrice without transcriptionPrice per hour without transcription
Introduction deal: first 500 hours at -50%

109,000.00
54,500.00 €

218.00
109.00 €

81,750.00
40,875.00 €

163.50
81.75 €

250

59,000.00

236.00

44,250.00

177.00

500

109,000.00

218.00

81,750.00

163.50

1000

195,000.00

195.00

146,250.00

146.25

10000

1,665,000.00

166.50

1,248,750.00

124.88

50000

7,795,000.00

155.90

5,846,250.00

116.93

Details

Technical

SAMPLING RATE
16 – 44 kHz
BACKGROUND NOISE
Classified by noise level
RECORDINGS
1 - 30 seconds
FILE FORMAT
.wav

Demographics

AGE RANGE
16 – 85 years
GENDER
Female 40%, Male 40%, Other 20%
PROFICIENCY
Grouped by native and non-native speakers
REGION
Grouped by country of origin
”StageZero's flexibility and their professional setup in terms of data privacy made it easy for us to comply with our strict corporate policies regarding data privacy and cybersecurity. They understand our specific needs and are proactive in facilitating solutions."
Dr. Markus Weber
Senior Ink Technologist at Wacom
©2022 StageZero Technologies
linkedin facebook pinterest youtube rss twitter instagram facebook-blank rss-blank linkedin-blank pinterest youtube twitter instagram