Romanian speech sentiment and emotion dataset

Develop new sentiment and emotion AI algorithms, or improve existing ones, with our Romanian speech sentiment dataset, which covers six basic emotions: anger, fear, joy, love, sadness, and surprise.


Romanian speech sentiment dataset

Deploy new Romanian speech sentiment or emotion algorithms, or improve your existing ones, in days with our ready-to-use speech emotion dataset.

The dataset consists of recordings collected from thousands of people speaking Romanian. The emotions and sentiment covered are: anger, fear, joy, love, sadness, and surprise.

Furthermore, the recordings are validated by humans and transcripts are available.

Regions

Romanian language: native and non-native Romanian speakers.


Specifications

The speech emotion dataset contains audio clips of people recording themselves speaking with different emotions, with up to 15 minutes of speech per person. The speech is captured on mobile phones from a diverse crowd of speakers representing all ages and backgrounds, which makes the dataset well suited to use cases involving mobile devices.

The recordings vary in length, averaging about 5 seconds per clip. They are classified by background noise level, age group, gender, and region. The recordings are transcribed verbatim; spontaneous speech is transcribed exactly as the speaker said it.
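As a rough illustration of how these classifications could be used, the sketch below selects clips by emotion and noise level. The `Clip` fields, the `select_clips` helper, the file paths, and the label values such as "low" are hypothetical and chosen only for this example; the actual metadata layout delivered with the dataset may differ.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Clip:
    # Hypothetical metadata record; field names are assumptions for this sketch.
    path: str
    emotion: str
    noise_level: str
    age_group: str
    gender: str
    region: str


def select_clips(
    clips: List[Clip],
    emotion: Optional[str] = None,
    noise_level: Optional[str] = None,
) -> List[Clip]:
    """Return the clips matching every filter that is not None."""
    return [
        c
        for c in clips
        if (emotion is None or c.emotion == emotion)
        and (noise_level is None or c.noise_level == noise_level)
    ]


# Made-up records, for illustration only.
clips = [
    Clip("a.wav", "joy", "low", "25-34", "female", "RO"),
    Clip("b.wav", "anger", "high", "35-44", "male", "RO"),
    Clip("c.wav", "joy", "high", "16-24", "male", "RO"),
]

quiet_joy = select_clips(clips, emotion="joy", noise_level="low")
```

Filtering on noise level like this is one way to build a clean training subset while keeping the noisier clips for robustness testing.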

The terms emotion AI and sentiment are sometimes used interchangeably. Emotion AI is an umbrella term for various algorithms, of which sentiment analysis is one part.

Dataset license

Our data license agreement covers commercial use, and the datasets can be reused across multiple use cases. However, they may not be resold.

Sample

Samples available upon request.

What is sentiment and emotion AI?

Read more about sentiment analysis here. And more about emotion recognition here.

Quality guarantee

We are confident in the quality of our data, and all customers can review a sample batch of data before buying. Request samples when filling in the Order now form.

Romanian speech sentiment data starting from

1.49€ / recording
Order now

Sentiment dataset details

Technical

SAMPLING RATE
16 – 44 kHz
BACKGROUND NOISE
Classified by noise level
SPEECH EMOTIONS
Anger, fear, joy, love, sadness, and surprise
FILE FORMAT
.wav
RECORDINGS
5 seconds average
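
A minimal sketch of how a received .wav file could be checked against the specifications above, using only Python's standard `wave` module. The helper names `describe_wav` and `within_spec` are hypothetical, and the upper bound of 44,100 Hz is an assumption: the listing says "44 kHz", which we read here as the common 44.1 kHz rate.

```python
import wave


def describe_wav(path):
    """Read the sampling rate and duration of an uncompressed .wav file."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()   # samples per second
        frames = wav.getnframes()   # total number of audio frames
    return {"sample_rate_hz": rate, "duration_s": frames / rate}


def within_spec(info, min_hz=16_000, max_hz=44_100):
    """Check the sampling rate against the listed range.

    The default bounds are assumptions based on the "16 - 44 kHz" entry.
    """
    return min_hz <= info["sample_rate_hz"] <= max_hz
```

For example, `within_spec(describe_wav("clip.wav"))` would return `True` for a 16 kHz recording, where `clip.wav` stands for any file from a sample batch.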

Demographics

AGE RANGE
16 – 85 years
GENDER
Female 40%, Male 60%
PROFICIENCY
Grouped by native and non-native Romanian speakers
REGION
Grouped by country of origin and region within country

“Partnering with StageZero has been vital in providing us with high-quality utterance corpora for training our proprietary language and semantic models.”
Dr. Christoph Neumann
CTO at German Autolabs

Custom training data collection for speech and NLP.

Didn’t find the speech dataset you need or your industry in our marketplace? Get in touch with us, so we can use our global network to source the training or testing data that fits your needs.
TELL US ABOUT YOUR NEEDS
©2022 StageZero Technologies