TESS Dataset: The Toronto Emotional Speech Set (TESS) contains a collection of short, semantically neutral, and emotionally spoken sentences. It includes seven emotions: anger, disgust, fear, happiness, pleasant surprise, sadness, and neutral. Each emotion is represented by a different speaker. The dataset consists of 2800 audio files, each lasting approximately 3.5 seconds.