TY  - JOUR
AU  - Ahmed, Salah
PY  - 2021/11/20
Y2  - 2024/03/29
TI  - ASER: Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset
JF  - Transactions on Engineering and Computing Sciences
JA  - TECS
VL  - 9
IS  - 6
SE  - Articles
DO  - 10.14738/tmlai.96.11039
UR  - https://journals.scholarpublishing.org/index.php/TMLAI/article/view/11039
SP  - 1-8
AB  - Recently, research in speech recognition and natural language processing has advanced considerably. This is due to well-developed multilayer deep learning paradigms such as wav2vec2.0, Wav2vecU, WavBERT, and HuBERT, which learn better representations and capture more information. Such models are pre-trained on hundreds of hours of unlabeled data and then fine-tuned on a small dataset for a specific task. This paper introduces a deep-learning-based emotion recognition model for Arabic speech dialogues. The developed model employs state-of-the-art audio representations, including wav2vec2.0 and HuBERT. In our experiments, the model outperforms previously reported results.
ER  - 