Title: An octonion-based nonlinear echo state network for speech emotion recognition in Metaverse
Authors: Daneshfar, Fatemeh
Jamshidi, Mohammad
Source document citation: DANESHFAR, F. JAMSHIDI, M. An octonion-based nonlinear echo state network for speech emotion recognition in Metaverse. Neural Networks, 2023, vol. 163, no. June 2023, pp. 108-121. ISSN: 0893-6080
Date issued: 2023
Publisher: Elsevier
Document type: article
URI: 2-s2.0-85151634585
http://hdl.handle.net/11025/53028
ISSN: 0893-6080
Keywords: speech emotion recognition; digital twins; metaverse; octonion algebra; echo state network; machine learning
Abstract: While the Metaverse is becoming a popular trend and drawing much attention from academia, society, and businesses, the processing cores used in its infrastructure need to be improved, particularly in terms of signal processing and pattern recognition. Accordingly, speech emotion recognition (SER) plays a crucial role in making Metaverse platforms more usable and enjoyable for their users. However, existing SER methods continue to be plagued by two significant problems in the online environment. The first is the shortage of adequate engagement and customization between avatars and users; the second is the complexity of SER problems in the Metaverse, where we face both people and their digital twins or avatars. This is why developing efficient machine learning (ML) techniques designed for hypercomplex signal processing is essential to make Metaverse platforms more impressive and tangible. As a solution, echo state networks (ESNs), which are a powerful ML tool for SER, can be an appropriate technique to strengthen the Metaverse's foundations in this area. Nevertheless, ESNs have some technical issues that keep them from precise and reliable analysis, especially with high-dimensional data. The most significant limitation of these networks is the high memory consumption caused by their reservoir structure when facing high-dimensional signals. To solve the problems associated with ESNs and their application in the Metaverse, we propose a novel structure for ESNs empowered by octonion algebra, called NO2GESNet. Octonion numbers have eight dimensions, represent high-dimensional data compactly, and improve network precision and performance in comparison to conventional ESNs. The proposed network also addresses the weakness of ESNs in presenting higher-order statistics to the output layer by equipping it with a multidimensional bilinear filter. Three comprehensive scenarios for using the proposed network in the Metaverse have been designed and analyzed; they show not only the accuracy and performance of the proposed approach, but also the ways in which SER can be employed in Metaverse platforms.
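Illustrative note (not part of the record): the abstract states that octonion numbers have eight dimensions. The following is a minimal sketch of octonion multiplication built with the Cayley-Dickson construction, representing each octonion as a length-8 NumPy array. It is not the NO2GESNet implementation; the function names conj, mul, and basis are hypothetical and chosen only for this illustration.

```python
import numpy as np

def conj(x):
    """Cayley-Dickson conjugate: (a, b)* = (a*, -b); a real is its own conjugate."""
    if len(x) == 1:
        return x.copy()
    h = len(x) // 2
    return np.concatenate([conj(x[:h]), -x[h:]])

def mul(x, y):
    """Cayley-Dickson product: (a, b)(c, d) = (ac - d* b, da + b c*).

    Works for arrays of length 1, 2, 4, or 8 (reals, complex numbers,
    quaternions, octonions), recursing on the two halves.
    """
    if len(x) == 1:
        return x * y
    h = len(x) // 2
    a, b = x[:h], x[h:]
    c, d = y[:h], y[h:]
    first = mul(a, c) - mul(conj(d), b)
    second = mul(d, a) + mul(b, conj(c))
    return np.concatenate([first, second])

def basis(i):
    """Return the i-th octonion basis element e_i (e_0 is the real unit)."""
    e = np.zeros(8)
    e[i] = 1.0
    return e

# Octonion multiplication is not associative: the two groupings below
# differ by a sign.
left = mul(mul(basis(1), basis(2)), basis(4))
right = mul(basis(1), mul(basis(2), basis(4)))
```

Under this construction each imaginary unit squares to minus the real unit and the Euclidean norm is multiplicative, but the order of grouping matters, which is one reason octonion-valued networks need care when chaining products.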
Rights: Full text is accessible within the university to logged-in users
© Elsevier
Appears in collections: Články / Articles (KEV)
OBD

Files attached to this record:
File: Jamshidi_1-s2.0-S0893608023001600-main.pdf
Size: 3.47 MB
Format: Adobe PDF


Use this identifier to cite or link to this record: http://hdl.handle.net/11025/53028

All records in DSpace are protected by copyright, all rights reserved.
