Integration of the ASR Toolkit Kaldi Into a Domoticz Home Automation System
DOI:
https://doi.org/10.14738/tmlai.54.3424
Keywords:
Speech recognition, domotics, Kaldi, smart home, OPC.
Abstract
This paper presents the design and implementation of an interface between Kaldi, an automatic speech recognition toolkit, and a home automation system. The interface is based on the Open Platform Communications (OPC) protocol. The developed architecture allows vocal data to be injected into the home automation system. Developed in C++, Kaldi behaves as a client of the OPC server.