A Grapheme-based Text to Speech System for Yoruba Language
Itunuoluwa Isewon, Adejumoke Famade and Jelili Oyelade. A Grapheme-based Text to Speech System for Yoruba Language. International Journal of Applied Information Systems 12(39):1-5, April 2022. URL, DOI BibTeX
@article{10.5120/ijais2022451924, author = "Itunuoluwa Isewon and Adejumoke Famade and Jelili Oyelade", title = "A Grapheme-based Text to Speech System for Yoruba Language", journal = "International Journal of Applied Information Systems", issue_date = "April 2022", volume = 12, number = 39, month = "April", year = 2022, issn = "2249-0868", pages = "1-5", url = "http://www.ijais.org/archives/volume12/number39/1124-2022451924", doi = "10.5120/ijais2022451924", publisher = "Foundation of Computer Science (FCS), NY, USA", address = "New York, USA" }
Abstract
A Text to Speech synthesizer is an application that converts text to speech; the process of conversion is achieved by the application of the analysis of natural and digital signal processing on the input text. Over the years, some Text to Speech systemshas been built for different languages e.g. English, Kiswahili, German, French, Telugu, Mandarin, etc. The aim of this study is to develop an application for Yoruba Text to speech synthesis.
The Mary Text to Speech framework was used to implement a speech synthesizer for the Yoruba language. The Text to Speech system developed can convert an input text into speech sound in Yoruba. This application allows a user to input a text and the engine reads out the text as a synthesized speech. The user is also able to upload a text file and the engine also reads the uploaded text file, the user can save the synthesized speech inthe local storage ofthe system.
A basic graphical user interface has been produced that plays out the essential elements of a speech synthesizer like conversion and speech synthesis. Agrapheme-based speech synthesizer has been produced for the Yoruba language which is a tonal language.
Reference
- Isewon, I., Oyelade, J., &Oladipupo, O. (2014). Design and Implementation of Text to Speech Conversion for Visually Impaired People. International Journal of Applied Information Systems, 7(2), 25–30.
- Dutoit, T., 1993. High quality text-to-speech synthesis ofthe French language. Doctoral dissertation, FacultePolytechnique de Mons.
- Duanmu, S. (2004). Tone and Non-Tone Languages: An Alternative to Language Typology and Parameters. Language and Linguistics, 5(4), 891–924.
- Ekekwe, N. (2015). Development of Text to Speech Applications. Covenant University.
- Byrne, B. (2014). Engineering Part IIB?: Module 4F11 Speech and Language Processing Lectures 9 & 10?: Weighted Finite State Transducers for Speech and Language Processing.
- Oloruntoyin, S. T. (2014). DEVELOPMENT OF YORUBA LANGUAGE TEXT-TO-SPEECH E-LEARNING SYSTEM. International Journal of Scholarly Research Gate, 2(1), 19–36.
- Shih, C., &Sproat, R. (1996). Issues in text-to-speech conversion for Mandarin. Computational Linguistics and Chinese Language Processing, 1(1), 37–86.
- Polomé, E. C. (1967). Swahili language handbook.
- Laver, J. D. M. (1970). The production of speech. New Horizons in Linguistics, 53–75. https://doi.org/10.1007/978-1-4613-8202-7_6
- Sarroff, A. M., & Casey, M. (2014). Musical Audio Synthesis Using Autoencoding Neural Nets. Proceedings of the International Computer Music Conference, 1(September), 14–20.
- Raitio, T. (2008). Hidden Markov Model Based Finnish Text-to-Speech System Utilizing Glottal Inverse Filtering. Master’s Thesis, HELSINKI UNIVERSITY OF TECHNOLOGY. Retrieved from http://lib.tkk.fi/Dipl/2008/urn012274.pdf
- Afolabi, A., Omidiora, E., &Arulogun, T. (2013). Development of text to speech system for yoruba language. In Innovative Systems Design and Engineering—Special Issue of 2nd International Conference on Engineering and Technology Research (Vol. 4, pp. 1–8).
- Gakuru, M., Iraki, F. K., Tucker, R., Shalonova, K., & Ngugi, K. (2005). Development of a Kiswahili text to speech system. Interspeech-2005, 1481–1484.
- Charfuelan, M., Pammi, S., Steiner, I., Pierre, U., Upmc, C., & Tts, M. (2013). MARY TTS unit selection and HMM-based voices for the Blizzard Challenge 2013.
- Kamble, K., &Kagalkar, R. (2014). A Review?: Translation of Text to Speech Conversion for Hindi Language, 3(11), 1027–1031.
- Schröder, M., &Trouvain, J. (2003a). The German text-to-speech synthesis system MARY: A tool for research, development and teaching. International Journal of Speech Technology, 6(4), 365–377.
- Schröder, M., &Trouvain, J. (2003b). The German Text-to-Speech Synthesis System MARY: A Tool for Research, Development and Teaching. International Journal of Speech Technology, 6(4), 365–377. https://doi.org/10.1023/A:1025708916924
- Schröder, M., &Trouvain, J. (2003). The German Text-to-Speech Synthesis System MARY: A Tool for Research, Development and Teaching. International Journal of Speech Technology, 6, 365–377. https://doi.org/10.1023/A:1025708916924.
- Raitio, T., Suni, A., Yamagishi, J., Pulakka, H., Nurminen, J., Vainio, M., &Alku, P. (2011). HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering. IEEE Transactions on Audio, Speech, and Language Processing, 19(1), 153–165. https://doi.org/10.1109/TASL.2010.2045239
Keywords
Text to Speech, Speech synthesis, Mary tool, Yoruba language