A Novel Approach for Hindi Text Description to Speech and Expressive Speech Synthesis

Kamble Kaveri; Ramesh Kagalkar

Research Article

International Journal of Applied Information Systems

Foundation of Computer Science (FCS), NY, USA

Volume 8 - Number 7

Year of Publication: 2015

Authors: Kamble Kaveri, Ramesh Kagalkar

10.5120/ijais15-451342

Kamble Kaveri, Ramesh Kagalkar. A Novel Approach for Hindi Text Description to Speech and Expressive Speech Synthesis. International Journal of Applied Information Systems. 8, 7 (May 2015), 1-5. DOI=10.5120/ijais15-451342

@article{10.5120/ijais15-451342,
  author = {Kamble Kaveri and Ramesh Kagalkar},
  title = {A Novel Approach for Hindi Text Description to Speech and Expressive Speech Synthesis},
  journal = {International Journal of Applied Information Systems},
  issue_date = {May 2015},
  volume = {8},
  number = {7},
  month = {May},
  year = {2015},
  issn = {2249-0868},
  pages = {1-5},
  numpages = {9},
  url = {https://www.ijais.org/archives/volume8/number7/738-1342/},
  doi = {10.5120/ijais15-451342},
  publisher = {Foundation of Computer Science (FCS), NY, USA},
  address = {New York, USA}
}

%0 Journal Article
%1 2023-07-05T18:59:17.837459+05:30
%A Kamble Kaveri
%A Ramesh Kagalkar
%T A Novel Approach for Hindi Text Description to Speech and Expressive Speech Synthesis
%J International Journal of Applied Information Systems
%@ 2249-0868
%V 8
%N 7
%P 1-5
%D 2015
%I Foundation of Computer Science (FCS), NY, USA

Abstract

Communication plays a very important role in everyday life: it lets us share information from one person to another, and speech is its primary means. A Text to Speech (TTS) synthesizer is a computer-based application that reads typed text aloud. It generally works in two basic steps: text processing and speech generation. Our aim is to develop software that improves the user's speech by producing correct pronunciation of Hindi phonetics. First, a simple TTS system is built to handle Hindi-language text; Speech to Text (STT) conversion can then be performed effectively. In addition, we add expressions to obtain expressive speech synthesis for Hindi. TTS is one of the major applications of natural language processing (NLP). Expressive speech synthesis deals with synthesizing speech and adding to it expressions related to different emotions and speaking styles. Emotion is an important element in expressive speech synthesis.
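
The text-processing step named in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation: the function names (`split_sentences`, `to_aksharas`, `process_text`) are hypothetical, and the segmentation is a purely orthographic simplification that splits Devanagari words into akshara-like units without modelling schwa deletion or any other pronunciation rule. The speech-generation step is not sketched here.

```python
import re
import unicodedata

# Devanagari character classes, written as Unicode escapes.
CONSONANT = "[\u0915-\u0939]"
NUKTA = "\u093c"
VIRAMA = "\u094d"                 # halant, joins consonants into a cluster
MATRA = "[\u093e-\u094c]"         # dependent vowel signs
MODIFIER = "[\u0901-\u0903]"      # chandrabindu, anusvara, visarga
VOWEL = "[\u0905-\u0914]"         # independent vowels

# One akshara: an optional consonant cluster, a consonant, then an optional
# vowel sign and modifier; or an independent vowel with an optional modifier.
AKSHARA = re.compile(
    f"(?:{CONSONANT}{NUKTA}?{VIRAMA})*{CONSONANT}{NUKTA}?{MATRA}?{MODIFIER}?"
    f"|{VOWEL}{MODIFIER}?"
)


def split_sentences(text):
    """Normalize the text and split it on danda, double danda, '?' and '!'."""
    text = unicodedata.normalize("NFC", text)
    parts = re.split("[\u0964\u0965?!]", text)
    return [p.strip() for p in parts if p.strip()]


def to_aksharas(word):
    """Return the akshara-like units of one word; other characters are skipped."""
    return AKSHARA.findall(word)


def process_text(text):
    """Return sentences as lists of words, each word as a list of aksharas."""
    return [[to_aksharas(w) for w in s.split()] for s in split_sentences(text)]


if __name__ == "__main__":
    # "namaste bharat" followed by a danda
    sample = "\u0928\u092e\u0938\u094d\u0924\u0947 \u092d\u093e\u0930\u0924\u0964"
    for sentence in process_text(sample):
        print(sentence)
```

In a full TTS system, units like these would typically be mapped to phonemes or recorded speech segments before the speech-generation step; that mapping is outside the scope of this sketch.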

Index Terms

Computer Science

Information Sciences

Keywords

Text To Speech; Speech To Text; Boosting-Gaussian Mixture Model (GMM); Mel Frequency Cepstral Coefficient (MFCC); Prosody Conversion; Hidden Markov Model (HMM); Time Domain Pitch Synchronous Overlap Add (TD-PSOLA).