SCOPUS 정보 검색 플랫폼

Volumn 24, Issue 3, 2010, Pages 474-494

Voice conversion by mapping the speaker-specific features using pitch synchronous approach

a INDIAN INSTITUTE OF TECHNOLOGY (India)

Author keywords

ABX test; Duration and energy patterns; Excitation source; Feedforward neural network (FFNN); Glottal closure; Instants of significant excitation (epochs); LP residual; Mapping function; Mean opinion score (MOS); Objective measures; Pitch contour; Prosody characteristics; Voice conversion

Indexed keywords

ABX TEST; ENERGY PATTERNS; EXCITATION SOURCE; EXCITATION SOURCES; MAPPING FUNCTIONS; MEAN OPINION SCORES; OBJECTIVE MEASURE; PITCH CONTOURS; VOICE CONVERSION;

FEEDFORWARD NEURAL NETWORKS; GROUP DELAY; PHOTOMAPPING; SPEECH PROCESSING; STATISTICAL TESTS;

CONTINUOUS SPEECH RECOGNITION;

EID: 77950029338 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2009.03.003 Document Type: Article

Times cited : (53)

References (38)

1
- 0023739214
- Voice conversion through vector quantization
- Abe, M., Nakanura, S., Shikano, K., Kuwabara, H., 1998. Voice conversion through vector quantization. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 655-658.
- (1998) Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 655-658
- Abe, M.¹ Nakanura, S.² Shikano, K.³ Kuwabara, H.⁴

2
- 0033154052
- Speaker transformation algorithm using segmental code books (STASC)
- Arslan L.M. Speaker transformation algorithm using segmental code books (STASC). Speech Communication 28 (1999) 211-226
- (1999) Speech Communication , vol.28 , pp. 211-226
- Arslan, L.M.¹

3
- 1942535983
- Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances
- Chennai, India, pp
- Gangashetty, S.V., Sekhar, C.C., Yegnanarayana, B., 2004. Extraction of fixed dimension patterns from varying duration segments of consonant-vowel utterances. In: Proceedings of IEEE International Conference on Intelligent Sensing and Information Processing, Chennai, India, pp. 159-164.
- (2004) Proceedings of IEEE International Conference on Intelligent Sensing and Information Processing , pp. 159-164
- Gangashetty, S.V.¹ Sekhar, C.C.² Yegnanarayana, B.³

4
- 0003413187
- Pearson Education Aisa, Inc., New Delhi, India
- Haykin S. Neural Networks: A Comprehensive Foundation (1999), Pearson Education Aisa, Inc., New Delhi, India
- (1999) Neural Networks: A Comprehensive Foundation
- Haykin, S.¹

5
- 0003962869
- Macmillan Publishing Company, 866 Third Avenue, New York, USA
- Hogg R.V., and Ledolter J. Engineering Statistics (1987), Macmillan Publishing Company, 866 Third Avenue, New York, USA
- (1987) Engineering Statistics
- Hogg, R.V.¹ Ledolter, J.²

6
- 33947693233
- Master's Thesis, St. Edmunds College, University of Cambridge
- Inanoglu, Z., Transforming Pitch in A Voice Conversion Framework. Master's Thesis, St. Edmunds College, University of Cambridge.
- Transforming Pitch in A Voice Conversion Framework
- Inanoglu, Z.¹

7
- 4444285698
- PhD Thesis, OGI School of Science and Engineering, Oregon Health and Science University, USA
- Kain, A., 2001. High Resolution Voice Transformation. PhD Thesis, OGI School of Science and Engineering, Oregon Health and Science University, USA.
- (2001) High Resolution Voice Transformation
- Kain, A.¹

8
- 0034841948
- Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
- Kain, A., Macon, M.W., 2001. Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction. In: Proceedings of IEEE International Conference of Acoustics, Speech, Signal Processing, vol. 2. pp. 813-816.
- (2001) Proceedings of IEEE International Conference of Acoustics, Speech, Signal Processing , vol.2 , pp. 813-816
- Kain, A.¹ Macon, M.W.²

9
- 77950064323
- Web-based listening test system for speech synthesis and speech conversion evaluation
- Marrakech Morocco
- Laurent Blin, O.B., Barreaud, V., 2008. Web-based listening test system for speech synthesis and speech conversion evaluation. In: Proceedings of LREC (Marrakech (Morocco)).
- (2008) Proceedings of LREC
- Laurent Blin, O.B.¹ Barreaud, V.²

10
- 38149065136
- Statistical approach for voice personality transformation
- Lee K. Statistical approach for voice personality transformation. IEEE Transactions on Audio, Speech, and Lnguage Processing 15 (2007) 641-651
- (2007) IEEE Transactions on Audio, Speech, and Lnguage Processing , vol.15 , pp. 641-651
- Lee, K.¹

11
- 0030365550
- A new voice personality transformation based on both linear and nonlinear prediction analysis
- Lee, K.S., Youn, D.H., Cha, I.W., 1996. A new voice personality transformation based on both linear and nonlinear prediction analysis. In: Proceedings of International Conference on Spoken Language Processing, pp. 1401-1404.
- (1996) Proceedings of International Conference on Spoken Language Processing , pp. 1401-1404
- Lee, K.S.¹ Youn, D.H.² Cha, I.W.³

12
- 77950024658
- Xiao-dan Mei, Sheng-he Sun, 2000. An Efficient Method to Compute LSFs From LPC Coefficients, In: ICSP-2000, pp. 655-658.
- Xiao-dan Mei, Sheng-he Sun, 2000. An Efficient Method to Compute LSFs From LPC Coefficients, In: ICSP-2000, pp. 655-658.

13
- 0000668614
- Robustness of group-delay-based method for extraction of significant excitation from speech signals
- Murthy P.S., and Yegnanarayana B. Robustness of group-delay-based method for extraction of significant excitation from speech signals. IEEE Transactions on Speech and Audio Processing 7 (1999) 609-619
- (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 609-619
- Murthy, P.S.¹ Yegnanarayana, B.²

14
- 0029254176
- Transformation of formants for voice conversion using artificial neural networks
- Narendranadh M., Murthy H.A., Rajendran S., and Yegnanarayana B. Transformation of formants for voice conversion using artificial neural networks. Speech Communication 16 (1995) 206-216
- (1995) Speech Communication , vol.16 , pp. 206-216
- Narendranadh, M.¹ Murthy, H.A.² Rajendran, S.³ Yegnanarayana, B.⁴

15
- 0003513556
- Prentice-Hall, Upper Saddle River, NJ
- Oppenheim A.V., Schafer R.W., and Buck J.R. Discrete-time Signal Processing (1999), Prentice-Hall, Upper Saddle River, NJ
- (1999) Discrete-time Signal Processing
- Oppenheim, A.V.¹ Schafer, R.W.² Buck, J.R.³

16
- 51449094434
- Voice conversion with linear prediction residual estimation
- Percybrooks, W.S., Moore II, E., 2008. Voice conversion with linear prediction residual estimation. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 4673-4676.
- (2008) Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 4673-4676
- Percybrooks, W.S.¹ Moore II, E.²

17
- 33745205178
- PhD Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India
- Prasanna, S.R.M., 2004. Event-Based Analysis of Speech. PhD Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India.
- (2004) Event-Based Analysis of Speech
- Prasanna, S.R.M.¹

18
- 0036288088
- Detection of vowel onset point in speech
- Orlando, Florida, USA
- Prasanna, S.R.M., Zachariah, J.M., 2002. Detection of vowel onset point in speech. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, Orlando, Florida, USA.
- (2002) Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing
- Prasanna, S.R.M.¹ Zachariah, J.M.²

19
- 52249110270
- Transformation of speaker characteristics in speech using support vector machines
- Guwahati, India
- Rao, K.S., Koolagudi, S.G., 2007. Transformation of speaker characteristics in speech using support vector machines. In: 15th International Conference on Advanced Computing and Communication (ADCOM-2007), Guwahati, India.
- (2007) 15th International Conference on Advanced Computing and Communication (ADCOM-2007)
- Rao, K.S.¹ Koolagudi, S.G.²

20
- 34047248058
- Prosody modification using instants of significant excitation
- Rao K.S., and Yegnanarayana B. Prosody modification using instants of significant excitation. IEEE Transactions on Audio, Speech, and Language Processing 14 (2006) 972-980
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , pp. 972-980
- Rao, K.S.¹ Yegnanarayana, B.²

21
- 36048979629
- Voice conversion by prosody and vocal tract modification
- Bhubaneswar, Orissa, India
- Rao, K.S., Yegnanarayana, B., 2006. Voice conversion by prosody and vocal tract modification. In: Ninth International Conference on Information Technology, Bhubaneswar, Orissa, India.
- (2006) Ninth International Conference on Information Technology
- Rao, K.S.¹ Yegnanarayana, B.²

22
- 33750713338
- Modeling durations of syllables using neural networks
- Rao K.S., and Yegnanarayana B. Modeling durations of syllables using neural networks. Computer Speech and Language 21 (2007) 282-295
- (2007) Computer Speech and Language , vol.21 , pp. 282-295
- Rao, K.S.¹ Yegnanarayana, B.²

23
- 77950056334
- Voice transformation by mapping the features at syllable level
- Kolkata, India
- Rao, K.S., Laskar, R.H., Koolagudi, S.G., 2007. Voice transformation by mapping the features at syllable level. In: Second International Conference on Pattern Recognition and Machine Intelligence (Premi-2007), Kolkata, India.
- (2007) Second International Conference on Pattern Recognition and Machine Intelligence (Premi-2007)
- Rao, K.S.¹ Laskar, R.H.² Koolagudi, S.G.³

24
- 0029375490
- Determination of instants of significant excitation in speech using group delay function
- Smits R., and Yegnanarayana B. Determination of instants of significant excitation in speech using group delay function. IEEE Transactions on Speech and Audio Processing 3 (1995) 325-333
- (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , pp. 325-333
- Smits, R.¹ Yegnanarayana, B.²

25
- 4544369752
- Extraction of pitch in adverse conditions
- Montreal, Canada
- Prasanna, S.R.M., Yegnanarayana, B., 2004. Extraction of pitch in adverse conditions. In: IEEE International Conference on Acoustics Speech and Audio Processing, Montreal, Canada.
- (2004) IEEE International Conference on Acoustics Speech and Audio Processing
- Prasanna, S.R.M.¹ Yegnanarayana, B.²

26
- 0032026483
- Continuous probabilistic transform for voice conversion
- Stylianou Y., Cappe Y., and Moulines E. Continuous probabilistic transform for voice conversion. IEEE Transactions Speech and Audio Processing 6 (1998) 131-142
- (1998) IEEE Transactions Speech and Audio Processing , vol.6 , pp. 131-142
- Stylianou, Y.¹ Cappe, Y.² Moulines, E.³

27
- 77950044335
- Voice conversion: State-of-the-art and future work
- Munich, Germany
- Sundermann, D., 2005. Voice conversion: state-of-the-art and future work. In: Proceedings of DAGA: 31st German Annual Conference on Acoustics, Munich, Germany.
- (2005) Proceedings of DAGA: 31st German Annual Conference on Acoustics
- Sundermann, D.¹

28
- 33646767751
- A study on residual prediction techniques for voice conversion
- Sundermann, D., Bonafonte, A., Ney, H., 2005. A study on residual prediction techniques for voice conversion. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 13-16.
- (2005) Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 13-16
- Sundermann, D.¹ Bonafonte, A.² Ney, H.³

29
- 77950036367
- Tc-star: Evaluation plan for voice conversion technology
- Munich, Germany
- Sundermann, D., Bonafonte, A., Duxans, H., Hoege, H., 2005. Tc-star: evaluation plan for voice conversion technology. In: Proceedings of DAGA: 31st German Annual Conference on Acoustics, Munich, Germany.
- (2005) Proceedings of DAGA: 31st German Annual Conference on Acoustics
- Sundermann, D.¹ Bonafonte, A.² Duxans, H.³ Hoege, H.⁴

30
- 0034842552
- Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
- Toda, T., Saruwatari, H., Shikano, K., 2001. Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, vol. 2. pp. 841-844.
- (2001) Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing , vol.2 , pp. 841-844
- Toda, T.¹ Saruwatari, H.² Shikano, K.³

31
- 77950029784
- PhD Thesis, Institute for Graduate Studies in Science and Engineering, Bogaziti University, Berlin, Germany
- Turk, O., 2007. Cross-lingual Voice Conversion. PhD Thesis, Institute for Graduate Studies in Science and Engineering, Bogaziti University, Berlin, Germany.
- (2007) Cross-lingual Voice Conversion
- Turk, O.¹

32
- 85009250849
- Subband based voice conversion
- Denver-Colorado, USA
- Turk, O., Arslan, L.M., 2002. Subband based voice conversion. In: Proceedings of International Conference on Spoken Language Processing, Denver-Colorado, USA.
- (2002) Proceedings of International Conference on Spoken Language Processing
- Turk, O.¹ Arslan, L.M.²

33
- 84863647359
- Donor selection for voice conversion
- Antalya, Turkey
- Turk, O., Arslan, L.M., 2005. Donor selection for voice conversion. In: Proceedings of EUSIPCO, Antalya, Turkey.
- (2005) Proceedings of EUSIPCO
- Turk, O.¹ Arslan, L.M.²

34
- 33746653351
- Robust processing techniques for voice conversion
- Turk O., and Arslan L.M. Robust processing techniques for voice conversion. Computer Speech and Language 20 (2006) 441-467
- (2006) Computer Speech and Language , vol.20 , pp. 441-467
- Turk, O.¹ Arslan, L.M.²

35
- 4544284652
- High quality voice morphing
- Ye, H., Young, S., 2004. High quality voice morphing. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, pp. 9-12.
- (2004) Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 9-12
- Ye, H.¹ Young, S.²

36
- 0035989168
- AANN an alternative to GMM for pattern recognition
- Yegnanarayana B., and Kishore S.P. AANN an alternative to GMM for pattern recognition. Neural Networks 15 (2002) 459-469
- (2002) Neural Networks , vol.15 , pp. 459-469
- Yegnanarayana, B.¹ Kishore, S.P.²

37
- 0032121729
- Extraction of vocal-tract system characteristics from speech signals
- Yegnanarayana B., and Veldhuis R.N.J. Extraction of vocal-tract system characteristics from speech signals. IEEE Transactions Speech and Audio Processing 6 (1998) 313-327
- (1998) IEEE Transactions Speech and Audio Processing , vol.6 , pp. 313-327
- Yegnanarayana, B.¹ Veldhuis, R.N.J.²

38
- 0034856452
- Source and system features for speaker recognition using AANN models
- Salt Lake City, Utah, USA, pp
- Yegnanarayana, B., Reddy, K.S., Kishore, S.P., 2001. Source and system features for speaker recognition using AANN models. In: Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing, Salt Lake City, Utah, USA, pp. 409-412.
- (2001) Proceedings of IEEE International Conference on Acoustics, Speech, Signal Processing , pp. 409-412
- Yegnanarayana, B.¹ Reddy, K.S.² Kishore, S.P.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.