메뉴 건너뛰기




Volumn 24, Issue 3, 2010, Pages 474-494

Voice conversion by mapping the speaker-specific features using pitch synchronous approach

Author keywords

ABX test; Duration and energy patterns; Excitation source; Feedforward neural network (FFNN); Glottal closure; Instants of significant excitation (epochs); LP residual; Mapping function; Mean opinion score (MOS); Objective measures; Pitch contour; Prosody characteristics; Voice conversion

Indexed keywords

ABX TEST; ENERGY PATTERNS; EXCITATION SOURCE; EXCITATION SOURCES; MAPPING FUNCTIONS; MEAN OPINION SCORES; OBJECTIVE MEASURE; PITCH CONTOURS; VOICE CONVERSION;

EID: 77950029338     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2009.03.003     Document Type: Article
Times cited : (53)

References (38)
  • 2
    • 0033154052 scopus 로고    scopus 로고
    • Speaker transformation algorithm using segmental code books (STASC)
    • Arslan L.M. Speaker transformation algorithm using segmental code books (STASC). Speech Communication 28 (1999) 211-226
    • (1999) Speech Communication , vol.28 , pp. 211-226
    • Arslan, L.M.1
  • 5
    • 0003962869 scopus 로고
    • Macmillan Publishing Company, 866 Third Avenue, New York, USA
    • Hogg R.V., and Ledolter J. Engineering Statistics (1987), Macmillan Publishing Company, 866 Third Avenue, New York, USA
    • (1987) Engineering Statistics
    • Hogg, R.V.1    Ledolter, J.2
  • 7
    • 4444285698 scopus 로고    scopus 로고
    • PhD Thesis, OGI School of Science and Engineering, Oregon Health and Science University, USA
    • Kain, A., 2001. High Resolution Voice Transformation. PhD Thesis, OGI School of Science and Engineering, Oregon Health and Science University, USA.
    • (2001) High Resolution Voice Transformation
    • Kain, A.1
  • 9
    • 77950064323 scopus 로고    scopus 로고
    • Web-based listening test system for speech synthesis and speech conversion evaluation
    • Marrakech Morocco
    • Laurent Blin, O.B., Barreaud, V., 2008. Web-based listening test system for speech synthesis and speech conversion evaluation. In: Proceedings of LREC (Marrakech (Morocco)).
    • (2008) Proceedings of LREC
    • Laurent Blin, O.B.1    Barreaud, V.2
  • 12
    • 77950024658 scopus 로고    scopus 로고
    • Xiao-dan Mei, Sheng-he Sun, 2000. An Efficient Method to Compute LSFs From LPC Coefficients, In: ICSP-2000, pp. 655-658.
    • Xiao-dan Mei, Sheng-he Sun, 2000. An Efficient Method to Compute LSFs From LPC Coefficients, In: ICSP-2000, pp. 655-658.
  • 13
    • 0000668614 scopus 로고    scopus 로고
    • Robustness of group-delay-based method for extraction of significant excitation from speech signals
    • Murthy P.S., and Yegnanarayana B. Robustness of group-delay-based method for extraction of significant excitation from speech signals. IEEE Transactions on Speech and Audio Processing 7 (1999) 609-619
    • (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 609-619
    • Murthy, P.S.1    Yegnanarayana, B.2
  • 17
    • 33745205178 scopus 로고    scopus 로고
    • PhD Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India
    • Prasanna, S.R.M., 2004. Event-Based Analysis of Speech. PhD Thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India.
    • (2004) Event-Based Analysis of Speech
    • Prasanna, S.R.M.1
  • 22
    • 33750713338 scopus 로고    scopus 로고
    • Modeling durations of syllables using neural networks
    • Rao K.S., and Yegnanarayana B. Modeling durations of syllables using neural networks. Computer Speech and Language 21 (2007) 282-295
    • (2007) Computer Speech and Language , vol.21 , pp. 282-295
    • Rao, K.S.1    Yegnanarayana, B.2
  • 24
    • 0029375490 scopus 로고
    • Determination of instants of significant excitation in speech using group delay function
    • Smits R., and Yegnanarayana B. Determination of instants of significant excitation in speech using group delay function. IEEE Transactions on Speech and Audio Processing 3 (1995) 325-333
    • (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , pp. 325-333
    • Smits, R.1    Yegnanarayana, B.2
  • 31
    • 77950029784 scopus 로고    scopus 로고
    • PhD Thesis, Institute for Graduate Studies in Science and Engineering, Bogaziti University, Berlin, Germany
    • Turk, O., 2007. Cross-lingual Voice Conversion. PhD Thesis, Institute for Graduate Studies in Science and Engineering, Bogaziti University, Berlin, Germany.
    • (2007) Cross-lingual Voice Conversion
    • Turk, O.1
  • 33
    • 84863647359 scopus 로고    scopus 로고
    • Donor selection for voice conversion
    • Antalya, Turkey
    • Turk, O., Arslan, L.M., 2005. Donor selection for voice conversion. In: Proceedings of EUSIPCO, Antalya, Turkey.
    • (2005) Proceedings of EUSIPCO
    • Turk, O.1    Arslan, L.M.2
  • 34
    • 33746653351 scopus 로고    scopus 로고
    • Robust processing techniques for voice conversion
    • Turk O., and Arslan L.M. Robust processing techniques for voice conversion. Computer Speech and Language 20 (2006) 441-467
    • (2006) Computer Speech and Language , vol.20 , pp. 441-467
    • Turk, O.1    Arslan, L.M.2
  • 36
    • 0035989168 scopus 로고    scopus 로고
    • AANN an alternative to GMM for pattern recognition
    • Yegnanarayana B., and Kishore S.P. AANN an alternative to GMM for pattern recognition. Neural Networks 15 (2002) 459-469
    • (2002) Neural Networks , vol.15 , pp. 459-469
    • Yegnanarayana, B.1    Kishore, S.P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.