메뉴 건너뛰기




Volumn , Issue , 2014, Pages 7889-7893

Voice conversion in time-invariant speaker-independent space

Author keywords

conditional restricted Boltzmann machine; deep learning; speaker specific features; Voice conversion

Indexed keywords

SIGNAL PROCESSING;

EID: 84905252390     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6855136     Document Type: Conference Paper
Times cited : (6)

References (19)
  • 2
    • 84865747520 scopus 로고    scopus 로고
    • Intonation conversion from neutral to expressive speech
    • Christophe Veaux and X. Robet, "Intonation conversion from neutral to expressive speech, " in Proc. Interspeech, 2011, pp. 2765-2768.
    • (2011) Proc. Interspeech , pp. 2765-2768
    • Veaux, C.1    Robet, X.2
  • 3
    • 80052698826 scopus 로고    scopus 로고
    • Speaking-aid systems using gmm-based voice conversion for electrolaryngeal speech
    • Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, and Kiyohiro Shikano, "Speaking-aid systems using gmm-based voice conversion for electrolaryngeal speech, " Speech Communication, vol. 54, no. 1, pp. 134-146, 2012.
    • (2012) Speech Communication , vol.54 , Issue.1 , pp. 134-146
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 5
    • 70450192197 scopus 로고    scopus 로고
    • Speech generation from hand gestures based on space mapping
    • Aki Kunikoshi, Yu Qiao, Nobuaki Minematsu, and Keikichi Hirose, "Speech generation from hand gestures based on space mapping, " in Proc. Interspeech, 2009, pp. 308-311.
    • (2009) Proc. Interspeech , pp. 308-311
    • Kunikoshi, A.1    Qiao, Y.2    Minematsu, N.3    Hirose, K.4
  • 6
    • 0021412027 scopus 로고
    • Vector quantization
    • Robert Gray, "Vector quantization, " IEEE ASSP Magazine, vol. 1, no. 2, pp. 4-29, 1984.
    • (1984) IEEE ASSP Magazine , vol.1 , Issue.2 , pp. 4-29
    • Gray, R.1
  • 7
    • 0026880275 scopus 로고
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, and Jean-Pierre Tubach, "Voice transformation using PSOLA technique, " Speech Communication, vol. 11, no. 2, pp. 175-187, 1992.
    • (1992) Speech Communication , vol.11 , Issue.2 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.-P.3
  • 9
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • Tomoki Toda, AlanW. Black, and Keiichi Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 12
    • 84906280857 scopus 로고    scopus 로고
    • Voice conversion in high-order eigen space using deep belief nets
    • Toru Nakashika, Ryoichi Takashima, Tetsuya Takiguchi, and Yasuo Ariki, "Voice conversion in high-order eigen space using deep belief nets, " in Proc. Interspeech, 2013, pp. 369-372.
    • (2013) Proc. Interspeech , pp. 369-372
    • Nakashika, T.1    Takashima, R.2    Takiguchi, T.3    Ariki, Y.4
  • 13
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • Geoffrey E. Hinton, Simon Osindero, and Yee-Whye Teh, "A fast learning algorithm for deep belief nets, " Neural computation, vol. 18, no. 7, pp. 1527-1554, 2006.
    • (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.-W.3
  • 17
    • 0025475528 scopus 로고
    • ATR japanese speech database as a tool of speech recognition and synthesis
    • Akira Kurematsu, Kazuya Takeda, Yoshinori Sagisaka, Shigeru Katagiri, Hisao Kuwabara, and Kiyohiro Shikano, "ATR japanese speech database as a tool of speech recognition and synthesis, " Speech Communication, vol. 9, no. 4, pp. 357-363, 1990.
    • (1990) Speech Communication , vol.9 , Issue.4 , pp. 357-363
    • Kurematsu, A.1    Takeda, K.2    Sagisaka, Y.3    Katagiri, S.4    Kuwabara, H.5    Shikano, K.6
  • 19
    • 80052359758 scopus 로고    scopus 로고
    • Speech reconstruction from melfrequency cepstral coefficients using a source-filter model
    • Ben Milner and Xu Shao, "Speech reconstruction from melfrequency cepstral coefficients using a source-filter model, " in Proc. Interspeech, 2002, pp. 2421-2424.
    • (2002) Proc. Interspeech , pp. 2421-2424
    • Milner, B.1    Shao, X.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.