SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2013, Pages 1057-1061

An investigation of acoustic features for singing voice conversion based on perceptual age

(8) Kobayashi, Kazuhiro (a); Doi, Hironori (a); Toda, Tomoki (a); Nakano, Tomoyasu (b); Goto, Masataka (b); Neubig, Graham (a); Sakti, Sakriani (a); Nakamura, Satoshi (a)

(a) NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

(b) NATIONAL INSTITUTE OF ADVANCED INDUSTRIAL SCIENCE AND TECHNOLOGY, AIST (Japan)

Author keywords

Perceptual age; Singing voice; Spectral and prosodic features; Subjective evaluations; Voice conversion

Indexed keywords

COMPUTER APPLICATIONS; COMPUTER SIMULATION;

PERCEPTUAL AGE; PROSODIC FEATURES; SINGING VOICES; SUBJECTIVE EVALUATIONS; VOICE CONVERSION;

SPEECH PROCESSING;

EID: 84905262778 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (5)

References (21)

1
- 84867616167
- Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown
- Mar
- H. Kawahara and M. Morise, "Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown," Proc. ICASSP, pp. 5389-5392, Mar. 2012.
- (2012) Proc. ICASSP , pp. 5389-5392
- Kawahara, H.¹ Morise, M.²

2
- 0032026483
- Continuous probabilistic transform for voice conversion
- Mar
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. SAP, vol. 6, no. 2, pp. 131-142, Mar. 1998.
- (1998) IEEE Trans. SAP , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

3
- 57749193836
- Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- Nov
- T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. ASLP, vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

4
- 79959827418
- Applying voice conversion to concatenative singing-voice synthesis
- Sept
- F. Villavicencio and J. Bonada, "Applying voice conversion to concatenative singing-voice synthesis," Proc. INTERSPEECH, pp. 2162-2165, Sept. 2010.
- (2010) Proc. INTERSPEECH , pp. 2162-2165
- Villavicencio, F.¹ Bonada, J.²

5
- 84874432462
- GMM voice conversion of singing voice using vocal tract area function
- Speech (Japanese edition), Nov
- Y. Kawakami, H. Banno, and F. Itakura, "GMM voice conversion of singing voice using vocal tract area function," IEICE technical report. Speech (Japanese edition), vol. 110, no. 297, pp. 71-76, Nov. 2010.
- (2010) IEICE Technical Report , vol.110 , Issue.297 , pp. 71-76
- Kawakami, Y.¹ Banno, H.² Itakura, F.³

6
- 34547496175
- One-to-many and many-to-one voice conversion based on eigenvoices
- Apr
- T. Toda, Y. Ohtani, and K. Shikano, "One-to-many and many-to-one voice conversion based on eigenvoices," Proc. ICASSP, pp. 1249-1252, Apr. 2007.
- (2007) Proc. ICASSP , pp. 1249-1252
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

7
- 84874403435
- Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system
- Nov
- H. Doi, T. Toda, T. Nakano, M. Goto, and S. Nakamura, "Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system," Proc. APSIPA ASC, Nov. 2012.
- (2012) Proc. APSIPA ASC
- Doi, H.¹ Toda, T.² Nakano, T.³ Goto, M.⁴ Nakamura, S.⁵

8
- 70450194389
- Many-to-many eigenvoice conversion with reference voice
- Sept
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Many-to-many eigenvoice conversion with reference voice," Proc. INTERSPEECH, pp. 1623-1626, Sept. 2009.
- (2009) Proc. INTERSPEECH , pp. 1623-1626
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

9
- 67651002140
- Statistical parametric speech synthesis
- Nov
- H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, no. 11, pp. 1039-1064, Nov. 2009.
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

10
- 51449114529
- A style control technique for HMM-based expressive speech synthesis (speech and hearing)
- Sep
- T. Nose, J. Yamagishi, T. Masuko, and T. Kobayashi, "A style control technique for HMM-based expressive speech synthesis (speech and hearing)," IEICE transactions on information and systems, vol. 90, no. 9, pp. 1406-1413, Sep. 2007.
- (2007) IEICE Transactions on Information and Systems , vol.90 , Issue.9 , pp. 1406-1413
- Nose, T.¹ Yamagishi, J.² Masuko, T.³ Kobayashi, T.⁴

11
- 44949155552
- A technique for controlling voice quality of synthetic speech using multiple regression HSMM
- Sept
- M. Tachibana, T. Nose, J. Yamagishi, and T. Kobayashi, "A technique for controlling voice quality of synthetic speech using multiple regression HSMM," Proc. INTERSPEECH, pp. 2438-2441, Sept. 2006.
- (2006) Proc. INTERSPEECH , pp. 2438-2441
- Tachibana, M.¹ Nose, T.² Yamagishi, J.³ Kobayashi, T.⁴

12
- 79959847554
- Adaptive voice-quality control based on one-to-many eigenvoice conversion
- Sept
- K. Ohta, T. Toda, Y. Ohtani, H. Saruwatari, and K. Shikano, "Adaptive voice-quality control based on one-to-many eigenvoice conversion," Proc. INTERSPEECH, pp. 2158-2161, Sept. 2010.
- (2010) Proc. INTERSPEECH , pp. 2158-2161
- Ohta, K.¹ Toda, T.² Ohtani, Y.³ Saruwatari, H.⁴ Shikano, K.⁵

13
- 79959816772
- Longitudinal changes of selected voice source parameters
- Sept
- H. Kasuya, H. Yoshida, S. Ebihara, and H. Mori, "Longitudinal changes of selected voice source parameters," Proc. INTERSPEECH, pp. 2570-2573, Sept. 2010.
- (2010) Proc. INTERSPEECH , pp. 2570-2573
- Kasuya, H.¹ Yoshida, H.² Ebihara, S.³ Mori, H.⁴

14
- 0036299156
- Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers
- May
- N. Minematsu, M. Sekiguchi, and K. Hirose, "Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers," Proc. ICASSP, pp. 137-140, May 2002.
- (2002) Proc. ICASSP , pp. 137-140
- Minematsu, N.¹ Sekiguchi, M.² Hirose, K.³

15
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- June
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," Proc. ICASSP, pp. 1315-1318, June 2000.
- (2000) Proc. ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

16
- 84874199000
- Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT
- Sept
- H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and synthesis system STRAIGHT," Proc. MAVEBA, Sept. 2001.
- (2001) Proc. MAVEBA
- Kawahara, H.¹ Estill, J.² Fujimura, O.³

17
- 44949143155
- Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
- Sept
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation," Proc. INTERSPEECH, pp. 2266-2269, Sept. 2006.
- (2006) Proc. INTERSPEECH , pp. 2266-2269
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

18
- 84867211725
- Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- Sept
- T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory," Proc. INTERSPEECH, pp. 1076-1079, Sept. 2008.
- (2008) Proc. INTERSPEECH , pp. 1076-1079
- Muramatsu, T.¹ Ohtani, Y.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

19
- 84878390910
- Implementation of computationally efficient real-time voice conversion
- Sept
- T. Toda, T. Muramatsu, and H. Banno, "Implementation of computationally efficient real-time voice conversion," Proc. INTERSPEECH, Sept. 2012.
- (2012) Proc. INTERSPEECH
- Toda, T.¹ Muramatsu, T.² Banno, H.³

20
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
- Apr
- H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigne, A.³

21
- 84901764550
- AIST humming database: Music database for singing research
- (Japanese edition), vol. 2005-MUS-61-2, Aug
- M. Goto and T. Nishimura, "AIST humming database: Music database for singing research," IPSJ SIG Notes (Technical Report) (Japanese edition), vol. 2005-MUS-61-2, pp. 7-12, Aug. 2005.
- (2005) IPSJ SIG Notes (Technical Report) , pp. 7-12
- Goto, M.¹ Nishimura, T.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.