SCOPUS 정보 검색 플랫폼

Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010

Volumn , Issue , 2010, Pages 2406-2409

Sub-band basis spectrum model for pitch-synchronous log-spectrum and phase based on approximation of sparse coding

(3) Tamura, Masatsune a Kagoshima, Takehiko a Akamine, Masami a

Author keywords

Sparse coding; Spectrum parameter; Speech synthesis; Sub band basis spectrum model; Voice adaptation

Indexed keywords

SPEECH COMMUNICATION; SPEECH SYNTHESIS; CODES (SYMBOLS); CONTINUOUS SPEECH RECOGNITION; SPECTRUM ANALYSIS; SPEECH CODING;

ANALYSIS-SYNTHESIS; HIGHER FREQUENCIES; LINEAR COMBINATIONS; LOWER FREQUENCIES; SPARSE CODING; SPECTRUM MODEL; SPECTRUM PARAMETERS; SPECTRUM REPRESENTATION; ANALYSIS/SYNTHESIS; BASIS VECTOR; LOG SPECTRUMS; SPECTRA MODELING; SPECTRA'S; SUB-BAND BASE SPECTRUM MODEL; SUBBANDS; VOICE ADAPTATION;

SPECTRUM ANALYSIS; SPEECH SYNTHESIS;

EID: 79959839864 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (3)

References (11)

1
- 29144484191
- Concatenative speech synthesis based on the plural unit selection and fusion method
- DOI 10.1093/ietisy/e88-d.11.2565
- T. Mizutani and T. Kagoshima, "Concatenative speech synthesis based on the plural unit selection and fusion method," IEICE Trans. E88-D, 11, pp.2565-2572, 2005. (Pubitemid 41816802)
- (2005) IEICE Transactions on Information and Systems , vol.E88-D , Issue.11 , pp. 2565-2572
- Mizutani, T.¹ Kagoshima, T.²

2
- 0004056285
- Prentice Hall
- X. Huang, A. Acero, H.W. Hon, "Spoken Language Processing: A Guide to Theory, Algorithm and System Development," Prentice Hall, 2001.
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm and System Development
- Huang, X.¹ Acero, A.² Hon, H.W.³

3
- 85016140477
- An adaptive algorithm for mel-cepstral analysis of speech
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech," Proc. ICASSP, pp. I-137 - I-140, 1992.
- (1992) Proc. ICASSP
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

4
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Kasuse and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, 27, pp.187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Kasuse, I.² Cheveigne, A.³

5
- 0003447548
- PhD thesis, Ecole Nationale Supérieure des Télé communications
- Y. Stylianou, "Harmonic plus noise models for speech, combined with statistical methods for speech and speaker modification," PhD thesis, Ecole Nationale Supérieure des Télécommunications, 1996
- (1996) Harmonic Plus Noise Models for Speech, Combined with Statistical Methods for Speech and Speaker Modification
- Stylianou, Y.¹

6
- 70450176940
- Speech synthesis based on the plural unit selection and fusion method using FWF model
- R. Morinaka, M. Tamura, M. Morita, T. Kagoshima, "Speech synthesis based on the plural unit selection and fusion method using FWF model," Proc. INTERSPEECH, pp. 2083-2086, 2009.
- (2009) Proc. INTERSPEECH , pp. 2083-2086
- Morinaka, R.¹ Tamura, M.² Morita, M.³ Kagoshima, T.⁴

7
- 44949084615
- A weight estimation method using LDA for multi-band speech recognition
- K. Iwano, K. Kojima, and S. Furui, "A weight estimation method using LDA for multi-band speech recognition," Proc. INTERSPEECH, pp. 2534 - 2537, 2006.
- (2006) Proc. INTERSPEECH , pp. 2534-2537
- Iwano, K.¹ Kojima, K.² Furui, S.³

8
- 0029938380
- Emergence of simple-cell receptive field properties by learning a sparse code for natural images
- B. A. Olshausen and D. J. Field, "Emergence of simple-cell receptive field properties by learning a sparse code for natural images," Nature, vol 381, 1996.
- (1996) Nature , vol.381
- Olshausen, B.A.¹ Field, D.J.²

9
- 78049398611
- Sparse coding for speech recognition
- G.S.V.S. Sivaram, S. K. Nemala, M. Elhilali, T. Tran, H. Hermansky, "Sparse coding for speech recognition," Proc. ICASSP, pp. 4346-4349, 2010.
- (2010) Proc. ICASSP , pp. 4346-4349
- Sivaram, G.S.V.S.¹ Nemala, S.K.² Elhilali, M.³ Tran, T.⁴ Hermansky, H.⁵

10
- 0003289778
- Solving least squares problems
- (first published by 1974)
- C. L. Lawson, R. J. Hanson, "Solving Least Squares Problems," SIAM classics in applied mathematics, 1995 (first published by 1974)
- (1995) SIAM Classics in Applied Mathematics
- Lawson, C.L.¹ Hanson, R.J.²

11
- 0035472456
- Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
- DOI 10.1109/89.952489, PII S1063667601082335
- P. Jackson and C. H. Shadle, "Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech," IEEE Trans. Speech and Audio Processing, vol.9 pp.713-726, 2001. (Pubitemid 32992835)
- (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.7 , pp. 713-726
- Jackson, P.J.B.¹ Shadle, C.H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.