SCOPUS Information Search Platform

2008 IEEE Workshop on Spoken Language Technology, SLT 2008 - Proceedings

2008, Pages 25-28

Vowel-based frequency alignment function design and recognition-based time alignment for automatic speech morphing

Authors (4): Onishi, Masato (a); Takahashi, Toru (b); Irino, Toshio (a); Kawahara, Hideki (a)

Author keywords

Audio systems; Auditory system; Signal processing; Speech communication; Speech processing; Speech recognition; Speech synthesis; Vocal system

Indexed keywords

AUDIO SYSTEMS; AUDITORY SYSTEM; MORPHING; NEW DESIGN; OPEN SOURCES; SPEECH RECOGNITION SYSTEMS; SUBJECTIVE TESTS; TEST RESULTS; TIME ALIGNMENT; TIME FREQUENCY; VOCAL SYSTEM; WEIGHTED AVERAGES;

ALIGNMENT; OBJECT RECOGNITION; SIGNAL PROCESSING; SPEECH ANALYSIS; SPEECH PROCESSING; SPEECH RECOGNITION; SPEECH SYNTHESIS;

SPEECH COMMUNICATION;

EID: 67649562205
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/SLT.2008.4777831
Document Type: Conference Paper

Times cited : (1)

References (12)

1
- 0141591524
- Auditory morphing based on an elastic perceptual distance metric in an interference free time-frequency representation
- Hong Kong, China
- H. Kawahara and H. Matsui, "Auditory morphing based on an elastic perceptual distance metric in an interference free time-frequency representation," in ICASSP 2003, Hong Kong, China, 2003, vol.I, pp. 256-259.
- (2003) ICASSP 2003 , vol.1 , pp. 256-259
- Kawahara, H.¹ Matsui, H.²

2
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol.27, no. 3-4, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigne, A.³

3
- 33750915991
- STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds
- DOI 10.1250/ast.27.349
- H. Kawahara, "STRAIGHT, exploitation of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds," Acoustical Science and Technology, vol.27, no. 6, pp. 349-353, 2006. (Pubitemid 44728687)
- (2006) Acoustical Science and Technology , vol.27 , Issue.6 , pp. 349-353
- Kawahara, H.¹

4
- 85009062693
- Julius - an open source real-time large vocabulary recognition engine
- Aalborg, Denmark
- A. Lee, T. Kawahara, and K. Shikano, "Julius - an open source real-time large vocabulary recognition engine," in Interspeech 2001, Aalborg, Denmark, 2001, pp. 1691-1694.
- (2001) Interspeech 2001 , pp. 1691-1694
- Lee, A.¹ Kawahara, T.² Shikano, K.³

5
- 67649526748
- Speaker conversion system based on vowels -An implementation of voice texture mapping
- SP2006-162 [in Japanese]
- T. Takahashi, M. Morise, R. Nisimura, T. Irino, H. Kawahara, and H. Banno, "Speaker conversion system based on vowels -an implementation of voice texture mapping-," IEICE Technical Report, vol.106, no. 613, pp. 13-18, 2007, SP2006-162 [in Japanese].
- (2007) IEICE Technical Report , vol.106 , Issue.613 , pp. 13-18
- Takahashi, T.¹ Morise, M.² Nisimura, R.³ Irino, T.⁴ Kawahara, H.⁵ Banno, H.⁶

6
- 67649557745
- Vowel-based voice conversion and its objective evaluation
- Gold Coast, Australia
- M. Onishi, T. Takahashi, T. Morise, T. Irino, and H. Kawahara, "Vowel-based voice conversion and its objective evaluation," in NCSP'08, Gold Coast, Australia, 2008, pp. 275-278.
- (2008) NCSP'08 , pp. 275-278
- Onishi, M.¹ Takahashi, T.² Morise, T.³ Irino, T.⁴ Kawahara, H.⁵

7
- 60049091877
- Listener adaptability to individual speaker differences in monosyllabic speech perception
- [in Japanese]
- K. Kato and K. Kakehi, "Listener adaptability to individual speaker differences in monosyllabic speech perception," J. Acoust. Soc. Jpn., vol.44, no. 3, pp. 180-186, 1988, [in Japanese].
- (1988) J. Acoust. Soc. Jpn. , vol.44 , Issue.3 , pp. 180-186
- Kato, K.¹ Kakehi, K.²

8
- 44949084088
- General framework for flexible speech style manipulation and synthesis
- Seoul, Korea, [CD-ROM]
- T. Takahashi, T. Irino, and H. Kawahara, "General framework for flexible speech style manipulation and synthesis," in WESPAC IX 2006, Seoul, Korea, 2006, [CD-ROM].
- (2006) WESPAC IX 2006
- Takahashi, T.¹ Irino, T.² Kawahara, H.³

9
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. ASSP, vol.28, no. 4, pp. 357-366, 1980. (Pubitemid 11464930)
- (1980) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

10
- 0001481529
- Bark and ERB bilinear transforms
- J. O. Smith and J. S. Abel, "Bark and ERB bilinear transforms," IEEE Trans. Speech and Audio Proc., vol.7, no. 6, pp. 697-708, 1999.
- (1999) IEEE Trans. Speech and Audio Proc. , vol.7 , Issue.6 , pp. 697-708
- Smith, J.O.¹ Abel, J.S.²

11
- 0032744416
- Vowel formant discrimination: Towards more ordinary listening conditions
- D. Kewley-Port and Y. Zheng, "Vowel formant discrimination: Towards more ordinary listening conditions," J. Acoust. Soc. Am., vol.106, no. 5, pp. 2945-2958, 1999.
- (1999) J. Acoust. Soc. Am. , vol.106 , Issue.5 , pp. 2945-2958
- Kewley-Port, D.¹ Zheng, Y.²

12
- 67649539246
- An introduction to R
- and the R Development Core Team, Vienna, Austria, ISBN 3-900051-12-7.
- W. N. Venables, D. M. Smith, and the R Development Core Team, An introduction to R, R Foundation for Statistical Computing, Vienna, Austria, 2008, ISBN 3-900051-12-7.
- (2008) R Foundation for Statistical Computing
- Venables, W.N.¹ Smith, D.M.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.