SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 2, 2006, Pages 365-375

The ATR multilingual speech-to-speech translation system

(10) Nakamura, Satoshi a,b Markov, Konstantin a,b Nakaiwa, Hiromi b Kikui, Genichiro b Kawai, Hisashi a,b Jitsuhiro, Takatoshi a,b Zhang, Jin Song b Yamamoto, Hirofumi b Sumita, Eiichiro b Yamamoto, Seiichi a,b

a IEEE (Japan)

b ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (Japan)

Author keywords

Example based machine translation (EBMT); Minimum description length (MDL); Multiclass language model; Speech to speech translation (S2S); Statistical machine translation (SMT); Successive state splitting (SSS); Text to speech (TTS) conversion

Indexed keywords

EXAMPLE BASED MACHINE TRANSLATION (EBMT); MINIMUM DESCRIPTION LENGTH (MDL); MULTICLASS LANGUAGE MODELS; SPEECH TO SPEECH TRANSLATION (S2ST) SYSTEMS; STATISTICAL MACHINE TRANSLATION (SMT); SUCCESSIVE STATE SPLITTING (SSS); TRANSLATION QUALITY;

DATABASE SYSTEMS; LEARNING SYSTEMS; LINGUISTICS; SPEECH RECOGNITION; TRANSLATION (LANGUAGES);

SPEECH SYNTHESIS;

EID: 33751057590 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.860774 Document Type: Article

Times cited : (142)

References (50)

1
- 0003557444
- W. Wahlster, Ed, Berlin, Germany: Springer-Verlag
- W. Wahlster, Ed., Vennobil: Foundations of Speech-to-Speech Translations. Berlin, Germany: Springer-Verlag, 2000.
- (2000) Vennobil: Foundations of Speech-to-Speech Translations

2
- 85009156805
- NESPOLEI's multi-lingual and multi-modal corpus
- E. Costantini, S. Burger, and F. Pianesi, "NESPOLEI's multi-lingual and multi-modal corpus," in Proc. LREC, 2002, pp. 165-170.
- (2002) Proc. LREC , pp. 165-170
- Costantini, E.¹ Burger, S.² Pianesi, F.³

3
- 33947133934
- A. Lavie, L. Levin, T. Schultz, and A. Waibel. Domain portability in speech-to-speech translation, presented at Proc. HLT Workshop. [Online] Available: http://www.is.cs.cmu.edu/papers/speech/HLT2001/HLT_alon.pdf
- A. Lavie, L. Levin, T. Schultz, and A. Waibel. Domain portability in speech-to-speech translation, presented at Proc. HLT Workshop. [Online] Available: http://www.is.cs.cmu.edu/papers/speech/HLT2001/HLT_alon.pdf

4
- 0007645150
- Segment selection and pitch modification for high quality speech synthesis using waveform segments
- T. Hirokawa and K. Hakoda, "Segment selection and pitch modification for high quality speech synthesis using waveform segments," in Proc. Int. Conf. Spoken Language Processing, 1990, pp. 337-340.
- (1990) Proc. Int. Conf. Spoken Language Processing , pp. 337-340
- Hirokawa, T.¹ Hakoda, K.²

5
- 0004131347
- Trainable speech synthesis,
- Ph.D. dissertation. Eng. Dept. Cambridge Univ, Cambridge, U.K
- R. Donovan, "Trainable speech synthesis," Ph.D. dissertation. Eng. Dept. Cambridge Univ., Cambridge, U.K., 1996.
- (1996)
- Donovan, R.¹

6
- 33947103313
- A. Breen and P. Jackson, Nonuniform unit selection and the similarity metric within BT's laureate TTS system, in Proc. 3rd ESCA/COCOSDA Workshop on Speech Synthesis, Jenolan Caves, Blue Mountians, Australia, Nov. 1998, p. G.1.
- A. Breen and P. Jackson, "Nonuniform unit selection and the similarity metric within BT's laureate TTS system," in Proc. 3rd ESCA/COCOSDA Workshop on Speech Synthesis, Jenolan Caves, Blue Mountians, Australia, Nov. 1998, p. G.1.

7
- 85135109865
- ATR ν-talk speech synthesis system
- Banff, AB, Canada, Oct
- Y. Sagisaka, N. Kaiki, N. Iwahashi, and K. Mimura, "ATR ν-talk speech synthesis system," in Proc. Int. Conf. Spoken Language Processing, Banff, AB, Canada, Oct. 1992, pp. 483-486.
- (1992) Proc. Int. Conf. Spoken Language Processing , pp. 483-486
- Sagisaka, Y.¹ Kaiki, N.² Iwahashi, N.³ Mimura, K.⁴

8
- 0342918775
- Chatr: A genetic speech synthesis system
- Kyoto, Japan, Aug
- A. W. Black and P. Taylor, "Chatr: a genetic speech synthesis system," in Proc. Conf. Computational Linguistics, Kyoto, Japan, Aug. 1994, pp. 983-986.
- (1994) Proc. Conf. Computational Linguistics , pp. 983-986
- Black, A.W.¹ Taylor, P.²

9
- 0023756465
- Speech synthesis by rule using an optimal selection of nonuniform synthesis units
- New York, Apr
- Y. Sagisaka, "Speech synthesis by rule using an optimal selection of nonuniform synthesis units," in Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing, New York, Apr. 1988, pp. 679-682.
- (1988) Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing , pp. 679-682
- Sagisaka, Y.¹

10
- 0027699809
- Speech segment selection for concatenate synthesis based on spectral distortion minimization
- Nov
- N. Iwahashi, N. Kaiki, and Y. Sagisaka, "Speech segment selection for concatenate synthesis based on spectral distortion minimization," Trans. IEICE, vol. E76-A, no. 11, pp. 1942-1948, Nov. 1993.
- (1993) Trans. IEICE , vol.E76-A , Issue.11 , pp. 1942-1948
- Iwahashi, N.¹ Kaiki, N.² Sagisaka, Y.³

11
- 0002144369
- Tree-based state tying for high accuracy acoustic modeling
- S. Young, J. Odell, and P. Woodland, "Tree-based state tying for high accuracy acoustic modeling," in Proc. ARPA Workshop on Human Language Technology, 1994, pp. 307-312.
- (1994) Proc. ARPA Workshop on Human Language Technology , pp. 307-312
- Young, S.¹ Odell, J.² Woodland, P.³

12
- 85013744934
- A successive state splitting algorithm for efficient allophone modeling
- J. Takami and S. Sagayama, "A successive state splitting algorithm for efficient allophone modeling," in Proc. ICASSP, vol. I, 1992, pp. 573-576.
- (1992) Proc. ICASSP , vol.1 , pp. 573-576
- Takami, J.¹ Sagayama, S.²

13
- 0030715097
- HMM topology design using maximum likelihood successive state splitting
- M. Ostendorf and H. Singer, "HMM topology design using maximum likelihood successive state splitting," Comput. Speech Lang., vol. 11, pp. 17-41, 1997.
- (1997) Comput. Speech Lang , vol.11 , pp. 17-41
- Ostendorf, M.¹ Singer, H.²

14
- 85009204321
- Automatic generation of nonuniform context-dependent HMM topologies based on the MDL criterion
- T. Jitsuhiro, T. Matsui, and S. Nakamura, "Automatic generation of nonuniform context-dependent HMM topologies based on the MDL criterion," in Proc. Etirospeech, 2003, pp. 2721-2724.
- (2003) Proc. Etirospeech , pp. 2721-2724
- Jitsuhiro, T.¹ Matsui, T.² Nakamura, S.³

15
- 85022919385
- Class-based N-gram models of natural language
- P. Brown, V. Pietra, P. Souza, J. Lai, and R. Mercer, "Class-based N-gram models of natural language," Comput. Linguistics, vol. 18, no. 4, pp. 467-479, 1992.
- (1992) Comput. Linguistics , vol.18 , Issue.4 , pp. 467-479
- Brown, P.¹ Pietra, V.² Souza, P.³ Lai, J.⁴ Mercer, R.⁵

16
- 0038373395
- Multi-class composite N-gram language model
- H. Yamamoto, S. Isogai, and Y. Sagisaka, "Multi-class composite N-gram language model," Speech Commun., vol. 41, pp. 369-379, 2003.
- (2003) Speech Commun , vol.41 , pp. 369-379
- Yamamoto, H.¹ Isogai, S.² Sagisaka, Y.³

17
- 84944178665
- Hierarchical grouping to optimize an objective function
- H. Ward, Jr., "Hierarchical grouping to optimize an objective function," J. Amer. Statist. Assoc., vol. 58, pp. 236-244, 1963.
- (1963) J. Amer. Statist. Assoc , vol.58 , pp. 236-244
- Ward Jr., H.¹

18
- 0021567651
- A framework of a mechanical translation between Japanese and English by analogy principle
- M. Elithorn and R. Banerji, Eds. Amsterdam, The Netherlands: North-Holland
- M. Nagao, "A framework of a mechanical translation between Japanese and English by analogy principle," in Artificial and Human Intelligence, M. Elithorn and R. Banerji, Eds. Amsterdam, The Netherlands: North-Holland, 1984, pp. 173-180.
- (1984) Artificial and Human Intelligence , pp. 173-180
- Nagao, M.¹

19
- 0033281002
- Review article: Example-based machine translation
- H. Somers, "Review article: example-based machine translation," J. Mach. Translat., pp. 113-157, 1999.
- (1999) J. Mach. Translat , pp. 113-157
- Somers, H.¹

20
- 84936823635
- A statistical approach to machine translation
- P. Brown et al., "A statistical approach to machine translation," Comput. Linguistics, vol. 16, pp. 79-85, 1993.
- (1993) Comput. Linguistics , vol.16 , pp. 79-85
- Brown, P.¹

21
- 0031361613
- Automating knowledge acquisition for machine translation
- K. Knight, "Automating knowledge acquisition for machine translation," AI Mag., vol. 18, no. 4, pp. 81-96, 1997.
- (1997) AI Mag , vol.18 , Issue.4 , pp. 81-96
- Knight, K.¹

22
- 33947164498
- Stochastic modeling: From pattern classification to language translation
- H. Ney, "Stochastic modeling: from pattern classification to language translation," in Proc. ACL Workshop DDMT, 2001, pp. 33-37.
- (2001) Proc. ACL Workshop DDMT , pp. 33-37
- Ney, H.¹

23
- 0039330854
- Learning dependency translation models as collections of finite-state head transducers
- H. Alshawi, S. Bangalore, and S. Douglas, "Learning dependency translation models as collections of finite-state head transducers," Comput. Linguistics, vol. 26, no. 1, pp. 45-60, 2000.
- (2000) Comput. Linguistics , vol.26 , Issue.1 , pp. 45-60
- Alshawi, H.¹ Bangalore, S.² Douglas, S.³

24
- 0006658814
- Fast decoding for statistical machine translation
- Y. Wang and A. Waibel, "Fast decoding for statistical machine translation," in Proc. ICSLP, 1998, pp. 2775-2778.
- (1998) Proc. ICSLP , pp. 2775-2778
- Wang, Y.¹ Waibel, A.²

25
- 84882967809
- Improved alignment models for statistical machine translation
- F. Och, C. Tillmann, and H. Ney, "Improved alignment models for statistical machine translation," in Proc. EMNLPAWLC, 1999, pp. 20-28.
- (1999) Proc. EMNLPAWLC , pp. 20-28
- Och, F.¹ Tillmann, C.² Ney, H.³

26
- 84947545641
- Effective phrase translation extraction from alignment models
- A. Venugopal, S. Vogel, and A. Waibel, "Effective phrase translation extraction from alignment models," in Proc. ACL, 2003, pp. 319-326.
- (2003) Proc. ACL , pp. 319-326
- Venugopal, A.¹ Vogel, S.² Waibel, A.³

27
- 25844478468
- Example-based machine translation using DP-matching between word sequences
- E. Sumita, "Example-based machine translation using DP-matching between word sequences," in Proc. ACL Workshop DDMT, 2001, pp. 1-8.
- (2001) Proc. ACL Workshop DDMT , pp. 1-8
- Sumita, E.¹

28
- 18544376963
- Application of translation knowledge acquired by hierarchical phrase alignment
- K. Imamura, "Application of translation knowledge acquired by hierarchical phrase alignment," in Proc. TMI, 2002, pp. 74-84.
- (2002) Proc. TMI , pp. 74-84
- Imamura, K.¹

29
- 85149144166
- Feedback cleaning of machine translation rules using automatic evaluation
- K. Imamura, E. Sumita, and Y. Matsumoto, "Feedback cleaning of machine translation rules using automatic evaluation," in Proc. 41st Annu. Meeting Assoc. Computational Linguistics, 2003, pp. 447-454.
- (2003) Proc. 41st Annu. Meeting Assoc. Computational Linguistics , pp. 447-454
- Imamura, K.¹ Sumita, E.² Matsumoto, Y.³

30
- 26844578082
- Statistical machine translation based on hierarchical phrase alignment
- T. Watanabe, K. Imamura, and E. Sumita, "Statistical machine translation based on hierarchical phrase alignment," in Proc. TMI, 2002, pp. 188-198.
- (2002) Proc. TMI , pp. 188-198
- Watanabe, T.¹ Imamura, K.² Sumita, E.³

31
- 33746611240
- Using language and translation models to select the best among outputs from multiple MT systems
- Y. Akiba, T. Watanabe, and E. Sumita, "Using language and translation models to select the best among outputs from multiple MT systems," in Proc. COLING, 2002, pp. 8-14.
- (2002) Proc. COLING , pp. 8-14
- Akiba, Y.¹ Watanabe, T.² Sumita, E.³

32
- 25844528067
- Example-based decoding for statistical machine translation
- T. Watanabe and E. Sumita, "Example-based decoding for statistical machine translation," in Proc. 9th MT Summit, 2003, pp. 410-417.
- (2003) Proc. 9th MT Summit , pp. 410-417
- Watanabe, T.¹ Sumita, E.²

33
- 84857598349
- An evaluation of the multi-engine MT architecture
- C. Hogan and R. Frederking, "An evaluation of the multi-engine MT architecture," in Proc. AMTA, 1998, pp. 113-123.
- (1998) Proc. AMTA , pp. 113-123
- Hogan, C.¹ Frederking, R.²

34
- 0242413528
- A program for automatically selecting the best output from multiple machine translation engines
- C. Callison-Burch and S. Floumoy, "A program for automatically selecting the best output from multiple machine translation engines," in Proc. MT-SUMMIT-VIII, 2001, pp. 63-66.
- (2001) Proc. MT-SUMMIT-VIII , pp. 63-66
- Callison-Burch, C.¹ Floumoy, S.²

35
- 0033708106
- Speech parameter generation algorithms for hmm-based speech synthesis
- Istanbul, Turkey, Jun
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for hmm-based speech synthesis," in Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing, Istanbul, Turkey, Jun. 2000, pp. 1315-1318.
- (2000) Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

36
- 4544270859
- Optimizing subcost functions for segment selection based on perceptual evaluations in concatenative speech synthesis
- Montreal, QC, Canada, Jun
- T. Toda, H. Kawai, and M. Tsuzaki, "Optimizing subcost functions for segment selection based on perceptual evaluations in concatenative speech synthesis," in Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing, vol. I, Montreal, QC, Canada, Jun. 2004, pp. 657-660.
- (2004) Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing , vol.1 , pp. 657-660
- Toda, T.¹ Kawai, H.² Tsuzaki, M.³

37
- 84863704138
- Toward a broad-coverage bi-lingual corpus for speech translation of travel conversations in the real world
- T. Takezawa, E. Sumita, F. Sugaya, H. Yamamoto, and S. Yamamoto, "Toward a broad-coverage bi-lingual corpus for speech translation of travel conversations in the real world," in Proc. LREC, 2002, pp. 147-152.
- (2002) Proc. LREC , pp. 147-152
- Takezawa, T.¹ Sumita, E.² Sugaya, F.³ Yamamoto, H.⁴ Yamamoto, S.⁵

38
- 85009211217
- Creating corpora for speech-to-speech translation
- G. Kikui, E. Sumita, T. Takezawa, and S. Yamamoto, "Creating corpora for speech-to-speech translation," in Proc. Eurospeech, 2003, pp. 381-384.
- (2003) Proc. Eurospeech , pp. 381-384
- Kikui, G.¹ Sumita, E.² Takezawa, T.³ Yamamoto, S.⁴

39
- 10244241317
- Solutions to problems inherent in spoken language stranslation: The ATR-MATRIX approach
- E. Sumita, S. Yamada, K. Yamamoto, M. Paul, H. Kashioka, K. Ishikawa, and S. Shirai, "Solutions to problems inherent in spoken language stranslation: the ATR-MATRIX approach," in Proc. 7th MT Summit, 1999, pp. 229-235.
- (1999) Proc. 7th MT Summit , pp. 229-235
- Sumita, E.¹ Yamada, S.² Yamamoto, K.³ Paul, M.⁴ Kashioka, H.⁵ Ishikawa, K.⁶ Shirai, S.⁷

40
- 0011946055
- CHATR: A high-definition speech resequencing system
- N. Campbell, "CHATR: a high-definition speech resequencing system," in Proc. ASA/JASA Joint Meeting, 1996, pp. 1223-1228.
- (1996) Proc. ASA/JASA Joint Meeting , pp. 1223-1228
- Campbell, N.¹

41
- 0007601623
- Speech and language databases for speech translation research in ATR
- T. Takezawa, T. Morimoto, and Y. Sagisaka, "Speech and language databases for speech translation research in ATR," in Proc. 1st Int. Workshop on East-Asian Language Resources and Evaluation (EALREW), 1998, pp. 148-155.
- (1998) Proc. 1st Int. Workshop on East-Asian Language Resources and Evaluation (EALREW) , pp. 148-155
- Takezawa, T.¹ Morimoto, T.² Sagisaka, Y.³

42
- 85009064344
- Improving genericity for task-independent speech recognition
- F. Lefevre, J. L. Gauvain, and L. Lamel, "Improving genericity for task-independent speech recognition," in Proc. Eurospeech, 2001, pp. 1241-1244.
- (2001) Proc. Eurospeech , pp. 1241-1244
- Lefevre, F.¹ Gauvain, J.L.² Lamel, L.³

43
- 0012330750
- The design for the wall street journal-based CSR corpus
- Feb
- D. Paul and J. Baker, "The design for the wall street journal-based CSR corpus," in Proc. DARPA Speech and Natural Language Workshop, Feb. 1992, pp. 357-362.
- (1992) Proc. DARPA Speech and Natural Language Workshop , pp. 357-362
- Paul, D.¹ Baker, J.²

44
- 33947120905
- Plainsboro, NJ, Mar
- Proc. Spoken Language Technology Workshop, Plainsboro, NJ, Mar. 1994.
- (1994) Proc. Spoken Language Technology Workshop

45
- 85124698057
- The architecture of the Festival speech synthesis system
- Sydney, Australia, Nov
- P. Taylor, A. Black, and R. Caley, "The architecture of the Festival speech synthesis system," in Proc. Third Int. Workshop Speech Synthesis, Sydney, Australia, Nov. 1998, pp. 147-151.
- (1998) Proc. Third Int. Workshop Speech Synthesis , pp. 147-151
- Taylor, P.¹ Black, A.² Caley, R.³

46
- 33947141155
- An introduction to ATRPTH: A phonetically rich sentence set based Chinese Putonghua speech database developed by ATR
- Fall
- J. S. Zhang, M. Mizumachi, F. Soong, and S. Nakamura, "An introduction to ATRPTH: a phonetically rich sentence set based Chinese Putonghua speech database developed by ATR," in Proc. ASJ Meeting, Fall 2003, pp. 167-168.
- (2003) Proc. ASJ Meeting , pp. 167-168
- Zhang, J.S.¹ Mizumachi, M.² Soong, F.³ Nakamura, S.⁴

47
- 85009079604
- A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition
- J. S. Zhang, S. W. Zhang, Y. Sagisaka, and S. Nakamura, "A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition," in Proc. Eurospeech, vol. 3, 2001, pp. 1661-1663.
- (2001) Proc. Eurospeech , vol.3 , pp. 1661-1663
- Zhang, J.S.¹ Zhang, S.W.² Sagisaka, Y.³ Nakamura, S.⁴

48
- 0038719307
- A study on acoustic modeling of pauses for recognizing noisy conversational speech
- J. S. Zhang, K. Markov, T. Matsui, and S. Nakamura, "A study on acoustic modeling of pauses for recognizing noisy conversational speech," Proc. IEICE Trans. Inf. Syst., vol. 86-D, no. 3, pp. 489-196, 2003.
- (2003) Proc. IEICE Trans. Inf. Syst , vol.86-D , Issue.3 , pp. 489-196
- Zhang, J.S.¹ Markov, K.² Matsui, T.³ Nakamura, S.⁴

49
- 85009064490
- Evaluation of the ATR-MATRIX speech translation system with a pair comparison method between the system and humans
- F. Sugaya, T. Takezawa, A. Yokoo, Y. Sagisaka, and S. Yamamoto, "Evaluation of the ATR-MATRIX speech translation system with a pair comparison method between the system and humans," in Proc. ICSLP, 2000, pp. 1105-1108.
- (2000) Proc. ICSLP , pp. 1105-1108
- Sugaya, F.¹ Takezawa, T.² Yokoo, A.³ Sagisaka, Y.⁴ Yamamoto, S.⁵

50
- 33751352564
- Chunk-based statistical translation
- T. Watanabe, E. Sumita, and H. Okuno, "Chunk-based statistical translation," Proc. ACL, pp. 303-310, 2003.
- (2003) Proc. ACL , pp. 303-310
- Watanabe, T.¹ Sumita, E.² Okuno, H.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.