Volume 14, Issue 2, 2006, Pages 365-375

The ATR multilingual speech-to-speech translation system

Author keywords

Example based machine translation (EBMT); Minimum description length (MDL); Multiclass language model; Speech to speech translation (S2S); Statistical machine translation (SMT); Successive state splitting (SSS); Text to speech (TTS) conversion

Indexed keywords

EXAMPLE BASED MACHINE TRANSLATION (EBMT); MINIMUM DESCRIPTION LENGTH (MDL); MULTICLASS LANGUAGE MODELS; SPEECH TO SPEECH TRANSLATION (S2ST) SYSTEMS; STATISTICAL MACHINE TRANSLATION (SMT); SUCCESSIVE STATE SPLITTING (SSS); TRANSLATION QUALITY

EID: 33751057590     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.860774     Document Type: Article
Times cited : (142)

References (50)
  • 2
    • 85009156805
    • NESPOLE!'s multi-lingual and multi-modal corpus
    • E. Costantini, S. Burger, and F. Pianesi, "NESPOLE!'s multi-lingual and multi-modal corpus," in Proc. LREC, 2002, pp. 165-170.
    • (2002) Proc. LREC , pp. 165-170
    • Costantini, E.1    Burger, S.2    Pianesi, F.3
  • 3
    • 33947133934
    • Domain portability in speech-to-speech translation
    • A. Lavie, L. Levin, T. Schultz, and A. Waibel. Domain portability in speech-to-speech translation, presented at Proc. HLT Workshop. [Online] Available: http://www.is.cs.cmu.edu/papers/speech/HLT2001/HLT_alon.pdf
  • 4
    • 0007645150
    • Segment selection and pitch modification for high quality speech synthesis using waveform segments
    • T. Hirokawa and K. Hakoda, "Segment selection and pitch modification for high quality speech synthesis using waveform segments," in Proc. Int. Conf. Spoken Language Processing, 1990, pp. 337-340.
    • (1990) Proc. Int. Conf. Spoken Language Processing , pp. 337-340
    • Hirokawa, T.1    Hakoda, K.2
  • 5
    • 0004131347
    • Trainable speech synthesis
    • Ph.D. dissertation, Eng. Dept., Cambridge Univ., Cambridge, U.K.
    • R. Donovan, "Trainable speech synthesis," Ph.D. dissertation, Eng. Dept., Cambridge Univ., Cambridge, U.K., 1996.
    • (1996)
    • Donovan, R.1
  • 6
    • 33947103313
    • Nonuniform unit selection and the similarity metric within BT's Laureate TTS system
    • A. Breen and P. Jackson, "Nonuniform unit selection and the similarity metric within BT's Laureate TTS system," in Proc. 3rd ESCA/COCOSDA Workshop on Speech Synthesis, Jenolan Caves, Blue Mountains, Australia, Nov. 1998, p. G.1.
  • 8
    • 0342918775
    • CHATR: A generic speech synthesis system
    • Kyoto, Japan, Aug.
    • A. W. Black and P. Taylor, "CHATR: a generic speech synthesis system," in Proc. Conf. Computational Linguistics, Kyoto, Japan, Aug. 1994, pp. 983-986.
    • (1994) Proc. Conf. Computational Linguistics , pp. 983-986
    • Black, A.W.1    Taylor, P.2
  • 9
    • 0023756465
    • Speech synthesis by rule using an optimal selection of nonuniform synthesis units
    • New York, Apr
    • Y. Sagisaka, "Speech synthesis by rule using an optimal selection of nonuniform synthesis units," in Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing, New York, Apr. 1988, pp. 679-682.
    • (1988) Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing , pp. 679-682
    • Sagisaka, Y.1
  • 10
    • 0027699809
    • Speech segment selection for concatenative synthesis based on spectral distortion minimization
    • Nov.
    • N. Iwahashi, N. Kaiki, and Y. Sagisaka, "Speech segment selection for concatenative synthesis based on spectral distortion minimization," Trans. IEICE, vol. E76-A, no. 11, pp. 1942-1948, Nov. 1993.
    • (1993) Trans. IEICE , vol.E76-A , Issue.11 , pp. 1942-1948
    • Iwahashi, N.1    Kaiki, N.2    Sagisaka, Y.3
  • 12
    • 85013744934
    • A successive state splitting algorithm for efficient allophone modeling
    • J. Takami and S. Sagayama, "A successive state splitting algorithm for efficient allophone modeling," in Proc. ICASSP, vol. I, 1992, pp. 573-576.
    • (1992) Proc. ICASSP , vol.1 , pp. 573-576
    • Takami, J.1    Sagayama, S.2
  • 13
    • 0030715097
    • HMM topology design using maximum likelihood successive state splitting
    • M. Ostendorf and H. Singer, "HMM topology design using maximum likelihood successive state splitting," Comput. Speech Lang., vol. 11, pp. 17-41, 1997.
    • (1997) Comput. Speech Lang , vol.11 , pp. 17-41
    • Ostendorf, M.1    Singer, H.2
  • 14
    • 85009204321
    • Automatic generation of nonuniform context-dependent HMM topologies based on the MDL criterion
    • T. Jitsuhiro, T. Matsui, and S. Nakamura, "Automatic generation of nonuniform context-dependent HMM topologies based on the MDL criterion," in Proc. Eurospeech, 2003, pp. 2721-2724.
    • (2003) Proc. Eurospeech , pp. 2721-2724
    • Jitsuhiro, T.1    Matsui, T.2    Nakamura, S.3
  • 15
    • 85022919385
    • Class-based N-gram models of natural language
    • P. Brown, V. Pietra, P. Souza, J. Lai, and R. Mercer, "Class-based N-gram models of natural language," Comput. Linguistics, vol. 18, no. 4, pp. 467-479, 1992.
    • (1992) Comput. Linguistics , vol.18 , Issue.4 , pp. 467-479
    • Brown, P.1    Pietra, V.2    Souza, P.3    Lai, J.4    Mercer, R.5
  • 16
    • 0038373395
    • Multi-class composite N-gram language model
    • H. Yamamoto, S. Isogai, and Y. Sagisaka, "Multi-class composite N-gram language model," Speech Commun., vol. 41, pp. 369-379, 2003.
    • (2003) Speech Commun , vol.41 , pp. 369-379
    • Yamamoto, H.1    Isogai, S.2    Sagisaka, Y.3
  • 17
    • 84944178665
    • Hierarchical grouping to optimize an objective function
    • H. Ward, Jr., "Hierarchical grouping to optimize an objective function," J. Amer. Statist. Assoc., vol. 58, pp. 236-244, 1963.
    • (1963) J. Amer. Statist. Assoc , vol.58 , pp. 236-244
    • Ward Jr., H.1
  • 18
    • 0021567651
    • A framework of a mechanical translation between Japanese and English by analogy principle
    • M. Elithorn and R. Banerji, Eds. Amsterdam, The Netherlands: North-Holland
    • M. Nagao, "A framework of a mechanical translation between Japanese and English by analogy principle," in Artificial and Human Intelligence, M. Elithorn and R. Banerji, Eds. Amsterdam, The Netherlands: North-Holland, 1984, pp. 173-180.
    • (1984) Artificial and Human Intelligence , pp. 173-180
    • Nagao, M.1
  • 19
    • 0033281002
    • Review article: Example-based machine translation
    • H. Somers, "Review article: example-based machine translation," J. Mach. Translat., pp. 113-157, 1999.
    • (1999) J. Mach. Translat , pp. 113-157
    • Somers, H.1
  • 20
    • 84936823635
    • A statistical approach to machine translation
    • P. Brown et al., "A statistical approach to machine translation," Comput. Linguistics, vol. 16, pp. 79-85, 1993.
    • (1993) Comput. Linguistics , vol.16 , pp. 79-85
    • Brown, P.1
  • 21
    • 0031361613
    • Automating knowledge acquisition for machine translation
    • K. Knight, "Automating knowledge acquisition for machine translation," AI Mag., vol. 18, no. 4, pp. 81-96, 1997.
    • (1997) AI Mag , vol.18 , Issue.4 , pp. 81-96
    • Knight, K.1
  • 22
    • 33947164498
    • Stochastic modeling: From pattern classification to language translation
    • H. Ney, "Stochastic modeling: from pattern classification to language translation," in Proc. ACL Workshop DDMT, 2001, pp. 33-37.
    • (2001) Proc. ACL Workshop DDMT , pp. 33-37
    • Ney, H.1
  • 23
    • 0039330854
    • Learning dependency translation models as collections of finite-state head transducers
    • H. Alshawi, S. Bangalore, and S. Douglas, "Learning dependency translation models as collections of finite-state head transducers," Comput. Linguistics, vol. 26, no. 1, pp. 45-60, 2000.
    • (2000) Comput. Linguistics , vol.26 , Issue.1 , pp. 45-60
    • Alshawi, H.1    Bangalore, S.2    Douglas, S.3
  • 24
    • 0006658814
    • Fast decoding for statistical machine translation
    • Y. Wang and A. Waibel, "Fast decoding for statistical machine translation," in Proc. ICSLP, 1998, pp. 2775-2778.
    • (1998) Proc. ICSLP , pp. 2775-2778
    • Wang, Y.1    Waibel, A.2
  • 25
    • 84882967809
    • Improved alignment models for statistical machine translation
    • F. Och, C. Tillmann, and H. Ney, "Improved alignment models for statistical machine translation," in Proc. EMNLP/WVLC, 1999, pp. 20-28.
    • (1999) Proc. EMNLP/WVLC , pp. 20-28
    • Och, F.1    Tillmann, C.2    Ney, H.3
  • 26
    • 84947545641
    • Effective phrase translation extraction from alignment models
    • A. Venugopal, S. Vogel, and A. Waibel, "Effective phrase translation extraction from alignment models," in Proc. ACL, 2003, pp. 319-326.
    • (2003) Proc. ACL , pp. 319-326
    • Venugopal, A.1    Vogel, S.2    Waibel, A.3
  • 27
    • 25844478468
    • Example-based machine translation using DP-matching between word sequences
    • E. Sumita, "Example-based machine translation using DP-matching between word sequences," in Proc. ACL Workshop DDMT, 2001, pp. 1-8.
    • (2001) Proc. ACL Workshop DDMT , pp. 1-8
    • Sumita, E.1
  • 28
    • 18544376963
    • Application of translation knowledge acquired by hierarchical phrase alignment
    • K. Imamura, "Application of translation knowledge acquired by hierarchical phrase alignment," in Proc. TMI, 2002, pp. 74-84.
    • (2002) Proc. TMI , pp. 74-84
    • Imamura, K.1
  • 30
    • 26844578082
    • Statistical machine translation based on hierarchical phrase alignment
    • T. Watanabe, K. Imamura, and E. Sumita, "Statistical machine translation based on hierarchical phrase alignment," in Proc. TMI, 2002, pp. 188-198.
    • (2002) Proc. TMI , pp. 188-198
    • Watanabe, T.1    Imamura, K.2    Sumita, E.3
  • 31
    • 33746611240
    • Using language and translation models to select the best among outputs from multiple MT systems
    • Y. Akiba, T. Watanabe, and E. Sumita, "Using language and translation models to select the best among outputs from multiple MT systems," in Proc. COLING, 2002, pp. 8-14.
    • (2002) Proc. COLING , pp. 8-14
    • Akiba, Y.1    Watanabe, T.2    Sumita, E.3
  • 32
    • 25844528067
    • Example-based decoding for statistical machine translation
    • T. Watanabe and E. Sumita, "Example-based decoding for statistical machine translation," in Proc. 9th MT Summit, 2003, pp. 410-417.
    • (2003) Proc. 9th MT Summit , pp. 410-417
    • Watanabe, T.1    Sumita, E.2
  • 33
    • 84857598349
    • An evaluation of the multi-engine MT architecture
    • C. Hogan and R. Frederking, "An evaluation of the multi-engine MT architecture," in Proc. AMTA, 1998, pp. 113-123.
    • (1998) Proc. AMTA , pp. 113-123
    • Hogan, C.1    Frederking, R.2
  • 34
    • 0242413528
    • A program for automatically selecting the best output from multiple machine translation engines
    • C. Callison-Burch and S. Flournoy, "A program for automatically selecting the best output from multiple machine translation engines," in Proc. MT-SUMMIT-VIII, 2001, pp. 63-66.
    • (2001) Proc. MT-SUMMIT-VIII , pp. 63-66
    • Callison-Burch, C.1    Flournoy, S.2
  • 36
    • 4544270859
    • Optimizing subcost functions for segment selection based on perceptual evaluations in concatenative speech synthesis
    • Montreal, QC, Canada, Jun
    • T. Toda, H. Kawai, and M. Tsuzaki, "Optimizing subcost functions for segment selection based on perceptual evaluations in concatenative speech synthesis," in Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing, vol. I, Montreal, QC, Canada, Jun. 2004, pp. 657-660.
    • (2004) Proc. IEEE Int. Conf. Speech, Acoustics, Signal Processing , vol.1 , pp. 657-660
    • Toda, T.1    Kawai, H.2    Tsuzaki, M.3
  • 37
    • 84863704138
    • Toward a broad-coverage bi-lingual corpus for speech translation of travel conversations in the real world
    • T. Takezawa, E. Sumita, F. Sugaya, H. Yamamoto, and S. Yamamoto, "Toward a broad-coverage bi-lingual corpus for speech translation of travel conversations in the real world," in Proc. LREC, 2002, pp. 147-152.
    • (2002) Proc. LREC , pp. 147-152
    • Takezawa, T.1    Sumita, E.2    Sugaya, F.3    Yamamoto, H.4    Yamamoto, S.5
  • 40
    • 0011946055
    • CHATR: A high-definition speech resequencing system
    • N. Campbell, "CHATR: a high-definition speech resequencing system," in Proc. ASA/JASA Joint Meeting, 1996, pp. 1223-1228.
    • (1996) Proc. ASA/JASA Joint Meeting , pp. 1223-1228
    • Campbell, N.1
  • 42
    • 85009064344
    • Improving genericity for task-independent speech recognition
    • F. Lefevre, J. L. Gauvain, and L. Lamel, "Improving genericity for task-independent speech recognition," in Proc. Eurospeech, 2001, pp. 1241-1244.
    • (2001) Proc. Eurospeech , pp. 1241-1244
    • Lefevre, F.1    Gauvain, J.L.2    Lamel, L.3
  • 45
    • 85124698057
    • The architecture of the Festival speech synthesis system
    • Sydney, Australia, Nov
    • P. Taylor, A. Black, and R. Caley, "The architecture of the Festival speech synthesis system," in Proc. Third Int. Workshop Speech Synthesis, Sydney, Australia, Nov. 1998, pp. 147-151.
    • (1998) Proc. Third Int. Workshop Speech Synthesis , pp. 147-151
    • Taylor, P.1    Black, A.2    Caley, R.3
  • 46
    • 33947141155
    • An introduction to ATRPTH: A phonetically rich sentence set based Chinese Putonghua speech database developed by ATR
    • Fall
    • J. S. Zhang, M. Mizumachi, F. Soong, and S. Nakamura, "An introduction to ATRPTH: a phonetically rich sentence set based Chinese Putonghua speech database developed by ATR," in Proc. ASJ Meeting, Fall 2003, pp. 167-168.
    • (2003) Proc. ASJ Meeting , pp. 167-168
    • Zhang, J.S.1    Mizumachi, M.2    Soong, F.3    Nakamura, S.4
  • 47
    • 85009079604
    • A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition
    • J. S. Zhang, S. W. Zhang, Y. Sagisaka, and S. Nakamura, "A hybrid approach to enhance task portability of acoustic models in Chinese speech recognition," in Proc. Eurospeech, vol. 3, 2001, pp. 1661-1663.
    • (2001) Proc. Eurospeech , vol.3 , pp. 1661-1663
    • Zhang, J.S.1    Zhang, S.W.2    Sagisaka, Y.3    Nakamura, S.4
  • 48
    • 0038719307
    • A study on acoustic modeling of pauses for recognizing noisy conversational speech
    • J. S. Zhang, K. Markov, T. Matsui, and S. Nakamura, "A study on acoustic modeling of pauses for recognizing noisy conversational speech," Proc. IEICE Trans. Inf. Syst., vol. 86-D, no. 3, pp. 489-496, 2003.
    • (2003) Proc. IEICE Trans. Inf. Syst , vol.86-D , Issue.3 , pp. 489-496
    • Zhang, J.S.1    Markov, K.2    Matsui, T.3    Nakamura, S.4
  • 49
    • 85009064490
    • Evaluation of the ATR-MATRIX speech translation system with a pair comparison method between the system and humans
    • F. Sugaya, T. Takezawa, A. Yokoo, Y. Sagisaka, and S. Yamamoto, "Evaluation of the ATR-MATRIX speech translation system with a pair comparison method between the system and humans," in Proc. ICSLP, 2000, pp. 1105-1108.
    • (2000) Proc. ICSLP , pp. 1105-1108
    • Sugaya, F.1    Takezawa, T.2    Yokoo, A.3    Sagisaka, Y.4    Yamamoto, S.5
  • 50
    • 33751352564
    • Chunk-based statistical translation
    • T. Watanabe, E. Sumita, and H. Okuno, "Chunk-based statistical translation," Proc. ACL, pp. 303-310, 2003.
    • (2003) Proc. ACL , pp. 303-310
    • Watanabe, T.1    Sumita, E.2    Okuno, H.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.