메뉴 건너뛰기




Volumn 88, Issue 8, 2000, Pages 1314-1336

Speech and language processing for next-millennium communications services

Author keywords

Dialogue management; Speaker recognition; Speech coding; Speech processing; Speech recognition; Speech synthesis; Spoken language understanding

Indexed keywords


EID: 0042660763     PISSN: 00189219     EISSN: None     Source Type: Journal    
DOI: 10.1109/5.880086     Document Type: Article
Times cited : (48)

References (81)
  • 1
    • 33646917431 scopus 로고    scopus 로고
    • P.ML: A language interface to networked voice response anils, in
    • Chicago, IL, May
    • J. C. Ramming, "P.ML: A language interface to networked voice response anils," in Proc. Workshop on Internet Programming Languages, Chicago, IL, May 1998.
    • (1998) Proc. Workshop on Internet Programming Languages
    • Ramming, J.C.1
  • 2
    • 0002174507 scopus 로고
    • The vocoder
    • H. Dudley, "The vocoder," Bell Lab. Rec., vol. IS, pp. 122-126, 1939.
    • (1939) Bell Lab. Rec. , vol.IS , pp. 122-126
    • Dudley, H.1
  • 3
    • 0020269695 scopus 로고
    • 32 kb/s ADPCM-DLQ coding for network applications, in
    • D. W. Petr, "32 kb/s ADPCM-DLQ coding for network applications," in Proc. IEEE GLOBECOM, Dec. 1982, pp. A8.3-1 -A8.3-5.
    • (1982) Proc. IEEE GLOBECOM, Dec.
    • Petr, D.W.1
  • 4
    • 0002279871 scopus 로고    scopus 로고
    • Linear-prediction based anaiysis-by-synthesis coding, in
    • W. B. Kleijn and K. K. Paliwal, Eds. Asterdam, The Netherlands: Elsevier
    • P. Kroon and W. B. Kleijn, "Linear-prediction based anaiysis-by-synthesis coding," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paliwal, Eds. Asterdam, The Netherlands: Elsevier, J995, pp. 79-119.
    • Speech Coding and Synthesis , pp. 79-119
    • Kroon, P.1    Kleijn, W.B.2
  • 7
    • 0029725602 scopus 로고    scopus 로고
    • A 2.4 kbit/s MELP coder candidate for the new US Federal Standard
    • A. McCree, K.. Truong, E. George, T. Barnweli, and V. Viswanathan, "A 2.4 kbit/s MELP coder candidate for the new US Federal Standard," in Proc. ICASSP'96, vol. 1, 1996, pp. 200-203.
    • (1996) Proc. ICASSP'96 , vol.1 , pp. 200-203
    • McCree, A.1    Truong, K.2    George, E.3    Barnweli, T.4    Viswanathan, V.5
  • 8
    • 0001935942 scopus 로고
    • W. B. Kleijn and K. K. Paiiwal, Eds. Asterdam, The Netherlands: Elsevier,'
    • R. J. McAulay and T. F. Quatieri, "Sinusoidal coding," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paiiwal, Eds. Asterdam, The Netherlands: Elsevier,' 1995, pp. 121-173.
    • (1995) Speech Coding and Synthesis , pp. 121-173
    • McAulay, R.J.1    Quatieri, T.F.2    Coding, S.3
  • 9
    • 0000066010 scopus 로고
    • Waveform interpolation for coding and synthesis, in
    • W. B. Kleijn and K. K. Paiiwal, Eds. Asterdam, The Netherlands: Elsevier
    • W. B. Kleijn and J. Haagen, "Waveform interpolation for coding and synthesis," in Speech Coding and Synthesis, W. B. Kleijn and K. K. Paiiwal, Eds. Asterdam, The Netherlands: Elsevier, 1995, pp. 175-207.
    • (1995) Speech Coding and Synthesis , pp. 175-207
    • Kleijn, W.B.1    Haagen, J.2
  • 12
    • 0029765811 scopus 로고    scopus 로고
    • Unit selection in a concatenate speech synthesis system using a large speech database, in
    • A. Hunt and A. Black, "Unit selection in a concatenate speech synthesis system using a large speech database," in Proc. ICASSP'96, 1996, pp. 373-376.
    • (1996) Proc. ICASSP'96 , pp. 373-376
    • Hunt, A.1    Black, A.2
  • 14
    • 0032178446 scopus 로고    scopus 로고
    • Animated talking head with personalized 3D Head Model
    • J. Ostennann, "Animated talking head with personalized 3D Head Model," J. VLSI Signal Process., vol. 20, pp. 97-105, 1998.
    • (1998) J. VLSI Signal Process. , vol.20 , pp. 97-105
    • Ostennann, J.1
  • 15
    • 84872004031 scopus 로고    scopus 로고
    • Sample-based synthesis of photo-realistic talking-heads, in
    • E. Cosatto and H. P. Grat', "Sample-based synthesis of photo-realistic talking-heads," in Proc. Computer Animation, 1998, pp. 103-110.
    • (1998) Proc. Computer Animation , pp. 103-110
    • Cosatto, E.1    Grat, H.P.2
  • 16
    • 84962695927 scopus 로고    scopus 로고
    • Animation of synthetic faces in M PEG-4, in
    • J. Ostcrmann, "Animation of synthetic faces in M PEG-4," in Proc. Computer Animation, 1998, pp. 49-55.
    • (1998) Proc. Computer Animation , pp. 49-55
    • Ostcrmann, J.1
  • 18
    • 0000706716 scopus 로고
    • State of the art in continuous speech recognition, in
    • D. Roe and J. Wilpon, Eds. Washington, DC: National Academy Press
    • J. Makhoul and J. Schwanz, "State of the art in continuous speech recognition," in Voice Communication between Humans and Machines., D. Roe and J. Wilpon, Eds. Washington, DC: National Academy Press, 1994, pp. 165-188.
    • (1994) Voice Communication between Humans and Machines. , pp. 165-188
    • Makhoul, J.1    Schwanz, J.2
  • 20
    • 27144558930 scopus 로고    scopus 로고
    • Stochastic language models for speech recognition and understanding, in
    • G. Riccardi and A. Gorin, "Stochastic language models for speech recognition and understanding," in Proc. ICSLP 98. 199S, pp. 2087-2090.
    • Proc. ICSLP , vol.98 , pp. 2087-2090
    • Riccardi, G.1    Gorin, A.2
  • 21
    • 0033872141 scopus 로고    scopus 로고
    • Utterance verification in continuous speech recognition: Decoding and training procedures
    • R. Rose and E. Lleida, "Utterance verification in continuous speech recognition: Decoding and training procedures," IEEE Trans. Speech Audio Processing, vol. 8, pp. 126-139, Mar 2000.
    • (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 126-139
    • Rose, R.1    Lleida, E.2
  • 22
    • 0032661656 scopus 로고    scopus 로고
    • NetworK optimization fo large-vocabulary speech recognition
    • M. Mohri and M. Riley, "NetworK optimization fo large-vocabulary speech recognition," Speech Commun., vol. 2X. pp. 1-12, 1999.
    • (1999) Speech Commun. , vol.2 , pp. 1-12
    • Mohri, M.1    Riley, M.2
  • 23
    • 21444449828 scopus 로고    scopus 로고
    • Weighed automata in text and speech processing, presented at the
    • M. Mohri. F. Pereira, and ,V1. Riley, " Weighed automata in text and speech processing," presented at the Proc. ECAI-96 Workshop, Budapest, Hungary, 1996.
    • (1996) Proc. ECAI-96
    • Mohri, M.1    Pereira, F.2    Riley, V.3
  • 28
    • 85135259135 scopus 로고    scopus 로고
    • Integrated context-dependent networks in very large vocabulary speech recognition," in
    • M. Mohri and VI. Riley, "integrated context-dependent networks in very large vocabulary speech recognition," in Proc. 6th Euro. Cor.f. Speech Communication and Technologe. Budapest, Hungary, 1999, pp. 811-814.
    • (1999) Proc. , vol.6 , pp. 811-814
    • Mohri, M.1    Riley, V.I.2
  • 29
    • 84892168937 scopus 로고    scopus 로고
    • Full expansion of context-dependent networks in large vocabulary speech recognition, presented at the
    • M. Mohri, M. Riiey, D. Hindle, A. Ljolje, and F. Pereira, "Full expansion of context-dependent networks in large vocabulary speech recognition," presented at the Proc. ICASSP'9S, vol. Il, I998, pp. 665-668.
    • Proc. ICASSP'9S , pp. 665-668
    • Mohri, M.1    Riiey, M.2    Hindle, D.3    Ljolje, A.4    Pereira, F.5
  • 31
    • 33646918841 scopus 로고    scopus 로고
    • On the use of formal grammars, in
    • R. De Mori, Ed. London, U.K.: Academic
    • A. Corazza and R. De Mori, "On the use of formal grammars," in Spoken Dialogues w/'uh Computers. R. De Mori, Ed. London, U.K.: Academic, 1998, pp. 461-484.
    • (1998) Spoken Dialogues W/'uh Computers. , pp. 461-484
    • Corazza, A.1    De Mori, R.2
  • 32
    • 3042662922 scopus 로고    scopus 로고
    • Sentence interpretation,"
    • R. De Mori, Ed. London, U.K.: Academic
    • R. Kuhn and R. De Mori, "Sentence interpretation," in Spoken Dialogues with Computers. R. De Mori, Ed. London, U.K.: Academic, 1998, pp. 486-522.
    • (1998) Spoken Dialogues with Computers. , pp. 486-522
    • Kuhn, R.1    De Mori, R.2
  • 33
    • 33646923397 scopus 로고
    • Natural communication between person and computer, in
    • W. G. Lehnen and M. H. Ringle, Eds. Hillsdale, NJ: Lawrence Eribaum
    • B. C. Hruce, "Natural communication between person and computer," in Strategies for Natural Language Processing. W. G. Lehnen and M. H. Ringle, Eds. Hillsdale, NJ: Lawrence Eribaum, 1982, pp. 55-88.
    • (1982) Strategies for Natural Language Processing. , pp. 55-88
    • Hruce, B.C.1
  • 34
    • 33646935414 scopus 로고
    • Models of natural language understanding, in
    • D. Roe and J. Wilpon, Eds. Washington, DC: National Academy Press.
    • M. Bates, "Models of natural language understanding," in Voice Communication Between Humans and Machines. D. Roe and J. Wilpon, Eds. Washington, DC: National Academy Press. 1994, pp. 238-253.
    • (1994) Voice Communication between Humans and Machines. , pp. 238-253
    • Bates, M.1
  • 35
    • 0008525457 scopus 로고
    • "The roles of language processing in a spoken language interface, in
    • D. Roe and J. Wilpon, Eds. Washington, DC: National Academy Press
    • L. Hirschman, "The roles of language processing in a spoken language interface," in Voice Communication Between Humans and Machines, D. Roe and J. Wilpon, Eds. Washington, DC: National Academy Press, 1994, pp. 217-237.
    • (1994) Voice Communication between Humans and Machines , pp. 217-237
    • Hirschman, L.1
  • 36
    • 85009081750 scopus 로고
    • Robust parsing for spoken language systems, in
    • S. Scncff, "Robust parsing for spoken language systems," in Proc. ICASSP, 1992, pp. 189-192.
    • (1992) Proc. ICASSP , pp. 189-192
    • Scncff, S.1
  • 37
    • 33646932025 scopus 로고
    • A context-free grammar compiler for speech understanding systems, in
    • M. K. Brown and B. Bunlschuh, "A context-free grammar compiler for speech understanding systems," in Proc. 1CSLP 94, 1994, pp. 21-24.
    • (1994) Proc. 1CSLP 94 , pp. 21-24
    • Brown, M.K.1    Bunlschuh, B.2
  • 39
    • 33646932922 scopus 로고    scopus 로고
    • Spoken language understanding within dialogs using a graphical model of task structure, presented at the
    • J. Wright. A. Gorin, and A. Abella, "Spoken language understanding within dialogs using a graphical model of task structure," presented at the Proc. JCSLP 9X, 1998.
    • (1998) Proc. JCSLP 9X
    • Wright, J.1    Gorin, A.2    Abella, A.3
  • 42
    • 0029002615 scopus 로고
    • On automated language acquisition
    • A. L. Gorin, "On automated language acquisition," J. Acoust. Soc. Amer., vol. 97. pp. 3441-3461, 1995.
    • (1995) J. Acoust. Soc. Amer. , vol.97 , pp. 3441-3461
    • Gorin, A.L.1
  • 43
    • 85135142918 scopus 로고    scopus 로고
    • Dialogue strategies guiding users to their communicative goals, in
    • M. Denecke and A. Waibel, "Dialogue strategies guiding users to their communicative goals," in Proc. Eurosoeech 1997, 1997. pp. 1339-1342.
    • (1997) Proc. Eurosoeech 1997 , pp. 1339-1342
    • Denecke, M.1    Waibel, A.2
  • 44
    • 0033908288 scopus 로고    scopus 로고
    • Stochastic language adaptation over time and state in natural spoken dialogue systems
    • Jan.
    • G. Riccardi and A. L. Gorin, "Stochastic language adaptation over time and state in natural spoken dialogue systems," IEEE Trans. Speech Audio Processing, vol. 8, pp. 3-10, Jan. 2000.
    • (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 3-10
    • Riccardi, G.1    Gorin, A.L.2
  • 45
    • 33646916861 scopus 로고    scopus 로고
    • Sentence generation
    • R. De Mori, Ed. London, L.K.: Academic
    • C. Sorin and R. De Mori, "Sentence generation," in Spoken Dialogues with Computers, R. De Mori, Ed. London, L.K.: Academic, 1998, pp. 503-582.
    • (1998) Spoken Dialogues with Computers , pp. 503-582
    • Sorin, C.1    De Mori, R.2
  • 46
    • 33646949943 scopus 로고
    • Practical issues in dialogue design, in
    • M. .VI. Tayior, F. Xeel, and D G. Bouwhuis, Eds. Amsterdam, The Netherlands: Eisevier
    • C. A. McCann, W. Edmoason, and R. K. Moore, "Practical issues in dialogue design," in The Structure of \lidtii:;oij.iil Dialogue, M. .VI. Tayior, F. Xeel, and D G. Bouwhuis, Eds. Amsterdam, The Netherlands: Eisevier, 1989, pp. 467-480.
    • (1989) The Structure of \Lidtii:;oij.iil Dialogue , pp. 467-480
    • McCann, C.A.1    Edmoason, W.2    Moore, R.K.3
  • 49
    • 0029749080 scopus 로고    scopus 로고
    • Field trial evaluations of two different information inquiry systems
    • R. Billi, G. Castagp.eri, and M. Danieli, "Field trial evaluations of two different information inquiry systems." in Proc. I\TTA, 1996, pp. 129-134.
    • (1996) Proc. I\TTA , pp. 129-134
    • Billi, R.1    Castagp.eri, G.2    Danieli, M.3
  • 50
    • 0002885822 scopus 로고    scopus 로고
    • Empirically evaluating an adaptable spoken dialogue system
    • D. J. Liiman and S. Pan, "Empirically evaluating an adaptable spoken dialogue system," presented at the Proc. 7;h In!. Conf. User Modeling, 1999.
    • (1999) Proc. 7;h In!. Conf. User Modeling
    • Liiman, D.J.1    Pan, S.2
  • 52
    • 33646923097 scopus 로고    scopus 로고
    • A peak, a plateau, or a stiff climb'.'
    • Winter
    • M. Walker, "A peak, a plateau, or a stiff climb'.'," ELSNEWS 8.1, pp. 1-3. Winter 1999.
    • (1999) ELSNEWS 8.1 , pp. 1-3
    • Walker, M.1
  • 53
    • 0142018800 scopus 로고    scopus 로고
    • The grounding problem in conversation with and through computers, in
    • S. R. Fussell and R. J. Keutz, Eds. Hillsd'ale, NJ: Lawrence Erlbaum, .
    • S. E. Brennan, "The grounding problem in conversation with and through computers," in Soda! and Cognitive Psychological Approaches to Interpersonal Communication, S. R. Fussell and R. J. Keutz, Eds. Hillsd'ale, NJ: Lawrence Erlbaum, i998, pp. 201-255.
    • Soda! and Cognitive Psychological Approaches to Interpersonal Communication , pp. 201-255
    • Brennan, S.E.1
  • 55
    • 33646920041 scopus 로고    scopus 로고
    • Evaluating spoken language systems, in
    • C. Kamm. M. Walker, and D. Litman, "Evaluating spoken language systems," in Proc. AVIOS 1999, 1999, pp. 187-197.
    • (1999) Proc. AVIOS 1999 , pp. 187-197
    • Kamm, C.1    Walker, M.2    Litman, D.3
  • 56
    • 0030355546 scopus 로고    scopus 로고
    • On-une incremental adaptation for speaker verification using maximum likelihood eslimates of CDHMM parameters, in
    • K. Yu and J. Mason, "On-une incremental adaptation for speaker verification using maximum likelihood eslimates of CDHMM parameters," in Proc. 1CSLP 96, 1996, pp. 1752-1755.
    • (1996) Proc. 1CSLP 96 , pp. 1752-1755
    • Yu, K.1    Mason, J.2
  • 57
    • 84871607712 scopus 로고    scopus 로고
    • Speaker identification with user-selected password phrases, in
    • A. E. Rosenberg and S. Parthasarathy, "Speaker identification with user-selected password phrases," in Proc. Eurospeech'97, 1997, pp. 1371-1374.
    • (1997) Proc. Eurospeech'97 , pp. 1371-1374
    • Rosenberg, A.E.1    Parthasarathy, S.2
  • 58
    • 84871612783 scopus 로고    scopus 로고
    • A comparative study of speaker verification systems using the Polycast database, in
    • T. Nordstrom, H. Melin, and J. Lindbcrg, "A comparative study of speaker verification systems using the Polycast database," in Proc. 1CSLP'98, 1998, pp. 1359-1362.
    • (1998) Proc. 1CSLP'98 , pp. 1359-1362
    • Nordstrom, T.1    Melin, H.2    Lindbcrg, J.3
  • 59
    • 85128364516 scopus 로고    scopus 로고
    • An implementation and evaluation of an on-line speaker verification system for field trials, in
    • Y. Gu and T. Thomas, "An implementation and evaluation of an on-line speaker verification system for field trials," in Proc. ICSLP'98, 1998, pp. 125-128.
    • (1998) Proc. ICSLP' , vol.98 , pp. 125-128
    • Gu, Y.1    Thomas, T.2
  • 61
    • 0032074814 scopus 로고    scopus 로고
    • On the application of multimedia processing to telecommunications
    • May
    • R. V. Cox. B. Haskell, Y. LeCun, B. Shahraray, and L. Rabiner, "On the application of multimedia processing to telecommunications," Proc. IEEE, vol. 86, pp. 755-824, May 1998.
    • (1998) Proc. IEEE , vol.86 , pp. 755-824
    • Cox, R.V.1    Haskell, B.2    Lecun, Y.3    Shahraray, B.4    Rabiner, L.5
  • 62
    • 0032678171 scopus 로고    scopus 로고
    • Automated generation of news content hierarchy by integrating audio, video, and text information, in
    • Q. Huang, Z. Liu, A. Rosenberg, D. Gibbon, and B. Shahraray, "Automated generation of news content hierarchy by integrating audio, video, and text information," in Proc. ICASSP'99, vol. VI, 1999, pp. 3025-3028.
    • (1999) Proc. ICASSP'99, Vol. VI , pp. 3025-3028
    • Huang, Q.1    Liu, Z.2    Rosenberg, A.3    Gibbon, D.4    Shahraray, B.5
  • 64
    • 0025473034 scopus 로고
    • ANShR: An application of speech technology to the Japanese banking industry
    • Aug.
    • R. Nakatsu, "ANShR: An application of speech technology to the Japanese banking industry," Computer, vol. 23, pp. 43-48, Aug. 1990.
    • (1990) Computer , vol.23 , pp. 43-48
    • Nakatsu, R.1
  • 65
    • 33646936550 scopus 로고    scopus 로고
    • private communication.
    • S. Furui, private communication.
    • Furui, S.1
  • 66
    • 0025468474 scopus 로고
    • Putting speech recognition to work in the telephone network
    • Aug.
    • M. Lenniz, "Putting speech recognition to work in the telephone network," Computer, vol. 23. pp. 35-41, Aug. 1990.
    • (1990) Computer , vol.23 , pp. 35-41
    • Lenniz, M.1
  • 67
  • 68
    • 0347735588 scopus 로고
    • ACNA-The Ameritech customer name and address servicer
    • M. Yuschik, E. Schwab, and L. Griffith, "ACNA-The Ameritech customer name and address servicer." J. AVIOS. vol. 15, pp. 21-33, 1994.
    • (1994) J. AVIOS. , vol.15 , pp. 21-33
    • Yuschik, M.1    Schwab, E.2    Griffith, L.3
  • 69
    • 0347105138 scopus 로고
    • Results from automating a name & address service with speech synthesis, in
    • D, Yashchin, S. Basson, A. Kalyanswamy, and K. Silveiman, "Results from automating a name & address service with speech synthesis," in Proc. AVIOS, 1992.
    • (1992) Proc. AVIOS
    • Yashchin, D.1    Basson, S.2    Kalyanswamy, A.3    Silveiman, K.4
  • 70
    • 0032315051 scopus 로고    scopus 로고
    • Evaluation of the dutch train timetable information system developed in the ARISE project, in
    • A. Sanderman, E. den Os, A. Cremers, L. Boves, and J. Sturm, "Evaluation of the dutch train timetable information system developed in the ARISE project," in Proc. IVTfA 98, 1998, pp. 91-96.
    • (1998) Proc. IVTfA 98 , pp. 91-96
    • Sanderman, A.1    Den Os, E.2    Cremers, A.3    Boves, L.4    Sturm, J.5
  • 71
    • 0032309293 scopus 로고    scopus 로고
    • Field trials of the Italian ARISE train timetable system
    • G. Casiagneri. P. Baggia, and M. Danieli, "Field trials of the Italian ARISE train timetable system." in Proc. IVTTA 98. 1998, pp. 97-102.
    • (1998) Proc. IVTTA 98. , pp. 97-102
    • Casiagneri, G.1    Baggia, P.2    Danieli, M.3
  • 74
    • 33646935185 scopus 로고
    • Director' assistance automation in Bell Canada: Trial results, in
    • M. Leimig, G. Bielby, and J. Massicote, "Director)' assistance automation in Bell Canada: Trial results," in Proc. 1VTTA 94, 1994, pp. 8-12.
    • (1994) Proc. 1VTTA 94 , pp. 8-12
    • Leimig, M.1    Bielby, G.2    Massicote, J.3
  • 75
  • 76
    • 0032320862 scopus 로고    scopus 로고
    • Automation of telecora Italia directory assistance service: Field trial results, in
    • R. Billi, F. Canavcsto, and C. Rullent, "Automation of telecora Italia directory assistance service: Field trial results," in Proc. IVTTA 98, 1998, pp. 11-14.
    • (1998) Proc. IVTTA 98 , pp. 11-14
    • Billi, R.1    Canavcsto, F.2    Rullent, C.3
  • 79
    • 33646910761 scopus 로고
    • The vocoder-Electrical re-creation of speech
    • H. Dudley, "The vocoder-Electrical re-creation of speech," J. Soc. Motion Pict. Eng., vol. 34, pp. 272-278, 1940.
    • (1940) J. Soc. Motion Pict. Eng. , vol.34 , pp. 272-278
    • Dudley, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.