SCOPUS 정보 검색 플랫폼

HLT/EMNLP 2005 - Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference

Volumn , Issue , 2005, Pages 217-224

A salience driven approach to robust input interpretation in multimodal conversational systems

(2) Chai, Joyce Y a Qu, Shaolin a

a Michigan State University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CONCEPT IDENTIFICATION; CONVERSATIONAL SYSTEMS; GRAPHICAL DISPLAYS; MULTIMODAL CONVERSATION; MULTIMODAL INPUTS; PHYSICAL WORLD; SPOKEN LANGUAGE UNDERSTANDING; WORD ERROR RATE;

NATURAL LANGUAGE PROCESSING SYSTEMS;

EID: 80053238919 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.3115/1220575.1220603 Document Type: Conference Paper

Times cited : (10)

References (34)

1
- 85009133573
- Integrating multimodal language processing with speech recognition
- Bangalore, S. and Johnston, M. 2000. Integrating Multimodal Language Processing with Speech Recognition. In Proceedings of ICSLP.
- (2000) Proceedings of ICSLP
- Bangalore, S.¹ Johnston, M.²

2
- 85022919385
- Class-based n-gram models of natural language
- Brown, P., Della Pietra, V. J., deSouza, P. V., Lai, J. C, and Mercer, R. L. 1992. Class-based n-gram models of natural language. Computational Linguistics, 18(4):467-479.
- (1992) Computational Linguistics , vol.18 , Issue.4 , pp. 467-479
- Brown, P.¹ Della Pietra, V.J.² DeSouza, P.V.³ Lai, J.C.⁴ Mercer, R.L.⁵

3
- 26944498636
- Utilizing visual attention for cross-modal coreference interpretation
- Byron, D., Mampilly, T., Sharma, V., and Xu, T. 2005. Utilizing Visual Attention for Cross-Modal Coreference Interpretation. Spring Lecture Notes in Computer Science: Proceedings of Context-05, page 83-96.
- (2005) Spring Lecture Notes in Computer Science: Proceedings of Context-05 , pp. 83-96
- Byron, D.¹ Mampilly, T.² Sharma, V.³ Xu, T.⁴

4
- 0003949731
- Cognitive Science Society
- Cassell, J., Stone, M., Douville, B., Prevost, S., Achorn, B., Steedman, M., Badler, N., and Pelachaud, C. 1994. Modeling the interaction between speech and gesture. Cognitive Science Society.
- (1994) Modeling The Interaction between Speech and Gesture
- Cassell, J.¹ Stone, M.² Douville, B.³ Prevost, S.⁴ Achorn, B.⁵ Steedman, M.⁶ Badler, N.⁷ Pelachaud, C.⁸

5
- 33644594894
- Linguistic theories in efficient multimodal reference resolution: An Empirical Investigation
- San Diego, CA
- Chai, J. Y., Prasov, Z., Blaim, J., and Jin, R. 2005. Linguistic Theories in Efficient Multimodal Reference Resolution: an Empirical Investigation. The 10th International Conference on Intelligent User Interfaces (IUI-05), pp. 43-50, San Diego, CA.
- (2005) 10th International Conference on Intelligent User Interfaces (IUI-05) , pp. 43-50
- Chai, J.Y.¹ Prasov, Z.² Blaim, J.³ Jin, R.⁴

6
- 85149108619
- Optimization in multimodal interpretation
- Barcelona, Spain
- Chai, J. Y., Hong, P., Zhou, M. X, and Prasov, Z. 2004b. Optimization in Multimodal Interpretation. In Proceedings of ACL, pp. 1-8, Barcelona, Spain.
- (2004) Proceedings of ACL , pp. 1-8
- Chai, J.Y.¹ Hong, P.² Zhou X, M.³ Prasov, Z.⁴

7
- 18744368762
- A probabilistic approach to reference resolution in multimodal user interfaces
- IUI 04: 2004 International Conference on Intelligent User Interfaces
- Chai, J. Y., Hong, P., and Zhou, M. 2004a. A Probabilistic Approach to Reference Resolution in Multimodal User Interfaces. Proceedings of 9th International Conference on Intelligent User Interfaces (IUI-04), pp. 70-77, Madeira, Portugal. (Pubitemid 40673704)
- (2004) International Conference on Intelligent User Interfaces, Proceedings IUI , pp. 70-77
- Chai, J.Y.¹ Hong, P.² Zhou, M.X.³

8
- 0034295822
- Structured language modeling
- Chelba, C. and Jelinek, F. 2000. Structured language modeling. Computer Speech and Language, 14(4):283-332.
- (2000) Computer Speech and Language , vol.14 , Issue.4 , pp. 283-332
- Chelba, C.¹ Jelinek, F.²

9
- 0031380441
- Quickset: Multimodal interaction for distributed applications
- Cohen, P., Johnston, M., McGee, D., Oviatt, S., Pittman, J.; Smith, I., Chen, L., and Clow, J. 1996. Quickset: Multimodal Interaction for Distributed Applications. Proceedings of ACM Multimedia, 31-40.
- (1996) Proceedings of ACM Multimedia , pp. 31-40
- Cohen, P.¹ Johnston, M.² McGee, D.³ Oviatt, S.⁴ Pittman, J.⁵ Smith, I.⁶ Chen, L.⁷ Clow, J.⁸

10
- 85117714184
- A salience-based approach to gesture-speech alignment
- Eisenstein J. and Christoudias. C. 2004. A salience-based approach to gesture-speech alignment. In Proceedings of HLT/NAACL'04.
- (2004) Proceedings of HLT/NAACL'04
- Eisenstein, J.¹ Christoudias, C.²

11
- 85121030057
- Topic-based language models using EM
- Gildea, D. and Hofmann, T. 1999. Topic-based language models using EM. In Proceedings of Eurospeech.
- (1999) Proceedings of Eurospeech
- Gildea, D.¹ Hofmann, T.²

12
- 0034776038
- Gaze durations during speech reflect word selection and phonological encoding
- DOI 10.1016/S0010-0277(01)00138-X, PII S001002770100138X
- Griffin, Z. M. 2001. Gaze durations during speech reflect word selection and phonological encoding. Cognition 82, B1-B14. (Pubitemid 32979187)
- (2001) Cognition , vol.82 , Issue.1
- Griffin, Z.M.¹

13
- 79956213344
- Centering: A framework for modeling the local coherence of discourse
- Grosz, B. J., Joshi, A. K., and Weinstein, S. 1995. Centering: A framework for modeling the local coherence of discourse. Computational Linguistics, 21(2).
- (1995) Computational Linguistics , vol.21 , Issue.2
- Grosz, B.J.¹ Joshi, A.K.² Weinstein, S.³

14
- 0000534475
- Logic and Conversation. In Cole, P. and Morgan, J. eds. New York, New York: Academic Press
- Grice, H. P. Logic and Conversation. 1975. In Cole, P., and Morgan, J., eds. Speech Acts. New York, New York: Academic Press. 41-58.
- (1975) Speech Acts , pp. 41-58
- Grice, H.P.¹

15
- 85015757336
- Cognitive status and the form of referring expressions in discourse
- Gundel, J. K., Hedberg, N., and Zacharski, R. 1993. Cognitive Status and the Form of Referring Expressions in Discourse. Language 69(2):274-307.
- (1993) Language , vol.69 , Issue.2 , pp. 274-307
- Gundel, J.K.¹ Hedberg, N.² Zacharski, R.³

16
- 84871612068
- The effectiveness of corpus-induced dependency grammars for post-processing speech
- Harper, M.., White, C., Wang, W., Johnson, M., and Helzerman, R. 2000. The Effectiveness of Corpus-Induced Dependency Grammars for Post-processing Speech. Proceedings of the North American Association for Computational Linguistics, 102-109.
- (2000) Proceedings of the North American Association for Computational Linguistics , pp. 102-109
- Harper, M.¹ White, C.² Wang, W.³ Johnson, M.⁴ Helzerman, R.⁵

17
- 0039623602
- POS tags and decision trees for language modeling
- Heeman. P. 1999. POS tags and decision trees for language modeling. In Proceedings of the Conference on Empirical Methods in Natural Language Process (EMNLP).
- (1999) Proceedings of the Conference on Empirical Methods in Natural Language Process (EMNLP)
- Heeman, P.¹

18
- 84937291102
- Automatic referent resolution of deictic and anaphoric expressions
- Huls, C., Bos, E., and Classen, W. 1995. Automatic Referent Resolution of Deictic and Anaphoric Expressions. Computational Linguistics, 21(1):59-79.
- (1995) Computational Linguistics , vol.21 , Issue.1 , pp. 59-79
- Huls, C.¹ Bos, E.² Classen, W.³

19
- 0001882615
- Self-organized language modeling for speech recognition
- Waibel, A. and Lee, K. F. (Eds)
- Jelinek, F. 1990. Self-organized language modeling for speech recognition. In Waibel, A. and Lee, K. F. (Eds). Readings in Speech Recognition, pp. 450-506.
- (1990) Readings in Speech Recognition , pp. 450-506
- Jelinek, F.¹

20
- 85149128609
- Unification-based multimodal parsing
- Johnston, M. 1998. Unification-based Multimodal parsing, Proceedings of COLING-ACL.
- (1998) Proceedings of COLING-ACL
- Johnston, M.¹

21
- 85149125977
- MATCH: An architecture for multimodal dialog systems
- Philadelphia
- Johnston, M., Bangalore, S., Visireddy G., Stent, A., Ehlen, P., Walker, M., Whittaker, S., and Maloor, P. 2002. MATCH: An Architecture for Multimodal Dialog Systems, in Proceedings of the 40th ACL, Philadelphia, pp. 376-383.
- (2002) Proceedings of the 40th ACL , pp. 376-383
- Johnston, M.¹ Bangalore, S.² Visireddy, G.³ Stent, A.⁴ Ehlen, P.⁵ Walker, M.⁶ Whittaker, S.⁷ Maloor, P.⁸

22
- 0023312404
- Estimation of probabilities from sparse data for the language model component of a speech recognizer
- Katz, S. M. 1987. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Transactions on Acoustics, Speech, and Signal Processing, 35(3).
- (1987) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.35 , Issue.3
- Katz, S.M.¹

23
- 85158093287
- Cognitive status and form of reference in multimodal human-computer interaction
- Kehler, A. 2000. Cognitive Status and Form of Reference in Multimodal Human-Computer Interaction, Proceedings of AAAI'01.
- (2000) Proceedings of AAAI'01
- Kehler, A.¹

24
- 85123963268
- Improved clustering techniques for class-based statistical language modeling
- Kneser, R. and Ney, H. 1993. Improved clustering techniques for class-based statistical language modeling. In Eurospeech'93, pp. 973-976.
- (1993) Eurospeech'93 , pp. 973-976
- Kneser, R.¹ Ney, H.²

25
- 3843116149
- Visual salience and perceptual grouping in multimodal interactivity
- Verona, Italy
- Landragin, F., Bellalem, N., and Romary, L. 2001. Visual Salience and Perceptual Grouping in Multimodal Interactivity. In: First International Workshop on Information Presentation and Natural Multimodal Dialogue, Verona, Italy, pp. 151-155.
- (2001) First International Workshop on Information Presentation and Natural Multimodal Dialogue , pp. 151-155
- Landragin, F.¹ Bellalem, N.² Romary, L.³

26
- 30844441070
- An algorithm for pronominal anaphora resolution
- Lappin, S., and Leass, H. 1994. An algorithm for pronominal anaphora resolution. Computational Linguistics, 20(4).
- (1994) Computational Linguistics , vol.20 , Issue.4
- Lappin, S.¹ Leass, H.²

27
- 0032684957
- Mutual disambiguation of recognition errors in a multimodal architecture
- Oviatt, S. 1999. Mutual Disambiguation of Recognition Errors in a Multimodal Architecture. In Proceedings of CHI.
- (1999) Proceedings of CHI
- Oviatt, S.¹

28
- 4243107294
- Multimodal conversational systems for automobiles
- Pieraccini, R., Dayandhi, K., Bloom, J., Dahan, J.-G., Phillips, M., Goodman, B. R., Prasad, K. V., 2004. Multimodal Conversational Systems for Automobiles, Communications of the ACM, Vol. 47, No. 1, pp. 47-49.
- (2004) Communications of the ACM , vol.47 , Issue.1 , pp. 47-49
- Pieraccini, R.¹ Dayandhi, K.² Bloom, J.³ Dahan, J.-G.⁴ Phillips, M.⁵ Goodman, B.R.⁶ Prasad, K.V.⁷

29
- 10844243736
- Towards situated speech understanding: Visual context priming of language models
- Roy, D. and Mukherjee, N. 2005. Towards Situated Speech Understanding: Visual Context Priming of Language Models. Computer Speech and Language, 19(2): 227-248.
- (2005) Computer Speech and Language , vol.19 , Issue.2 , pp. 227-248
- Roy, D.¹ Mukherjee, N.²

30
- 80053228383
- A class-based language model for large vocabulary speech recognition extracted from part-of-speech statistics
- Samuelsson, C. and Reichl, W. 1999. A class-based Language Model for Large Vocabulary Speech Recognition Extracted from Part-of-Speech Statistics. In IEEE ICASSP'99.
- (1999) IEEE ICASSP'99
- Samuelsson, C.¹ Reichl, W.²

31
- 27344451549
- Using model-theoretic semantic interpretation to guide statistical parsing and word recognition in a spoken language interface
- Sapporo, Japan
- Schuler, W. 2003. Using model-theoretic semantic interpretation to guide statistical parsing and word recognition in a spoken language interface. In Proceedings of ACL, Sapporo, Japan.
- (2003) Proceedings of ACL
- Schuler, W.¹

32
- 0034841734
- A dynamic semantic model for rescoring recognition hypothesis
- Wai, C., Pierraccinni, R., and Meng, H. 2001. A Dynamic Semantic Model for Rescoring Recognition Hypothesis. Proceedings of the ICASSP.
- (2001) Proceedings of the ICASSP
- Wai, C.¹ Pierraccinni, R.² Meng, H.³

33
- 4544358964
- The superARV language model: In Investigating the effectiveness of tightly integrating multiple knowledge sources
- Wang, W. and Harper, M. 2002. The superARV language model: In Investigating the effectiveness of tightly integrating multiple knowledge sources. In Proceedings of EMNLP, 238-247.
- (2002) Proceedings of EMNLP , pp. 238-247
- Wang, W.¹ Harper, M.²

34
- 0026372948
- Integration of speech recognition and natural language processing in the MIT voyager system
- Zue, V., Glass, J., Goodine, D., Leung, H., Phillips, M., Polifroni, J., and Seneff, S. 1991. Integration of Speech Recognition and Natural Language Processing in the MIT Voyager System. Proceedings of the ICASSP.
- (1991) Proceedings of the ICASSP
- Zue, V.¹ Glass, J.² Goodine, D.³ Leung, H.⁴ Phillips, M.⁵ Polifroni, J.⁶ Seneff, S.⁷

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.