-
1
-
-
85009133573
-
Integrating multimodal language processing with speech recognition
-
Bangalore, S. and Johnston, M. 2000. Integrating Multimodal Language Processing with Speech Recognition. In Proceedings of ICSLP.
-
(2000)
Proceedings of ICSLP
-
-
Bangalore, S.1
Johnston, M.2
-
2
-
-
85022919385
-
Class-based n-gram models of natural language
-
Brown, P., Della Pietra, V. J., deSouza, P. V., Lai, J. C., and Mercer, R. L. 1992. Class-based n-gram models of natural language. Computational Linguistics, 18(4):467-479.
-
(1992)
Computational Linguistics
, vol.18
, Issue.4
, pp. 467-479
-
-
Brown, P.1
Della Pietra, V.J.2
DeSouza, P.V.3
Lai, J.C.4
Mercer, R.L.5
-
3
-
-
26944498636
-
Utilizing visual attention for cross-modal coreference interpretation
-
Byron, D., Mampilly, T., Sharma, V., and Xu, T. 2005. Utilizing Visual Attention for Cross-Modal Coreference Interpretation. Springer Lecture Notes in Computer Science: Proceedings of Context-05, pp. 83-96.
-
(2005)
Springer Lecture Notes in Computer Science: Proceedings of Context-05
, pp. 83-96
-
-
Byron, D.1
Mampilly, T.2
Sharma, V.3
Xu, T.4
-
4
-
-
0003949731
-
-
Cognitive Science Society
-
Cassell, J., Stone, M., Douville, B., Prevost, S., Achorn, B., Steedman, M., Badler, N., and Pelachaud, C. 1994. Modeling the interaction between speech and gesture. Cognitive Science Society.
-
(1994)
Modeling The Interaction between Speech and Gesture
-
-
Cassell, J.1
Stone, M.2
Douville, B.3
Prevost, S.4
Achorn, B.5
Steedman, M.6
Badler, N.7
Pelachaud, C.8
-
5
-
-
33644594894
-
Linguistic theories in efficient multimodal reference resolution: An Empirical Investigation
-
San Diego, CA
-
Chai, J. Y., Prasov, Z., Blaim, J., and Jin, R. 2005. Linguistic Theories in Efficient Multimodal Reference Resolution: an Empirical Investigation. The 10th International Conference on Intelligent User Interfaces (IUI-05), pp. 43-50, San Diego, CA.
-
(2005)
10th International Conference on Intelligent User Interfaces (IUI-05)
, pp. 43-50
-
-
Chai, J.Y.1
Prasov, Z.2
Blaim, J.3
Jin, R.4
-
6
-
-
85149108619
-
Optimization in multimodal interpretation
-
Barcelona, Spain
-
Chai, J. Y., Hong, P., Zhou, M. X., and Prasov, Z. 2004b. Optimization in Multimodal Interpretation. In Proceedings of ACL, pp. 1-8, Barcelona, Spain.
-
(2004)
Proceedings of ACL
, pp. 1-8
-
-
Chai, J.Y.1
Hong, P.2
Zhou, M.X.3
Prasov, Z.4
-
7
-
-
18744368762
-
A probabilistic approach to reference resolution in multimodal user interfaces
-
IUI 04: 2004 International Conference on Intelligent User Interfaces
-
Chai, J. Y., Hong, P., and Zhou, M. 2004a. A Probabilistic Approach to Reference Resolution in Multimodal User Interfaces. Proceedings of 9th International Conference on Intelligent User Interfaces (IUI-04), pp. 70-77, Madeira, Portugal. (Pubitemid 40673704)
-
(2004)
International Conference on Intelligent User Interfaces, Proceedings IUI
, pp. 70-77
-
-
Chai, J.Y.1
Hong, P.2
Zhou, M.X.3
-
9
-
-
0031380441
-
Quickset: Multimodal interaction for distributed applications
-
Cohen, P., Johnston, M., McGee, D., Oviatt, S., Pittman, J., Smith, I., Chen, L., and Clow, J. 1996. Quickset: Multimodal Interaction for Distributed Applications. Proceedings of ACM Multimedia, 31-40.
-
(1996)
Proceedings of ACM Multimedia
, pp. 31-40
-
-
Cohen, P.1
Johnston, M.2
McGee, D.3
Oviatt, S.4
Pittman, J.5
Smith, I.6
Chen, L.7
Clow, J.8
-
12
-
-
0034776038
-
Gaze durations during speech reflect word selection and phonological encoding
-
DOI 10.1016/S0010-0277(01)00138-X, PII S001002770100138X
-
Griffin, Z. M. 2001. Gaze durations during speech reflect word selection and phonological encoding. Cognition 82, B1-B14. (Pubitemid 32979187)
-
(2001)
Cognition
, vol.82
, Issue.1
-
-
Griffin, Z.M.1
-
13
-
-
79956213344
-
Centering: A framework for modeling the local coherence of discourse
-
Grosz, B. J., Joshi, A. K., and Weinstein, S. 1995. Centering: A framework for modeling the local coherence of discourse. Computational Linguistics, 21(2).
-
(1995)
Computational Linguistics
, vol.21
, Issue.2
-
-
Grosz, B.J.1
Joshi, A.K.2
Weinstein, S.3
-
14
-
-
0000534475
-
-
Logic and Conversation. In Cole, P. and Morgan, J. eds. New York, New York: Academic Press
-
Grice, H. P. 1975. Logic and Conversation. In Cole, P., and Morgan, J., eds. Speech Acts. New York, New York: Academic Press. 41-58.
-
(1975)
Speech Acts
, pp. 41-58
-
-
Grice, H.P.1
-
15
-
-
85015757336
-
Cognitive status and the form of referring expressions in discourse
-
Gundel, J. K., Hedberg, N., and Zacharski, R. 1993. Cognitive Status and the Form of Referring Expressions in Discourse. Language 69(2):274-307.
-
(1993)
Language
, vol.69
, Issue.2
, pp. 274-307
-
-
Gundel, J.K.1
Hedberg, N.2
Zacharski, R.3
-
16
-
-
84871612068
-
The effectiveness of corpus-induced dependency grammars for post-processing speech
-
Harper, M., White, C., Wang, W., Johnson, M., and Helzerman, R. 2000. The Effectiveness of Corpus-Induced Dependency Grammars for Post-processing Speech. Proceedings of the North American Association for Computational Linguistics, 102-109.
-
(2000)
Proceedings of the North American Association for Computational Linguistics
, pp. 102-109
-
-
Harper, M.1
White, C.2
Wang, W.3
Johnson, M.4
Helzerman, R.5
-
18
-
-
84937291102
-
Automatic referent resolution of deictic and anaphoric expressions
-
Huls, C., Bos, E., and Claassen, W. 1995. Automatic Referent Resolution of Deictic and Anaphoric Expressions. Computational Linguistics, 21(1):59-79.
-
(1995)
Computational Linguistics
, vol.21
, Issue.1
, pp. 59-79
-
-
Huls, C.1
Bos, E.2
Claassen, W.3
-
19
-
-
0001882615
-
Self-organized language modeling for speech recognition
-
Waibel, A. and Lee, K. F. (Eds)
-
Jelinek, F. 1990. Self-organized language modeling for speech recognition. In Waibel, A. and Lee, K. F. (Eds). Readings in Speech Recognition, pp. 450-506.
-
(1990)
Readings in Speech Recognition
, pp. 450-506
-
-
Jelinek, F.1
-
21
-
-
85149125977
-
MATCH: An architecture for multimodal dialog systems
-
Philadelphia
-
Johnston, M., Bangalore, S., Visireddy, G., Stent, A., Ehlen, P., Walker, M., Whittaker, S., and Maloor, P. 2002. MATCH: An Architecture for Multimodal Dialog Systems. In Proceedings of the 40th ACL, Philadelphia, pp. 376-383.
-
(2002)
Proceedings of the 40th ACL
, pp. 376-383
-
-
Johnston, M.1
Bangalore, S.2
Visireddy, G.3
Stent, A.4
Ehlen, P.5
Walker, M.6
Whittaker, S.7
Maloor, P.8
-
22
-
-
0023312404
-
Estimation of probabilities from sparse data for the language model component of a speech recognizer
-
Katz, S. M. 1987. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Transactions on Acoustics, Speech, and Signal Processing, 35(3).
-
(1987)
IEEE Transactions on Acoustics, Speech, and Signal Processing
, vol.35
, Issue.3
-
-
Katz, S.M.1
-
23
-
-
85158093287
-
Cognitive status and form of reference in multimodal human-computer interaction
-
Kehler, A. 2000. Cognitive Status and Form of Reference in Multimodal Human-Computer Interaction, Proceedings of AAAI'01.
-
(2000)
Proceedings of AAAI'01
-
-
Kehler, A.1
-
24
-
-
85123963268
-
Improved clustering techniques for class-based statistical language modeling
-
Kneser, R. and Ney, H. 1993. Improved clustering techniques for class-based statistical language modeling. In Eurospeech'93, pp. 973-976.
-
(1993)
Eurospeech'93
, pp. 973-976
-
-
Kneser, R.1
Ney, H.2
-
25
-
-
3843116149
-
Visual salience and perceptual grouping in multimodal interactivity
-
Verona, Italy
-
Landragin, F., Bellalem, N., and Romary, L. 2001. Visual Salience and Perceptual Grouping in Multimodal Interactivity. In: First International Workshop on Information Presentation and Natural Multimodal Dialogue, Verona, Italy, pp. 151-155.
-
(2001)
First International Workshop on Information Presentation and Natural Multimodal Dialogue
, pp. 151-155
-
-
Landragin, F.1
Bellalem, N.2
Romary, L.3
-
26
-
-
30844441070
-
An algorithm for pronominal anaphora resolution
-
Lappin, S., and Leass, H. 1994. An algorithm for pronominal anaphora resolution. Computational Linguistics, 20(4).
-
(1994)
Computational Linguistics
, vol.20
, Issue.4
-
-
Lappin, S.1
Leass, H.2
-
27
-
-
0032684957
-
Mutual disambiguation of recognition errors in a multimodal architecture
-
Oviatt, S. 1999. Mutual Disambiguation of Recognition Errors in a Multimodal Architecture. In Proceedings of CHI.
-
(1999)
Proceedings of CHI
-
-
Oviatt, S.1
-
28
-
-
4243107294
-
Multimodal conversational systems for automobiles
-
Pieraccini, R., Dayandhi, K., Bloom, J., Dahan, J.-G., Phillips, M., Goodman, B. R., and Prasad, K. V. 2004. Multimodal Conversational Systems for Automobiles. Communications of the ACM, 47(1):47-49.
-
(2004)
Communications of the ACM
, vol.47
, Issue.1
, pp. 47-49
-
-
Pieraccini, R.1
Dayandhi, K.2
Bloom, J.3
Dahan, J.-G.4
Phillips, M.5
Goodman, B.R.6
Prasad, K.V.7
-
29
-
-
10844243736
-
Towards situated speech understanding: Visual context priming of language models
-
Roy, D. and Mukherjee, N. 2005. Towards Situated Speech Understanding: Visual Context Priming of Language Models. Computer Speech and Language, 19(2): 227-248.
-
(2005)
Computer Speech and Language
, vol.19
, Issue.2
, pp. 227-248
-
-
Roy, D.1
Mukherjee, N.2
-
30
-
-
80053228383
-
A class-based language model for large vocabulary speech recognition extracted from part-of-speech statistics
-
Samuelsson, C. and Reichl, W. 1999. A class-based Language Model for Large Vocabulary Speech Recognition Extracted from Part-of-Speech Statistics. In IEEE ICASSP'99.
-
(1999)
IEEE ICASSP'99
-
-
Samuelsson, C.1
Reichl, W.2
-
31
-
-
27344451549
-
Using model-theoretic semantic interpretation to guide statistical parsing and word recognition in a spoken language interface
-
Sapporo, Japan
-
Schuler, W. 2003. Using model-theoretic semantic interpretation to guide statistical parsing and word recognition in a spoken language interface. In Proceedings of ACL, Sapporo, Japan.
-
(2003)
Proceedings of ACL
-
-
Schuler, W.1
-
33
-
-
4544358964
-
The superARV language model: Investigating the effectiveness of tightly integrating multiple knowledge sources
-
Wang, W. and Harper, M. 2002. The superARV language model: Investigating the effectiveness of tightly integrating multiple knowledge sources. In Proceedings of EMNLP, 238-247.
-
(2002)
Proceedings of EMNLP
, pp. 238-247
-
-
Wang, W.1
Harper, M.2
-
34
-
-
0026372948
-
Integration of speech recognition and natural language processing in the MIT voyager system
-
Zue, V., Glass, J., Goodine, D., Leung, H., Phillips, M., Polifroni, J., and Seneff, S. 1991. Integration of Speech Recognition and Natural Language Processing in the MIT Voyager System. Proceedings of the ICASSP.
-
(1991)
Proceedings of the ICASSP
-
-
Zue, V.1
Glass, J.2
Goodine, D.3
Leung, H.4
Phillips, M.5
Polifroni, J.6
Seneff, S.7