-
1
-
-
38249011582
-
Feedback strategies for error correction in speech recognition systems
-
June
-
W. A. Ainsworth and S. R. Pratt. Feedback strategies for error correction in speech recognition systems. International Journal of Man-Machine Studies, 36(6):833-842, June 1992.
-
(1992)
International Journal of Man-Machine Studies
, vol.36
, Issue.6
, pp. 833-842
-
-
Ainsworth, W.A.1
Pratt, S.R.2
-
2
-
-
0035148809
-
Transcriber: Development and use of a tool for assisting speech corpora production
-
C. Barras, E. Geoffrois, Z. Wu, and M. Liberman. Transcriber: development and use of a tool for assisting speech corpora production. Speech Communication, 33(1-2):5-22, 2001.
-
(2001)
Speech Communication
, vol.33
, Issue.1-2
, pp. 5-22
-
-
Barras, C.1
Geoffrois, E.2
Wu, Z.3
Liberman, M.4
-
3
-
-
70350697914
-
-
PhD thesis, Trinity College, Dept of Computer Science
-
M.-M. Bouamrane. Interaction-based Information Retrieval in Multimodal, Online, Artefact-Focused Meeting Recordings. PhD thesis, Trinity College, Dept of Computer Science, 2007.
-
(2007)
Interaction-based Information Retrieval in Multimodal, Online, Artefact-focused Meeting Recordings
-
-
Bouamrane, M.-M.1
-
5
-
-
33746221185
-
History based visual mining of semi-structured audio and text
-
pages 360-363, Beijing, China, Jan. IEEE Press
-
M.-M. Bouamrane, S. Luz, and M. Masoodian. History based visual mining of semi-structured audio and text. In Proceedings of the 12th International Multi-media Modelling Conference, MMM06, pages 360-363, Beijing, China, Jan. 2006. IEEE Press.
-
(2006)
Proceedings of the 12th International Multi-media Modelling Conference, MMM06
-
-
Bouamrane, M.-M.1
Luz, S.2
Masoodian, M.3
-
6
-
-
1542468335
-
Speech and language processing for multimodal human-computer interaction
-
L. Deng, Y. Wang, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, M. Mahajan, and X. D. Huang. Speech and language processing for multimodal human-computer interaction. Journal VLSI Signal Processing Systems, 36(2/3):161-187, 2004.
-
(2004)
Journal VLSI Signal Processing Systems
, vol.36
, Issue.2-3
, pp. 161-187
-
-
Deng, L.1
Wang, Y.2
Wang, K.3
Acero, A.4
Hon, H.5
Droppo, J.6
Boulis, C.7
Mahajan, M.8
Huang, X.D.9
-
7
-
-
0032646977
-
An overview of audio information retrieval
-
J. Foote. An overview of audio information retrieval. Multimedia Systems, 7(1):2-10, 1999.
-
(1999)
Multimedia Systems
, vol.7
, Issue.1
, pp. 2-10
-
-
Foote, J.1
-
8
-
-
33846950613
-
Accessing the spoken word
-
J. Goldman, S. Renals, S. Bird, F. de Jong, M. Federico, C. Fleischhauer, M. Kornbluh, L. Lamel, D. Oard, C. Stewart, and R. Wright. Accessing the spoken word. International Journal of Digital Libraries, 5(4):287-298, 2005.
-
(2005)
International Journal of Digital Libraries
, vol.5
, Issue.4
, pp. 287-298
-
-
Goldman, J.1
Renals, S.2
Bird, S.3
De Jong, F.4
Federico, M.5
Fleischhauer, C.6
Kornbluh, M.7
Lamel, L.8
Oard, D.9
Stewart, C.10
Wright, R.11
-
9
-
-
0001292643
-
The beauty of errors: Patterns of error correction in desktop speech systems
-
pages 133-140
-
Halverson, C. A., Horn, D. B., Karat, C.-M., and J. Karat. The beauty of errors: Patterns of error correction in desktop speech systems. In Proceedings of INTERACT'99: Human-Computer Interaction, pages 133-140, 1999.
-
(1999)
Proceedings of INTERACT'99: Human-computer Interaction
-
-
Halverson, C.A.1
Horn, D.B.2
Karat, C.-M.3
Karat, J.4
-
10
-
-
34547521678
-
Automatic alignment and error correction of human generated transcripts for long speech recordings
-
pages 1606-1609, Pittsburgh, Pennsylvania
-
T. Hazen. Automatic alignment and error correction of human generated transcripts for long speech recordings. In Procedings of Inter speech'06, pages 1606-1609, Pittsburgh, Pennsylvania, 2006.
-
(2006)
Procedings of Inter speech'06
-
-
Hazen, T.1
-
13
-
-
70350682102
-
Improving automatic speech transcription for multimedia content
-
P. Isaias and M. B. Nunes, editors, pages 145-152, Vila Real
-
M. Masoodian, B. Rogers, and S. Luz. Improving automatic speech transcription for multimedia content. In P. Isaias and M. B. Nunes, editors, Proceedings of WWW/Internet '07, pages 145-152, Vila Real, 2007.
-
(2007)
Proceedings of WWW/Internet '07
-
-
Masoodian, M.1
Rogers, B.2
Luz, S.3
-
14
-
-
34047264991
-
TRAED: Speech audio editing using imperfect transcripts
-
pages 454-259,Beijing, China. IEEE Computer Society
-
M. Masoodian, B. Rogers, D. Ware, and S. McKoy. TRAED: Speech audio editing using imperfect transcripts. In 12th International Conference on Multi-Media Modeling (MMM 2006), pages 454-259,Beijing, China, 2006. IEEE Computer Society.
-
(2006)
12th International Conference on Multi-Media Modeling (MMM 2006)
-
-
Masoodian, M.1
Rogers, B.2
Ware, D.3
McKoy, S.4
-
15
-
-
70349123611
-
Towards an efficient archive of spontaneous speech: Design of computer-assisted speech transcription system
-
H. Nanjo and T. Kawahara. Towards an efficient archive of spontaneous speech: Design of computer-assisted speech transcription system. The Journal of the Acoustical Society of America, 120:3042, 2006.
-
(2006)
The Journal of the Acoustical Society of America
, vol.120
, pp. 3042
-
-
Nanjo, H.1
Kawahara, T.2
-
18
-
-
33748596822
-
Automatic speech recognition for generalised time based media retrieval and indexing
-
pages 241-246, New York, NY, US. ACM Press
-
J. Robertson, W. Y. Wong, C. Chung, and D. K. Kim. Automatic speech recognition for generalised time based media retrieval and indexing. In Proceedings of the sixth ACM international conference on Multimedia, MULTIMEDIA '98, pages 241-246, New York, NY, US, 1998. ACM Press.
-
(1998)
Proceedings of the Sixth ACM International Conference on Multimedia, MULTIMEDIA '98
-
-
Robertson, J.1
Wong, W.Y.2
Chung, C.3
Kim, D.K.4
-
19
-
-
0010250404
-
Productivity satisfaction, and interaction strategies of individuals with spinal cord injuries and traditional users interacting with speech recognition software
-
A. Sears, C. Karat, K. Oseitutu, A. Karimullah, and J. Feng. Productivity, satisfaction, and interaction strategies of individuals with spinal cord injuries and traditional users interacting with speech recognition software. Universal Access in the Information Society, 1(1):4-15, 2001.
-
(2001)
Universal Access in the Information Society
, vol.1
, Issue.1
, pp. 4-15
-
-
Sears, A.1
Karat, C.2
Oseitutu, K.3
Karimullah, A.4
Feng, J.5
-
20
-
-
85009262210
-
Multimodal error correction for speech user interfaces
-
B. Suhm, B. Myers, and A. Waibel. Multimodal error correction for speech user interfaces. ACM Trans. Comput.-Hum. Interact, 8(1):60-98, 2001.
-
(2001)
ACM Trans. Comput.-hum. Interact
, vol.8
, Issue.1
, pp. 60-98
-
-
Suhm, B.1
Myers, B.2
Waibel, A.3
-
22
-
-
0034842455
-
Advances in automatic meeting record creation and access
-
pages 597-600. IEEE Press
-
A. Waibel, M. Brett, F. Metze, K. Ries, T. Schaaf, T. Schultz, H. Soltau, H. Yu, and K. Zechner. Advances in automatic meeting record creation and access. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume 1, pages 597-600. IEEE Press, 2001.
-
(2001)
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
, vol.1
-
-
Waibel, A.1
Brett, M.2
Metze, F.3
Ries, K.4
Schaaf, T.5
Schultz, T.6
Soltau, H.7
Yu, H.8
Zechner, K.9
-
24
-
-
24144440949
-
Browsing recorded meetings with Ferret
-
S. Bengio and H. Bourlard, editors, pages 12-21, Martigny, Switzerland, June. Springer-Verlag GmbH
-
P. Wellner, M. Flynn, and M. Guillemot. Browsing recorded meetings with Ferret. In S. Bengio and H. Bourlard, editors, Proceedings of Machine Learning for Multimodal Interaction: First International Workshop, MLMI 2004, volume 3361, pages 12-21, Martigny, Switzerland, June 2004. Springer-Verlag GmbH.
-
(2004)
Proceedings of Machine Learning for Multimodal Interaction: First International Workshop, MLMI 2004
, vol.3361
-
-
Wellner, P.1
Flynn, M.2
Guillemot, M.3
-
25
-
-
0037480836
-
Scanmail: A voicemail interface that makes speech browsable, readable and searchable
-
pages 275-282, New York, NY, US. ACM Press
-
S. Whittaker, J. Hirschberg, B. Amento, L. Stark, M. Bacchiani, P. Isenhour, L. Stead, G. Zamchick, and A. Rosenberg. Scanmail: a voicemail interface that makes speech browsable, readable and searchable. In Proceedings of the SIGCHI conference on Human factors in computing systems,CHI '02, pages 275-282, New York, NY, US, 2002. ACM Press.
-
(2002)
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems,CHI '02
-
-
Whittaker, S.1
Hirschberg, J.2
Amento, B.3
Stark, L.4
Bacchiani, M.5
Isenhour, P.6
Stead, L.7
Zamchick, G.8
Rosenberg, A.9
|