SCOPUS Information Search Platform

ACM International Conference Proceeding Series

Volume , Issue , 2008, Pages 9-16

Interactive visualisation techniques for dynamic speech transcription, correction and training

(3) Luz, Saturnino (a); Masoodian, Masood (b); Rogers, Bill (b)

a TRINITY COLLEGE DUBLIN (Ireland)

b UNIVERSITY OF WAIKATO (New Zealand)

Author keywords

Automatic Speech Transcription; Error correction; Semi-automatic Speech Transcription; Speech Recogniser Training

Indexed keywords

AUTOMATIC SPEECH RECOGNITION SYSTEM; AUTOMATIC SPEECH TRANSCRIPTION; CONTEXTUAL INFORMATION; EDITING SYSTEMS; IN-CORE; NEW MECHANISMS; PERFORMANCE GAIN; SEMI-AUTOMATIC SPEECH TRANSCRIPTION; SPEECH RECOGNITION TECHNOLOGY; SPEECH TRANSCRIPTIONS; SPONTANEOUS SPEECH; USER FEEDBACK; USER INTERACTION; USER INTERFACE DESIGNS; VISUALISATION; WORD ERROR RATE;

ERROR CORRECTION; HUMAN COMPUTER INTERACTION; INFORMATION USE; KNOWLEDGE MANAGEMENT; SPEECH TRANSMISSION; TRANSCRIPTION; USER INTERFACES; VISUALIZATION;

SPEECH RECOGNITION;

EID: 70349098534; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: 10.1145/1496976.1496978; Document Type: Conference Paper

Times cited : (4)

References (25)

1
- 38249011582
- Feedback strategies for error correction in speech recognition systems
- June
- W. A. Ainsworth and S. R. Pratt. Feedback strategies for error correction in speech recognition systems. International Journal of Man-Machine Studies, 36(6):833-842, June 1992.
- (1992) International Journal of Man-Machine Studies , vol.36 , Issue.6 , pp. 833-842
- Ainsworth, W.A.¹ Pratt, S.R.²

2
- 0035148809
- Transcriber: Development and use of a tool for assisting speech corpora production
- C. Barras, E. Geoffrois, Z. Wu, and M. Liberman. Transcriber: development and use of a tool for assisting speech corpora production. Speech Communication, 33(1-2):5-22, 2001.
- (2001) Speech Communication , vol.33 , Issue.1-2 , pp. 5-22
- Barras, C.¹ Geoffrois, E.² Wu, Z.³ Liberman, M.⁴

3
- 70350697914
- PhD thesis, Trinity College, Dept of Computer Science
- M.-M. Bouamrane. Interaction-based Information Retrieval in Multimodal, Online, Artefact-Focused Meeting Recordings. PhD thesis, Trinity College, Dept of Computer Science, 2007.
- (2007) Interaction-based Information Retrieval in Multimodal, Online, Artefact-focused Meeting Recordings
- Bouamrane, M.-M.¹

4
- 33846977063
- Meeting browsing
- M.-M. Bouamrane and S. Luz. Meeting browsing. Multimedia Systems, 12(4-5):439-457, 2007.
- (2007) Multimedia Systems , vol.12 , Issue.4-5 , pp. 439-457
- Bouamrane, M.-M.¹ Luz, S.²

5
- 33746221185
- History based visual mining of semi-structured audio and text
- pages 360-363, Beijing, China, Jan. IEEE Press
- M.-M. Bouamrane, S. Luz, and M. Masoodian. History based visual mining of semi-structured audio and text. In Proceedings of the 12th International Multi-media Modelling Conference, MMM06, pages 360-363, Beijing, China, Jan. 2006. IEEE Press.
- (2006) Proceedings of the 12th International Multi-media Modelling Conference, MMM06
- Bouamrane, M.-M.¹ Luz, S.² Masoodian, M.³

6
- 1542468335
- Speech and language processing for multimodal human-computer interaction
- L. Deng, Y. Wang, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, M. Mahajan, and X. D. Huang. Speech and language processing for multimodal human-computer interaction. Journal VLSI Signal Processing Systems, 36(2/3):161-187, 2004.
- (2004) Journal VLSI Signal Processing Systems , vol.36 , Issue.2-3 , pp. 161-187
- Deng, L.¹ Wang, Y.² Wang, K.³ Acero, A.⁴ Hon, H.⁵ Droppo, J.⁶ Boulis, C.⁷ Mahajan, M.⁸ Huang, X.D.⁹

7
- 0032646977
- An overview of audio information retrieval
- J. Foote. An overview of audio information retrieval. Multimedia Systems, 7(1):2-10, 1999.
- (1999) Multimedia Systems , vol.7 , Issue.1 , pp. 2-10
- Foote, J.¹

8
- 33846950613
- Accessing the spoken word
- J. Goldman, S. Renals, S. Bird, F. de Jong, M. Federico, C. Fleischhauer, M. Kornbluh, L. Lamel, D. Oard, C. Stewart, and R. Wright. Accessing the spoken word. International Journal of Digital Libraries, 5(4):287-298, 2005.
- (2005) International Journal of Digital Libraries , vol.5 , Issue.4 , pp. 287-298
- Goldman, J.¹ Renals, S.² Bird, S.³ De Jong, F.⁴ Federico, M.⁵ Fleischhauer, C.⁶ Kornbluh, M.⁷ Lamel, L.⁸ Oard, D.⁹ Stewart, C.¹⁰ Wright, R.¹¹

9
- 0001292643
- The beauty of errors: Patterns of error correction in desktop speech systems
- pages 133-140
- Halverson, C. A., Horn, D. B., Karat, C.-M., and J. Karat. The beauty of errors: Patterns of error correction in desktop speech systems. In Proceedings of INTERACT'99: Human-Computer Interaction, pages 133-140, 1999.
- (1999) Proceedings of INTERACT'99: Human-computer Interaction
- Halverson, C.A.¹ Horn, D.B.² Karat, C.-M.³ Karat, J.⁴

10
- 34547521678
- Automatic alignment and error correction of human generated transcripts for long speech recordings
- pages 1606-1609, Pittsburgh, Pennsylvania
- T. Hazen. Automatic alignment and error correction of human generated transcripts for long speech recordings. In Proceedings of Interspeech'06, pages 1606-1609, Pittsburgh, Pennsylvania, 2006.
- (2006) Proceedings of Interspeech'06
- Hazen, T.¹

11
- 0032652962
- Patterns of entry and correction in large vocabulary continuous speech recognition systems
- pages 568-575. ACM Press
- C.-M. Karat, C. Halverson, D. Horn, and J. Karat. Patterns of entry and correction in large vocabulary continuous speech recognition systems. In CHI '99: Proceedings of the SIGCHI conference on Human factors in computing systems, pages 568-575. ACM Press, 1999.
- (1999) CHI '99: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
- Karat, C.-M.¹ Halverson, C.² Horn, D.³ Karat, J.⁴

12
- 57349132532
- A system for dynamic 3d visualisation of speech recognition paths
- pages 482-483. ACM Press
- S. Luz, M. Masoodian, B. Rogers, and B. Zhang. A system for dynamic 3d visualisation of speech recognition paths. In Proceedings of Advanced Visual Interfaces AVI'08, pages 482-483. ACM Press, 2008.
- (2008) Proceedings of Advanced Visual Interfaces AVI'08
- Luz, S.¹ Masoodian, M.² Rogers, B.³ Zhang, B.⁴

13
- 70350682102
- Improving automatic speech transcription for multimedia content
- P. Isaias and M. B. Nunes, editors, pages 145-152, Vila Real
- M. Masoodian, B. Rogers, and S. Luz. Improving automatic speech transcription for multimedia content. In P. Isaias and M. B. Nunes, editors, Proceedings of WWW/Internet '07, pages 145-152, Vila Real, 2007.
- (2007) Proceedings of WWW/Internet '07
- Masoodian, M.¹ Rogers, B.² Luz, S.³

14
- 34047264991
- TRAED: Speech audio editing using imperfect transcripts
- pages 454-259, Beijing, China. IEEE Computer Society
- M. Masoodian, B. Rogers, D. Ware, and S. McKoy. TRAED: Speech audio editing using imperfect transcripts. In 12th International Conference on Multi-Media Modeling (MMM 2006), pages 454-259, Beijing, China, 2006. IEEE Computer Society.
- (2006) 12th International Conference on Multi-Media Modeling (MMM 2006)
- Masoodian, M.¹ Rogers, B.² Ware, D.³ McKoy, S.⁴

15
- 70349123611
- Towards an efficient archive of spontaneous speech: Design of computer-assisted speech transcription system
- H. Nanjo and T. Kawahara. Towards an efficient archive of spontaneous speech: Design of computer-assisted speech transcription system. The Journal of the Acoustical Society of America, 120:3042, 2006.
- (2006) The Journal of the Acoustical Society of America , vol.120 , pp. 3042
- Nanjo, H.¹ Kawahara, T.²

16
- 70350643936
- NIST Automatic Meeting Transcription, Data Collection and Annotation Workshop, 2001.
- (2001) NIST Automatic Meeting Transcription, Data Collection and Annotation Workshop

17
- 0004244302
- Prentice Hall
- L. Rabiner and B.-H. Juang. Fundamentals of speech recognition. Prentice Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

18
- 33748596822
- Automatic speech recognition for generalised time based media retrieval and indexing
- pages 241-246, New York, NY, US. ACM Press
- J. Robertson, W. Y. Wong, C. Chung, and D. K. Kim. Automatic speech recognition for generalised time based media retrieval and indexing. In Proceedings of the sixth ACM international conference on Multimedia, MULTIMEDIA '98, pages 241-246, New York, NY, US, 1998. ACM Press.
- (1998) Proceedings of the Sixth ACM International Conference on Multimedia, MULTIMEDIA '98
- Robertson, J.¹ Wong, W.Y.² Chung, C.³ Kim, D.K.⁴

19
- 0010250404
- Productivity, satisfaction, and interaction strategies of individuals with spinal cord injuries and traditional users interacting with speech recognition software
- A. Sears, C. Karat, K. Oseitutu, A. Karimullah, and J. Feng. Productivity, satisfaction, and interaction strategies of individuals with spinal cord injuries and traditional users interacting with speech recognition software. Universal Access in the Information Society, 1(1):4-15, 2001.
- (2001) Universal Access in the Information Society , vol.1 , Issue.1 , pp. 4-15
- Sears, A.¹ Karat, C.² Oseitutu, K.³ Karimullah, A.⁴ Feng, J.⁵

20
- 85009262210
- Multimodal error correction for speech user interfaces
- B. Suhm, B. Myers, and A. Waibel. Multimodal error correction for speech user interfaces. ACM Trans. Comput.-Hum. Interact, 8(1):60-98, 2001.
- (2001) ACM Trans. Comput.-hum. Interact , vol.8 , Issue.1 , pp. 60-98
- Suhm, B.¹ Myers, B.² Waibel, A.³

21
- 70350674408
- University of Maryland. NIST
- University of Maryland. Proceedings of the 2000 Speech Transcription Workshop. NIST, 2000.
- Proceedings of the 2000 Speech Transcription Workshop , vol.2000

22
- 0034842455
- Advances in automatic meeting record creation and access
- pages 597-600. IEEE Press
- A. Waibel, M. Brett, F. Metze, K. Ries, T. Schaaf, T. Schultz, H. Soltau, H. Yu, and K. Zechner. Advances in automatic meeting record creation and access. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, volume 1, pages 597-600. IEEE Press, 2001.
- (2001) Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing , vol.1
- Waibel, A.¹ Brett, M.² Metze, F.³ Ries, K.⁴ Schaaf, T.⁵ Schultz, T.⁶ Soltau, H.⁷ Yu, H.⁸ Zechner, K.⁹

23
- 0042033109
- Morgan Kaufmann
- A. Waibel and K. Lee. Readings in Speech Recognition. Morgan Kaufmann, 1990.
- (1990) Readings in Speech Recognition
- Waibel, A.¹ Lee, K.²

24
- 24144440949
- Browsing recorded meetings with Ferret
- S. Bengio and H. Bourlard, editors, pages 12-21, Martigny, Switzerland, June. Springer-Verlag GmbH
- P. Wellner, M. Flynn, and M. Guillemot. Browsing recorded meetings with Ferret. In S. Bengio and H. Bourlard, editors, Proceedings of Machine Learning for Multimodal Interaction: First International Workshop, MLMI 2004, volume 3361, pages 12-21, Martigny, Switzerland, June 2004. Springer-Verlag GmbH.
- (2004) Proceedings of Machine Learning for Multimodal Interaction: First International Workshop, MLMI 2004 , vol.3361
- Wellner, P.¹ Flynn, M.² Guillemot, M.³

25
- 0037480836
- Scanmail: A voicemail interface that makes speech browsable, readable and searchable
- pages 275-282, New York, NY, US. ACM Press
- S. Whittaker, J. Hirschberg, B. Amento, L. Stark, M. Bacchiani, P. Isenhour, L. Stead, G. Zamchick, and A. Rosenberg. Scanmail: a voicemail interface that makes speech browsable, readable and searchable. In Proceedings of the SIGCHI conference on Human factors in computing systems, CHI '02, pages 275-282, New York, NY, US, 2002. ACM Press.
- (2002) Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI '02
- Whittaker, S.¹ Hirschberg, J.² Amento, B.³ Stark, L.⁴ Bacchiani, M.⁵ Isenhour, P.⁶ Stead, L.⁷ Zamchick, G.⁸ Rosenberg, A.⁹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.