메뉴 건너뛰기




Volumn 22, Issue 4, 2006, Pages 237-270

Discovering cues to error detection in speech recognition output: A user-centered approach

Author keywords

Cues to error detection; Speech recognition; Taxonomy; Verbal protocol analysis

Indexed keywords

CLASSIFICATION (OF INFORMATION); ERROR DETECTION; HEURISTIC PROGRAMMING; HUMAN COMPUTER INTERACTION; MANAGEMENT INFORMATION SYSTEMS; NETWORK PROTOCOLS;

EID: 33748611913     PISSN: 07421222     EISSN: None     Source Type: Journal    
DOI: 10.2753/MIS0742-1222220409     Document Type: Article
Times cited : (8)

References (51)
  • 3
    • 0036355609 scopus 로고    scopus 로고
    • Speech recognition in university classrooms: Liberated learning project
    • In V.L. Hanson and J.A. Jacko (eds.), New York: ACM Press
    • Bain, K.; Basson, S.H.; and Wald, M. Speech recognition in university classrooms: Liberated learning project. In V.L. Hanson and J.A. Jacko (eds.), Proceedings of the Fifth International ACM Conference on Assistive Technologies. New York: ACM Press, 2002, pp. 192-196.
    • (2002) Proceedings of the Fifth International ACM Conference on Assistive Technologies , pp. 192-196
    • Bain, K.1    Basson, S.H.2    Wald, M.3
  • 4
    • 85149143404 scopus 로고    scopus 로고
    • Beyond n-grams: Can linguistic sophistication improve language modeling?
    • In C. Boitet and P. Whitelock (eds.), Morristown, NJ: Association for Computational Linguistics
    • Brill, E.; Florian, R.; Henderson, J.C.; and Mangu, L. Beyond n-grams: Can linguistic sophistication improve language modeling? In C. Boitet and P. Whitelock (eds.), Proceedings of the Thirty-Sixth Annual Meeting on Association for Computational Linguistics. Morristown, NJ: Association for Computational Linguistics, 1998, pp. 186-190.
    • (1998) Proceedings of the Thirty-Sixth Annual Meeting on Association for Computational Linguistics , pp. 186-190
    • Brill, E.1    Florian, R.2    Henderson, J.C.3    Mangu, L.4
  • 6
    • 0003987751 scopus 로고    scopus 로고
    • Ph.D. dissertation, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA
    • Chase, L. Error-Responsive Feedback Mechanisms for Speech Recognizers. Ph.D. dissertation, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 1997.
    • (1997) Error-Responsive Feedback Mechanisms for Speech Recognizers
    • Chase, L.1
  • 7
    • 85135168075 scopus 로고    scopus 로고
    • Word and acoustic confidence annotation for large vocabulary speech recognition
    • In G. Kokkinakis, N. Fakotakis, and E. Dermatas (eds.), Bonn, Germany: International Speech Communication Association
    • Chase, L. Word and acoustic confidence annotation for large vocabulary speech recognition. In G. Kokkinakis, N. Fakotakis, and E. Dermatas (eds.), Proceedings of the Fifth European Conference on Speech Communication and Technology. Bonn, Germany: International Speech Communication Association, 1997, pp. 815-818.
    • (1997) Proceedings of the Fifth European Conference on Speech Communication and Technology , pp. 815-818
    • Chase, L.1
  • 8
    • 4243109553 scopus 로고    scopus 로고
    • Challenges in adopting speech recognition
    • (January)
    • Deng, L., and Huang, X. Challenges in adopting speech recognition. Communications of the ACM, 47, 1 (January 2004), 69-75.
    • (2004) Communications of the ACM , vol.47 , Issue.1 , pp. 69-75
    • Deng, L.1    Huang, X.2
  • 9
    • 17444437850 scopus 로고    scopus 로고
    • Confidence scoring based on backward language models
    • In F.J. Taylor, J. Principe, and H. Bourlard (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Duchateau, J.; Demuynck, K.; and Wambacq, P. Confidence scoring based on backward language models. In F.J. Taylor, J. Principe, and H. Bourlard (eds.), 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1. Los Alamitos, CA: IEEE Computer Society Press, 2002, pp. 221-224.
    • (2002) 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 221-224
    • Duchateau, J.1    Demuynck, K.2    Wambacq, P.3
  • 10
    • 0343966008 scopus 로고
    • Natural language access to multiple databases: A model and a prototype
    • (Summer)
    • Ein-Dor, P., and Spiegter, I. Natural language access to multiple databases: A model and a prototype. Journal of Management Information Systems, 12, 1 (Summer 1995), 171-197.
    • (1995) Journal of Management Information Systems , vol.12 , Issue.1 , pp. 171-197
    • Ein-Dor, P.1    Spiegler, I.2
  • 12
    • 10844260787 scopus 로고    scopus 로고
    • Using confidence scores to improve hands-free speech based navigation in continuous dictation systems
    • (December)
    • Feng, J., and Sears, A. Using confidence scores to improve hands-free speech based navigation in continuous dictation systems. ACM Transactions on Computer-Human Interaction, 11, 4 (December 2004), 329-356.
    • (2004) ACM Transactions on Computer-Human Interaction , vol.11 , Issue.4 , pp. 329-356
    • Feng, J.1    Sears, A.2
  • 13
    • 33746261097 scopus 로고    scopus 로고
    • Automatic speech recognition and its application to information extraction
    • In R. Dale and K. Church (eds.), Morristown, NJ: Association for Computational Linguistics
    • Furui, S. Automatic speech recognition and its application to information extraction. In R. Dale and K. Church (eds.), Proceedings of the Thirty-Seventh Annual Meeting of the Association for Computational Linguistics. Morristown, NJ: Association for Computational Linguistics, 1999, pp. 11-20.
    • (1999) Proceedings of the Thirty-Seventh Annual Meeting of the Association for Computational Linguistics , pp. 11-20
    • Furui, S.1
  • 14
    • 33645586060 scopus 로고    scopus 로고
    • Large vocabulary speech recognition based on statistical methods
    • In W. Chou and B.H. Juang (eds.), Boca Raton, FL: CRC Press
    • Gauvain, J.-L., and Lamel, L. Large vocabulary speech recognition based on statistical methods. In W. Chou and B.H. Juang (eds.), Pattern Recognition in Speech and Language Processing. Boca Raton, FL: CRC Press, 2003, pp. 149-189.
    • (2003) Pattern Recognition in Speech and Language Processing , pp. 149-189
    • Gauvain, J.-L.1    Lamel, L.2
  • 15
    • 0030648371 scopus 로고    scopus 로고
    • A probabilistic approach to confidence estimation and evaluation
    • In M.K. Lang and H. Hoge (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Gillick, L.; Ito, Y.; and Young, J. A probabilistic approach to confidence estimation and evaluation. In M.K. Lang and H. Hoge (eds.), 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2. Los Alamitos, CA: IEEE Computer Society Press, 1997, pp. 879-882.
    • (1997) 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.2 , pp. 879-882
    • Gillick, L.1    Ito, Y.2    Young, J.3
  • 17
    • 0033708811 scopus 로고    scopus 로고
    • Contextual confidence measures for continuous speech recognition
    • In H. Abut and L. Onural (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Hernandez-Abrego, G., and Marino, J.B. Contextual confidence measures for continuous speech recognition. In H. Abut and L. Onural (eds.), 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3. Los Alamitos, CA: IEEE Computer Society Press, 2000, pp. 1803-1806.
    • (2000) 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.3 , pp. 1803-1806
    • Hernandez-Abrego, G.1    Marino, J.B.2
  • 18
    • 0346403090 scopus 로고    scopus 로고
    • Speaking to read: The effects of continuous vs. discrete speech recognition systems on the reading and spelling of children with learning disabilities
    • (Winter) (available at jset.unlv.edu/15.1/higgins/first.html)
    • Higgins, E.L., and Raskind, M.H. Speaking to read: The effects of continuous vs. discrete speech recognition systems on the reading and spelling of children with learning disabilities. Journal of Special Education Technology, 15, 1 (Winter 2000) (available at jset.unlv.edu/ 15.1/higgins/first.html).
    • (2000) Journal of Special Education Technology , vol.15 , Issue.1
    • Higgins, E.L.1    Raskind, M.H.2
  • 19
    • 33748604996 scopus 로고    scopus 로고
    • Speech recognition powers utility's customer service
    • September 12, (available at)
    • Hoffman, T. Speech recognition powers utility's customer service. ComputerWorld, September 12, 2005 (available at www.computerworld.com/ managementtopics/management/helpdesk/story/0,10801,104535,00.html).
    • (2005) ComputerWorld
    • Hoffman, T.1
  • 20
    • 85135146711 scopus 로고    scopus 로고
    • Estimating confidence using word lattices
    • In G. Kokkinakis, N. Fakotakis, and E. Dermatas (eds.), Bonn, Germany: International Speech Communication Association
    • Kemp, T., and Schaaf, T. Estimating confidence using word lattices. In G. Kokkinakis, N. Fakotakis, and E. Dermatas (eds.), Proceedings of the Fifth European Conference on Speech Communication and Technology. Bonn, Germany: International Speech Communication Association, 1997, pp. 827-830.
    • (1997) Proceedings of the Fifth European Conference on Speech Communication and Technology , pp. 827-830
    • Kemp, T.1    Schaaf, T.2
  • 22
  • 23
    • 0002039535 scopus 로고
    • Diagnosing the human threats to information technology implementation: A missing factor in systems analysis illustrated in a case study
    • (Fall)
    • Levine, H.G., and Rossmoore, D. Diagnosing the human threats to information technology implementation: A missing factor in systems analysis illustrated in a case study. Journal of Management Information Systems, 10, 2 (Fall 1993), 55-74.
    • (1993) Journal of Management Information Systems , vol.10 , Issue.2 , pp. 55-74
    • Levine, H.G.1    Rossmoore, D.2
  • 24
    • 33646809491 scopus 로고    scopus 로고
    • Ph.D. dissertation, School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN
    • Liu, Y. Structural Event Detection for Rich Transcription of Speech. Ph.D. dissertation, School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN, 2004.
    • (2004) Structural Event Detection for Rich Transcription of Speech
    • Liu, Y.1
  • 25
    • 33748604720 scopus 로고    scopus 로고
    • Speech recognition
    • SNOW, Toronto, ON, (available at snow.utoronto.ca/best/special/ speechrecognition.html)
    • Lubert, J.; Kotler, A.; Shein, F.; and Tam, C. Speech recognition. SNOW, Toronto, ON, 1998 (available at snow.utoronto.ca/best/special/ speechrecognition.html).
    • (1998)
    • Lubert, J.1    Kotler, A.2    Shein, F.3    Tam, C.4
  • 26
    • 0034843166 scopus 로고    scopus 로고
    • Robust confidence annotation and rejection for continuous speech recognition
    • In V.J. Mathews and A. Swindlehurst (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Maison, B., and Gopinath, R. Robust confidence annotation and rejection for continuous speech recognition. In V.J. Mathews and A. Swindlehurst (eds.), 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1. Los Alamitos, CA: IEEE Computer Society Press, 2001, pp. 389-392.
    • (2001) 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 389-392
    • Maison, B.1    Gopinath, R.2
  • 27
    • 0034843104 scopus 로고    scopus 로고
    • Error corrective mechanisms for speech recognition
    • In V.J. Mathews and A. Swindlehurst (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Mangu, L., and Padmanabhan, M. Error corrective mechanisms for speech recognition. In V.J. Mathews and A. Swindlehurst (eds.), 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1. Los Alamitos, CA: IEEE Computer Society Press, 2001, pp. 29-32.
    • (2001) 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 29-32
    • Mangu, L.1    Padmanabhan, M.2
  • 28
    • 0005495139 scopus 로고
    • Rhetorical structure theory: A theory of text organization
    • In L. Polanyi (ed.), Norwood, NJ: Ablex
    • Mann, W.C., and Thompson, S.A. Rhetorical structure theory: A theory of text organization. In L. Polanyi (ed.), The Structure of Discourse. Norwood, NJ: Ablex, 1987, pp. 85-96.
    • (1987) The Structure of Discourse , pp. 85-96
    • Mann, W.C.1    Thompson, S.A.2
  • 29
    • 0034434766 scopus 로고    scopus 로고
    • The use of explanations in knowledge-based systems: Cognitive perspectives and a process-tracing analysis
    • (Fall)
    • Mao, J.-Y., and Benbasat, I. The use of explanations in knowledge-based systems: Cognitive perspectives and a process-tracing analysis. Journal of Management Information Systems, 17, 2 (Fall 2000), 153-180.
    • (2000) Journal of Management Information Systems , vol.17 , Issue.2 , pp. 153-180
    • Mao, J.-Y.1    Benbasat, I.2
  • 30
    • 0345331026 scopus 로고    scopus 로고
    • Spoken dialogue technology: Enabling the conversational user interface
    • (March)
    • McTear, M.F. Spoken dialogue technology: Enabling the conversational user interface. ACM Computing Surveys, 34, 1 (March 2002), 90-169.
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 90-169
    • McTear, M.F.1
  • 32
    • 84861324825 scopus 로고    scopus 로고
    • Confidence scoring for speech understanding systems
    • In R.H. Mannell and J. Robert-Ribes (eds.), Canberra: Australian Speech Science and Technology Association
    • Pao, C.; Schmid, P.; and Glass, J. Confidence scoring for speech understanding systems. In R.H. Mannell and J. Robert-Ribes (eds.), Proceedings of the Fifth International Conference on Spoken Language Processing. Canberra: Australian Speech Science and Technology Association, 1998, pp. 815-818.
    • (1998) Proceedings of the Fifth International Conference on Spoken Language Processing , pp. 815-818
    • Pao, C.1    Schmid, P.2    Glass, J.3
  • 33
    • 17344395627 scopus 로고    scopus 로고
    • Estimating semantic confidence for spoken dialogue systems
    • In F.J. Taylor, J. Principe, and H. Bourlard (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Pradhan, S.S., and Ward, W.H. Estimating semantic confidence for spoken dialogue systems. In F.J. Taylor, J. Principe, and H. Bourlard (eds.), 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1. Los Alamitos, CA: IEEE Computer Society Press, 2002, pp. 233-236.
    • (2002) 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 233-236
    • Pradhan, S.S.1    Ward, W.H.2
  • 34
    • 0029725754 scopus 로고    scopus 로고
    • Error correction via a post-processor for continuous speech recognition
    • In M.H. Hayes and M.A. Clements (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Ringger, E.K., and Allen, J.F. Error correction via a post-processor for continuous speech recognition. In M.H. Hayes and M.A. Clements (eds.), 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1. Los Alamitos, CA: IEEE Computer Society Press, 1996, pp. 427-430.
    • (1996) 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 427-430
    • Ringger, E.K.1    Allen, J.F.2
  • 35
    • 33748596822 scopus 로고    scopus 로고
    • Automatic speech recognition for generalised time based media retrieval and indexing
    • In W. Effelsberg and B.C. Smith (eds.), New York: ACM Press
    • Robertson, J.; Wong, W.Y.; Chung, C.; and Kim, D.K. Automatic speech recognition for generalised time based media retrieval and indexing. In W. Effelsberg and B.C. Smith (eds.), Proceedings of the Sixth ACM International Conference on Multimedia. New York: ACM Press, 1998, pp. 241-246.
    • (1998) Proceedings of the Sixth ACM International Conference on Multimedia , pp. 241-246
    • Robertson, J.1    Wong, W.Y.2    Chung, C.3    Kim, D.K.4
  • 37
    • 0141590649 scopus 로고    scopus 로고
    • Word level confidence measurement using semantic features
    • In W. Siu, A.G. Constantinides, and Y. Chan (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Sarikaya, R.; Gao, Y.; and Picheny, M. Word level confidence measurement using semantic features. In W. Siu, A.G. Constantinides, and Y. Chan (eds.), 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1. Los Alamitos, CA: IEEE Computer Society Press, 2003, pp. 604-607.
    • (2003) 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 604-607
    • Sarikaya, R.1    Gao, Y.2    Picheny, M.3
  • 39
    • 0042739414 scopus 로고    scopus 로고
    • Hands-free speech-based navigation during dictation: Difficulties, consequences, and solutions
    • Sears, A.; Feng, J.; Oseitutu, K.; and Karat, C.-M. Hands-free speech-based navigation during dictation: Difficulties, consequences, and solutions. Human-Computer Interaction, 18, 3 (2003), 229-257.
    • (2003) Human-Computer Interaction , vol.18 , Issue.3 , pp. 229-257
    • Sears, A.1    Feng, J.2    Oseitutu, K.3    Karat, C.-M.4
  • 40
    • 0010250404 scopus 로고    scopus 로고
    • Productivity, satisfaction, and interaction strategies of individuals with spinal cord injuries and traditional users interacting with speech recognition software
    • (June)
    • Sears, A.; Karat, C.-M.; Oseitutu, K.; Karimullah, A.; and Feng, J. Productivity, satisfaction, and interaction strategies of individuals with spinal cord injuries and traditional users interacting with speech recognition software. Universal Access in the Information Society, 1, 1 (June 2001), 4-15.
    • (2001) Universal Access in the Information Society , vol.1 , Issue.1 , pp. 4-15
    • Sears, A.1    Karat, C.-M.2    Oseitutu, K.3    Karimullah, A.4    Feng, J.5
  • 44
    • 0032687452 scopus 로고    scopus 로고
    • Advances in confidence measures for large vocabulary
    • In D. Cochran and A. Spanias (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Wendemuth, A.; Rose, G.; and Dolfing, J.G.A. Advances in confidence measures for large vocabulary. In D. Cochran and A. Spanias (eds.), 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2. Los Alamitos, CA: IEEE Computer Society Press, 1999, pp. 705-708.
    • (1999) 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.2 , pp. 705-708
    • Wendemuth, A.1    Rose, G.2    Dolfing, J.G.A.3
  • 45
    • 0033708499 scopus 로고    scopus 로고
    • Using posterior probabilities for improved speech recognition
    • In H. Abut and L. Onural (eds.), Los Alamitos, CA: IEEE Computer Society Press
    • Wessel, F.; Schluter, R.; and Ney, H. Using posterior probabilities for improved speech recognition. In H. Abut and L. Onural (eds.), 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3. Los Alamitos, CA: IEEE Computer Society Press, 2000, pp. 1587-1590.
    • (2000) 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.3 , pp. 1587-1590
    • Wessel, F.1    Schluter, R.2    Ney, H.3
  • 47
    • 85079240850 scopus 로고
    • Detecting misrecognitions and out-of-vocabulary words
    • In Los Alamitos, CA: IEEE Computer Society Press
    • Young, S.R. Detecting misrecognitions and out-of-vocabulary words. In 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 2. Los Alamitos, CA: IEEE Computer Society Press, 1994, pp. 21-24.
    • (1994) 1994 IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.2 , pp. 21-24
    • Young, S.R.1
  • 48
    • 21844441398 scopus 로고    scopus 로고
    • Challenges, methodologies, and issues in the usability testing of mobile applications
    • (July)
    • Zhang, D., and Adipat, B. Challenges, methodologies, and issues in the usability testing of mobile applications. International Journal of Human-Computer Interaction, 18, 3 (July 2005), 293-308.
    • (2005) International Journal of Human-Computer Interaction , vol.18 , Issue.3 , pp. 293-308
    • Zhang, D.1    Adipat, B.2
  • 49
    • 85009135077 scopus 로고    scopus 로고
    • Word level confidence annotation using combinations of features
    • In P. Dalsgaard, B. Lindberg, H. Benner, and Z. Tan (eds.), Bonn, Germany: International Speech Communication Association
    • Zhang, R., and Rudnicky, A.I. Word level confidence annotation using combinations of features. In P. Dalsgaard, B. Lindberg, H. Benner, and Z. Tan (eds.), Proceedings of the Seventh European Conference on Speech Communication and Technology. Bonn, Germany: International Speech Communication Association, 2001, pp. 2105-2108.
    • (2001) Proceedings of the Seventh European Conference on Speech Communication and Technology , pp. 2105-2108
    • Zhang, R.1    Rudnicky, A.I.2
  • 50
    • 85008024850 scopus 로고    scopus 로고
    • Data mining for detecting errors in dictation speech recognition
    • (September)
    • Zhou, L.; Shi, Y.; Feng, J.; and Sears, A. Data mining for detecting errors in dictation speech recognition. IEEE Transactions on Speech and Audio Processing, 13, 5 (September 2005), 681-688.
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 681-688
    • Zhou, L.1    Shi, Y.2    Feng, J.3    Sears, A.4
  • 51
    • 85009112311 scopus 로고    scopus 로고
    • A two-level schema for detecting recognition errors
    • In S.H. Kim, S. Lee, Y. Oh, and Y. Lee (eds.), Bonn, Germany: International Speech Communication Association
    • Zhou, Z., and Meng, H. A two-level schema for detecting recognition errors. In S.H. Kim, S. Lee, Y. Oh, and Y. Lee (eds.), Proceedings of the Eighth International Conference on Spoken Language Processing. Bonn, Germany: International Speech Communication Association, 2004, pp. 449-452.
    • (2004) Proceedings of the Eighth International Conference on Spoken Language Processing , pp. 449-452
    • Zhou, Z.1    Meng, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.