SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn 2015-January, Issue , 2015, Pages 2829-2833

Using keyword spotting to help humans correct captioning faster

(4) Gaur, Yashesh a Metze, Florian a Miao, Yajie a Bigham, Jeffrey P a

a Carnegie Mellon University (United States)

Author keywords

Human computer interaction; Real time crowd sourcing; Speech recognition; Spoken term detection

Indexed keywords

AUDITION; HUMAN COMPUTER INTERACTION; SEARCH ENGINES; SPEECH COMMUNICATION;

AUTOMATIC SPEECH RECOGNITION; CORRECT ERROR; HARD OF HEARINGS; KEYWORD SEARCH; KEYWORD SPOTTING; NEW APPROACHES; SPOKEN TERM DETECTIONS; TECHNICAL TALKS;

SPEECH RECOGNITION;

EID: 84959147559 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (10)

References (26)

1
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- Nov
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, Nov 2012.
- (2012) Signal Processing Magazine, IEEE , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

2
- 0030266571
- Closed-captioned television presentation speed and vocabulary
- C. Jensema, R. McCann, and S. Ramsey, "Closed-captioned television presentation speed and vocabulary, " American Annals of the deaf, vol. 141, no. 4, pp. 284-292, 1996.
- (1996) American Annals of the Deaf , vol.141 , Issue.4 , pp. 284-292
- Jensema, C.¹ McCann, R.² Ramsey, S.³

3
- 84906928729
- Report on the 10th iwslt evaluation campaign
- Heidelberg; Germany
- M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, and M. Federico, "Report on the 10th iwslt evaluation campaign, " in Proc. IWSLT, Heidelberg; Germany, 2013, http: //www. eubridge. eu/87 282. php.
- (2013) Proc. IWSLT
- Cettolo, M.¹ Niehues, J.² Stüker, S.³ Bentivogli, L.⁴ Federico, M.⁵

4
- 56149084455
- Recent progress in the MIT spoken lecture processing project
- J. Glass, T. J. Hazen, S. Cyphers, I. Malioutov, D. Huynh, and R. Barzilay, "Recent Progress in the MIT Spoken Lecture Processing Project, " in Proc. Interspeech, 2007. [Online]. Available: http: //groups. csail. mit. edu/sls/publications/2007/Interspeech07-glass-lecture. pdf
- (2007) Proc. Interspeech
- Glass, J.¹ Hazen, T.J.² Cyphers, S.³ Malioutov, I.⁴ Huynh, D.⁵ Barzilay, R.⁶

5
- 51449091001
- Dynamic language model adaptation using presentation slides for lecture speech recognition
- H. Yamazaki, K. Iwano, K. Shinoda, S. Furui, and H. Yokota, "Dynamic language model adaptation using presentation slides for lecture speech recognition, " in In Proc. INTERSPEECH, 2007, pp. 2349-2352.
- (2007) Proc. INTERSPEECH , pp. 2349-2352
- Yamazaki, H.¹ Iwano, K.² Shinoda, K.³ Furui, S.⁴ Yokota, H.⁵

6
- 56149116530
- Web-based language modelling for automatic lecture transcription
- C. Munteanu, G. Penn, and R. Baecker, "Web-based language modelling for automatic lecture transcription, " in Proc. INTERSPEECH, 2007.
- (2007) Proc. INTERSPEECH
- Munteanu, C.¹ Penn, G.² Baecker, R.³

7
- 56149107305
- Automatic transcription for a web 2. 0 service to search podcasts
- Antwerp, Belgium, August 27-31, 2007
- J. Ogata, M. Goto, and K. Eto, "Automatic transcription for a web 2. 0 service to search podcasts, " in INTERSPEECH 2007, 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007, 2007, pp. 2617-2620.
- (2007) INTERSPEECH 2007, 8th Annual Conference of the International Speech Communication Association , pp. 2617-2620
- Ogata, J.¹ Goto, M.² Eto, K.³

8
- 79951777091
- Toward better crowdsourced transcription: Transcription of a year of the let's go bus information system data
- G. Parent and M. Eskenazi, "Toward better crowdsourced transcription: Transcription of a year of the let's go bus information system data, " in Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE, 2010, pp. 312-317.
- (2010) Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE , pp. 312-317
- Parent, G.¹ Eskenazi, M.²

9
- 78049407752
- Using the amazon mechanical turk for transcription of spoken language
- M. Marge, S. Banerjee, and A. I. Rudnicky, "Using the amazon mechanical turk for transcription of spoken language, " in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010, pp. 5270-5273.
- (2010) Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference On. IEEE , pp. 5270-5273
- Marge, M.¹ Banerjee, S.² Rudnicky, A.I.³

10
- 79958275518
- Cheap, fast and good enough: Automatic speech recognition with non-expert transcription
- Los Angeles, California: Association for Computational Linguistics, June
- S. Novotney and C. Callison-Burch, "Cheap, fast and good enough: Automatic speech recognition with non-expert transcription, " in Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Los Angeles, California: Association for Computational Linguistics, June 2010, pp. 207-215. [Online]. Available: http: //www. aclweb. org/anthology/N10-1024
- (2010) Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics , pp. 207-215
- Novotney, S.¹ Callison-Burch, C.²

11
- 84865758190
- A transcription task for crowdsourcing with automatic quality control
- C.-Y. Lee and J. R. Glass, "A transcription task for crowdsourcing with automatic quality control, " in INTERSPEECH'11, 2011, pp. 3041-3044.
- (2011) INTERSPEECH'11 , pp. 3041-3044
- Lee, C.-Y.¹ Glass, J.R.²

12
- 57649158901
- Collaborative editing for improved usefulness and usability of transcript-enhanced webcasts
- Florence, Italy, April 5-10, 2008
- C. Munteanu, R. Baecker, and G. Penn, "Collaborative editing for improved usefulness and usability of transcript-enhanced webcasts, " in Proceedings of the 2008 Conference on Human Factors in Computing Systems, CHI 2008, 2008, Florence, Italy, April 5-10, 2008, 2008, pp. 373-382. [Online]. Available: http: //doi. acm. org/10. 1145/1357054. 1357117
- (2008) Proceedings of the 2008 Conference on Human Factors in Computing Systems, CHI 2008, 2008 , pp. 373-382
- Munteanu, C.¹ Baecker, R.² Penn, G.³

13
- 84943243018
- Evaluation of interactive user corrections for lecture transcription
- Hong Kong, December 6-7, 2012
- H. Kolkhorst, K. Kilgour, S. Stüker, and A. Waibel, "Evaluation of interactive user corrections for lecture transcription, " in 2012 International Workshop on Spoken Language Translation, IWSLT 2012, Hong Kong, December 6-7, 2012, 2012, pp. 217-221.
- (2012) 2012 International Workshop on Spoken Language Translation, IWSLT 2012 , pp. 217-221
- Kolkhorst, H.¹ Kilgour, K.² Stüker, S.³ Waibel, A.⁴

14
- 84869046812
- Real-time captioning by groups of non-experts
- ACM Press
- W. Lasecki, C. Miller, A. Sadilek, A. Abumoussa, D. Borrello, R. S. Kushalnagar, and J. Bigham, "Real-time captioning by groups of non-experts, " in Proceedings of the 25th annual ACM symposium on User interface software and technology-UIST '12. ACM Press, 2012, pp. 23-34.
- (2012) Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology-UIST '12 , pp. 23-34
- Lasecki, W.¹ Miller, C.² Sadilek, A.³ Abumoussa, A.⁴ Borrello, D.⁵ Kushalnagar, R.S.⁶ Bigham, J.⁷

15
- 84959146957
- M. Wald, "Crowdsourcing correction of speech recognition captioning errors, " 2011.
- (2011) Crowdsourcing Correction of Speech Recognition Captioning Errors
- Wald, M.¹

16
- 84938721908
- A keyword search system using open source software
- South Lake Tahoe, NV; USA: IEEE, Dec. To appear
- J. Trmal, G. Chen, D. Povey, S. Khudanpur, P. Ghahremani, X. Zhang, V. Manohar, C. Liu, A. Jansen, D. Klakow, D. Yarowsky, and F. Metze, "A keyword search system using open source software, " in Proc. IEEE Workshop on Spoken Language Technology. South Lake Tahoe, NV; USA: IEEE, Dec. 2014, to appear.
- (2014) Proc. IEEE Workshop on Spoken Language Technology
- Trmal, J.¹ Chen, G.² Povey, D.³ Khudanpur, S.⁴ Ghahremani, P.⁵ Zhang, X.⁶ Manohar, V.⁷ Liu, C.⁸ Jansen, A.⁹ Klakow, D.¹⁰ Yarowsky, D.¹¹ Metze, F.¹²

17
- 84976253431
- Results of the 2006 spoken term detection evaluation
- J. G. Fiscus, J. Ajot, J. S. Garofolo, and G. Doddingtion, "Results of the 2006 spoken term detection evaluation, " in Proc. SIGIR, vol. 7, 2007, pp. 51-57.
- (2007) Proc. SIGIR , vol.7 , pp. 51-57
- Fiscus, J.G.¹ Ajot, J.² Garofolo, J.S.³ Doddingtion, G.⁴

18
- 80052042597
- Lattice indexing for spoken term detection
- Nov
- D. Can and M. Saraclar, "Lattice indexing for spoken term detection, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 8, pp. 2338-2347, Nov 2011.
- (2011) Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , Issue.8 , pp. 2338-2347
- Can, D.¹ Saraclar, M.²

19
- 4544257924
- Vocabulary-independent search in spontaneous speech
- IEEE
- F. Seide, P. Yu, C. Ma, and E. Chang, "Vocabulary-independent search in spontaneous speech, " in Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP'04). IEEE International Conference on, vol. 1. IEEE, 2004, pp. I-253.
- (2004) Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP'04). IEEE International Conference on , vol.1 , pp. I-253
- Seide, F.¹ Yu, P.² Ma, C.³ Chang, E.⁴

20
- 36448941168
- Vocabulary independent spoken term detection
- J. Mamou, B. Ramabhadran, and O. Siohan, "Vocabulary independent spoken term detection, " in Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2007, pp. 615-622.
- (2007) Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM , pp. 615-622
- Mamou, J.¹ Ramabhadran, B.² Siohan, O.³

21
- 56149113962
- Rapid and accurate spoken term detection
- D. R. Miller, M. Kleber, C.-L. Kao, O. Kimball, T. Colthurst, S. A. Lowe, R. M. Schwartz, and H. Gish, "Rapid and accurate spoken term detection, " in Eighth Annual Conference of the International Speech Communication Association, 2007.
- (2007) Eighth Annual Conference of the International Speech Communication Association
- Miller, D.R.¹ Kleber, M.² Kao, C.-L.³ Kimball, O.⁴ Colthurst, T.⁵ Lowe, S.A.⁶ Schwartz, R.M.⁷ Gish, H.⁸

22
- 84946076428
- Ted-lium: An automatic speech recognition dedicated corpus
- A. Rousseau, P. Deléglise, and Y. Estève, "Ted-lium: An automatic speech recognition dedicated corpus. " in LREC, 2012, pp. 125-129.
- (2012) LREC , pp. 125-129
- Rousseau, A.¹ Deléglise, P.² Estève, Y.³

23
- 84876795561
- The kaldi speech recognition toolkit
- Dec
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The kaldi speech recognition toolkit, " in IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society, Dec. 2011.
- (2011) IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlicek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovsky, J.¹¹ Stemmer, G.¹² Vesely, K.¹³

24
- 84910038371
- CoRR abs/1401. 6984
- Y. Miao, "Kaldi+pdnn: Building dnn-based ASR systems with kaldi and PDNN, " CoRR, vol. abs/1401. 6984, 2014. [Online]. Available: http: //arxiv. org/abs/1401. 6984
- (2014) Kaldi+pdnn: Building Dnn-based ASR Systems with Kaldi and PDNN
- Miao, Y.¹

25
- 84910087158
- National Institute of Standards and Technology
- National Institute of Standards and Technology, "NIST open keyword search 2014 evaluation (OpenKWS14), " http: //www. nist. gov/itl/iad/mig/openkws14. cfm.
- NIST Open Keyword Search 2014 Evaluation (OpenKWS14)

26
- 84976216085
- J. Cain, "introduction to computer science-programming paradigms, " http: //see. stanford. edu/see/lecturelist. aspx?coll=2d712634-2bf1-4b55-9a3a-ca9d470755ee.
- Introduction to Computer Science-programming Paradigms
- Cain, J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.