SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume 08-12-September-2016, Issue , 2016, Pages 3062-3065

Manipulating word lattices to incorporate human corrections

(3) Gaur, Yashesh (a); Metze, Florian (a); Bigham, Jeffrey P. (a)

(a) Carnegie Mellon University (United States)

Author keywords

Human computation; Keyword spotting; Speech recognition; Word lattices

Indexed keywords

ERRORS; SEARCH ENGINES; SPEECH COMMUNICATION; SPEECH PROCESSING;

AUTOMATIC SPEECH RECOGNITION; HUMAN COMPUTATION; KEYWORD SEARCH; KEYWORD SPOTTING; LARGE AMOUNTS OF DATA; OFFLINE; WORD ERROR RATE;

SPEECH RECOGNITION;

EID: 84994200760 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: 10.21437/Interspeech.2016-660 Document Type: Conference Paper

Times cited : (7)

References (20)

1
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
- Nov
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, Nov 2012.
- (2012) Signal Processing Magazine, IEEE , vol.29 , Issue.6 , pp. 82-97
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

2
- 84906928729
- Report on the 10th iwslt evaluation campaign
- Heidelberg; Germany
- M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, and M. Federico, "Report on the 10th iwslt evaluation campaign," in Proc. IWSLT, Heidelberg; Germany, 2013, http://www.eubridge.eu/87282.php.
- (2013) Proc. IWSLT
- Cettolo, M.¹ Niehues, J.² Stüker, S.³ Bentivogli, L.⁴ Federico, M.⁵

3
- 56149084455
- Recent progress in the mit spoken lecture processing project
- J. Glass, T. J. Hazen, S. Cyphers, I. Malioutov, D. Huynh, and R. Barzilay, "Recent Progress in the MIT Spoken Lecture Processing Project," in Proc. Interspeech, 2007. [Online]. Available: http://groups.csail.mit.edu/sls/publications/2007/Interspeech07-glass-lecture.pdf
- (2007) Proc. Interspeech
- Glass, J.¹ Hazen, T.J.² Cyphers, S.³ Malioutov, I.⁴ Huynh, D.⁵ Barzilay, R.⁶

4
- 0030266571
- Closed-captioned television presentation speed and vocabulary
- C. Jensema, R. McCann, and S. Ramsey, "Closed-captioned television presentation speed and vocabulary," American Annals of the deaf, vol. 141, no. 4, pp. 284-292, 1996.
- (1996) American Annals of the Deaf , vol.141 , Issue.4 , pp. 284-292
- Jensema, C.¹ McCann, R.² Ramsey, S.³

5
- 51449091001
- Dynamic language model adaptation using presentation slides for lecture speech recognition
- H. Yamazaki, K. Iwano, K. Shinoda, S. Furui, and H. Yokota, "Dynamic language model adaptation using presentation slides for lecture speech recognition," in Proc. INTERSPEECH, 2007, pp. 2349-2352.
- (2007) Proc. INTERSPEECH , pp. 2349-2352
- Yamazaki, H.¹ Iwano, K.² Shinoda, K.³ Furui, S.⁴ Yokota, H.⁵

6
- 56149116530
- Web-based language modelling for automatic lecture transcription
- C. Munteanu, G. Penn, and R. Baecker, "Web-based language modelling for automatic lecture transcription," in Proc. INTERSPEECH, 2007.
- (2007) Proc. INTERSPEECH
- Munteanu, C.¹ Penn, G.² Baecker, R.³

7
- 56149107305
- Automatic transcription for a web 2.0 service to search podcasts
- J. Ogata, M. Goto, and K. Eto, "Automatic transcription for a web 2.0 service to search podcasts," in INTERSPEECH 2007, 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007, pp. 2617-2620.
- (2007) INTERSPEECH 2007, 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007 , pp. 2617-2620
- Ogata, J.¹ Goto, M.² Eto, K.³

8
- 79951777091
- Toward better crowdsourced transcription: Transcription of a year of the let's go bus information system data
- G. Parent and M. Eskenazi, "Toward better crowdsourced transcription: Transcription of a year of the let's go bus information system data," in Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE, 2010, pp. 312-317.
- (2010) Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE , pp. 312-317
- Parent, G.¹ Eskenazi, M.²

9
- 78049407752
- Using the amazon mechanical turk for transcription of spoken language
- IEEE
- M. Marge, S. Banerjee, and A. I. Rudnicky, "Using the amazon mechanical turk for transcription of spoken language," in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010, pp. 5270-5273.
- (2010) Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on , pp. 5270-5273
- Marge, M.¹ Banerjee, S.² Rudnicky, A.I.³

10
- 79958275518
- Cheap, fast and good enough: Automatic speech recognition with non-expert transcription
- Los Angeles, California: Association for Computational Linguistics, June 2010
- S. Novotney and C. Callison-Burch, "Cheap, fast and good enough: Automatic speech recognition with non-expert transcription," in Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Los Angeles, California: Association for Computational Linguistics, June 2010, pp. 207-215. [Online]. Available: http://www.aclweb.org/anthology/N10-1024
- Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics , pp. 207-215
- Novotney, S.¹ Callison-Burch, C.²

11
- 84943243018
- Evaluation of interactive user corrections for lecture transcription
- Hong Kong, December 6-7, 2012
- H. Kolkhorst, K. Kilgour, S. Stüker, and A. Waibel, "Evaluation of interactive user corrections for lecture transcription," in 2012 International Workshop on Spoken Language Translation, IWSLT 2012, Hong Kong, December 6-7, 2012, pp. 217-221.
- (2012) 2012 International Workshop on Spoken Language Translation, IWSLT 2012 , pp. 217-221
- Kolkhorst, H.¹ Kilgour, K.² Stüker, S.³ Waibel, A.⁴

12
- 84869046812
- Real-time captioning by groups of non-experts
- New York, USA: ACM Press, Oct
- W. Lasecki, C. Miller, A. Sadilek, A. Abumoussa, D. Borrello, R. S. Kushalnagar, and J. Bigham, "Real-time captioning by groups of non-experts," in Proceedings of the 25th annual ACM symposium on User interface software and technology - UIST '12. New York, New York, USA: ACM Press, Oct. 2012, pp. 23-34. [Online]. Available: http://dl.acm.org/citation.cfm?doid=2380116.2380122
- (2012) Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology - UIST '12. New York , pp. 23-34
- Lasecki, W.¹ Miller, C.² Sadilek, A.³ Abumoussa, A.⁴ Borrello, D.⁵ Kushalnagar, R.S.⁶ Bigham, J.⁷

13
- 84959147559
- Using keyword spotting to help humans correct captioning faster
- Y. Gaur, F. Metze, Y. Miao, and J. P. Bigham, "Using keyword spotting to help humans correct captioning faster," in Sixteenth Annual Conference of the International Speech Communication Association, 2015.
- (2015) Sixteenth Annual Conference of the International Speech Communication Association
- Gaur, Y.¹ Metze, F.² Miao, Y.³ Bigham, J.P.⁴

14
- 84938721908
- A keyword search system using open source software
- South Lake Tahoe, NV; USA: IEEE, Dec to appear
- J. Trmal, G. Chen, D. Povey, S. Khudanpur, P. Ghahremani, X. Zhang, V. Manohar, C. Liu, A. Jansen, D. Klakow, D. Yarowsky, and F. Metze, "A keyword search system using open source software," in Proc. IEEE Workshop on Spoken Language Technology. South Lake Tahoe, NV; USA: IEEE, Dec. 2014, to appear.
- (2014) Proc. IEEE Workshop on Spoken Language Technology
- Trmal, J.¹ Chen, G.² Povey, D.³ Khudanpur, S.⁴ Ghahremani, P.⁵ Zhang, X.⁶ Manohar, V.⁷ Liu, C.⁸ Jansen, A.⁹ Klakow, D.¹⁰ Yarowsky, D.¹¹ Metze, F.¹²

15
- 84946076428
- Ted-lium: An automatic speech recognition dedicated corpus
- A. Rousseau, P. Deléglise, and Y. Estève, "Ted-lium: an automatic speech recognition dedicated corpus," in LREC, 2012, pp. 125-129.
- (2012) LREC , pp. 125-129
- Rousseau, A.¹ Deléglise, P.² Estève, Y.³

16
- 84893696682
- The kaldi speech recognition toolkit
- Dec.
- D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The kaldi speech recognition toolkit," in IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society, Dec. 2011.
- (2011) IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society
- Povey, D.¹ Ghoshal, A.² Boulianne, G.³ Burget, L.⁴ Glembek, O.⁵ Goel, N.⁶ Hannemann, M.⁷ Motlicek, P.⁸ Qian, Y.⁹ Schwarz, P.¹⁰ Silovsky, J.¹¹ Stemmer, G.¹² Vesely, K.¹³

17
- 84910038371
- CoRR, vol. abs/1401.6984
- Y. Miao, "Kaldi+pdnn: Building dnn-based ASR systems with kaldi and PDNN," CoRR, vol. abs/1401.6984, 2014. [Online]. Available: http://arxiv.org/abs/1401.6984
- (2014) Kaldi+pdnn: Building Dnn-based ASR Systems with Kaldi and PDNN
- Miao, Y.¹

18
- 84964489732
- EESEN: End-to-end speech recognition using deep rnn models and wfst-based decoding
- Scottsdale, AZ; U.S.A.: IEEE, Dec
- Y. Miao, M. Gowayyed, and F. Metze, "EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding," in Proc. Automatic Speech Recognition and Understanding Workshop (ASRU). Scottsdale, AZ; U.S.A.: IEEE, Dec. 2015, https://github.com/srvk/eesen.
- (2015) Proc. Automatic Speech Recognition and Understanding Workshop (ASRU)
- Miao, Y.¹ Gowayyed, M.² Metze, F.³

19
- 84946091011
- Scaling recurrent neural network language models
- Brisbane; Australia: IEEE, May
- W. Williams, N. Prasad, D. Mrva, T. Ash, and T. Robinson, "Scaling recurrent neural network language models," in Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. Brisbane; Australia: IEEE, May 2015.
- (2015) Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
- Williams, W.¹ Prasad, N.² Mrva, D.³ Ash, T.⁴ Robinson, T.⁵

20
- 33749259827
- Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks
- ACM
- A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, "Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks," in Proceedings of the 23rd international conference on Machine Learning. ACM, 2006, pp. 369-376.
- (2006) Proceedings of the 23rd International Conference on Machine Learning , pp. 369-376
- Graves, A.¹ Fernández, S.² Gomez, F.³ Schmidhuber, J.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.