-
1
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
Nov
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, Nov 2012.
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
2
-
-
0030266571
-
Closed-captioned television presentation speed and vocabulary
-
C. Jensema, R. McCann, and S. Ramsey, "Closed-captioned television presentation speed and vocabulary," American Annals of the Deaf, vol. 141, no. 4, pp. 284-292, 1996.
-
(1996)
American Annals of the Deaf
, vol.141
, Issue.4
, pp. 284-292
-
-
Jensema, C.1
McCann, R.2
Ramsey, S.3
-
3
-
-
84906928729
-
Report on the 10th IWSLT evaluation campaign
-
Heidelberg; Germany
-
M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, and M. Federico, "Report on the 10th IWSLT evaluation campaign," in Proc. IWSLT, Heidelberg, Germany, 2013, http://www.eubridge.eu/87 282.php.
-
(2013)
Proc. IWSLT
-
-
Cettolo, M.1
Niehues, J.2
Stüker, S.3
Bentivogli, L.4
Federico, M.5
-
4
-
-
56149084455
-
Recent progress in the MIT spoken lecture processing project
-
J. Glass, T. J. Hazen, S. Cyphers, I. Malioutov, D. Huynh, and R. Barzilay, "Recent Progress in the MIT Spoken Lecture Processing Project," in Proc. Interspeech, 2007. [Online]. Available: http://groups.csail.mit.edu/sls/publications/2007/Interspeech07-glass-lecture.pdf
-
(2007)
Proc. Interspeech
-
-
Glass, J.1
Hazen, T.J.2
Cyphers, S.3
Malioutov, I.4
Huynh, D.5
Barzilay, R.6
-
5
-
-
51449091001
-
Dynamic language model adaptation using presentation slides for lecture speech recognition
-
H. Yamazaki, K. Iwano, K. Shinoda, S. Furui, and H. Yokota, "Dynamic language model adaptation using presentation slides for lecture speech recognition," in Proc. INTERSPEECH, 2007, pp. 2349-2352.
-
(2007)
Proc. INTERSPEECH
, pp. 2349-2352
-
-
Yamazaki, H.1
Iwano, K.2
Shinoda, K.3
Furui, S.4
Yokota, H.5
-
6
-
-
56149116530
-
Web-based language modelling for automatic lecture transcription
-
C. Munteanu, G. Penn, and R. Baecker, "Web-based language modelling for automatic lecture transcription," in Proc. INTERSPEECH, 2007.
-
(2007)
Proc. INTERSPEECH
-
-
Munteanu, C.1
Penn, G.2
Baecker, R.3
-
7
-
-
56149107305
-
Automatic transcription for a Web 2.0 service to search podcasts
-
Antwerp, Belgium, August 27-31, 2007
-
J. Ogata, M. Goto, and K. Eto, "Automatic transcription for a Web 2.0 service to search podcasts," in INTERSPEECH 2007, 8th Annual Conference of the International Speech Communication Association, Antwerp, Belgium, August 27-31, 2007, pp. 2617-2620.
-
(2007)
INTERSPEECH 2007, 8th Annual Conference of the International Speech Communication Association
, pp. 2617-2620
-
-
Ogata, J.1
Goto, M.2
Eto, K.3
-
8
-
-
79951777091
-
Toward better crowdsourced transcription: Transcription of a year of the Let's Go bus information system data
-
G. Parent and M. Eskenazi, "Toward better crowdsourced transcription: Transcription of a year of the Let's Go bus information system data," in Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE, 2010, pp. 312-317.
-
(2010)
Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE
, pp. 312-317
-
-
Parent, G.1
Eskenazi, M.2
-
9
-
-
78049407752
-
Using the Amazon Mechanical Turk for transcription of spoken language
-
M. Marge, S. Banerjee, and A. I. Rudnicky, "Using the Amazon Mechanical Turk for transcription of spoken language," in Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on. IEEE, 2010, pp. 5270-5273.
-
(2010)
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference On. IEEE
, pp. 5270-5273
-
-
Marge, M.1
Banerjee, S.2
Rudnicky, A.I.3
-
10
-
-
79958275518
-
Cheap, fast and good enough: Automatic speech recognition with non-expert transcription
-
Los Angeles, California: Association for Computational Linguistics, June
-
S. Novotney and C. Callison-Burch, "Cheap, fast and good enough: Automatic speech recognition with non-expert transcription," in Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Los Angeles, California: Association for Computational Linguistics, June 2010, pp. 207-215. [Online]. Available: http://www.aclweb.org/anthology/N10-1024
-
(2010)
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
, pp. 207-215
-
-
Novotney, S.1
Callison-Burch, C.2
-
11
-
-
84865758190
-
A transcription task for crowdsourcing with automatic quality control
-
C.-Y. Lee and J. R. Glass, "A transcription task for crowdsourcing with automatic quality control," in INTERSPEECH'11, 2011, pp. 3041-3044.
-
(2011)
INTERSPEECH'11
, pp. 3041-3044
-
-
Lee, C.-Y.1
Glass, J.R.2
-
12
-
-
57649158901
-
Collaborative editing for improved usefulness and usability of transcript-enhanced webcasts
-
Florence, Italy, April 5-10, 2008
-
C. Munteanu, R. Baecker, and G. Penn, "Collaborative editing for improved usefulness and usability of transcript-enhanced webcasts," in Proceedings of the 2008 Conference on Human Factors in Computing Systems, CHI 2008, Florence, Italy, April 5-10, 2008, pp. 373-382. [Online]. Available: http://doi.acm.org/10.1145/1357054.1357117
-
(2008)
Proceedings of the 2008 Conference on Human Factors in Computing Systems, CHI 2008, 2008
, pp. 373-382
-
-
Munteanu, C.1
Baecker, R.2
Penn, G.3
-
13
-
-
84943243018
-
Evaluation of interactive user corrections for lecture transcription
-
Hong Kong, December 6-7, 2012
-
H. Kolkhorst, K. Kilgour, S. Stüker, and A. Waibel, "Evaluation of interactive user corrections for lecture transcription," in 2012 International Workshop on Spoken Language Translation, IWSLT 2012, Hong Kong, December 6-7, 2012, pp. 217-221.
-
(2012)
2012 International Workshop on Spoken Language Translation, IWSLT 2012
, pp. 217-221
-
-
Kolkhorst, H.1
Kilgour, K.2
Stüker, S.3
Waibel, A.4
-
14
-
-
84869046812
-
Real-time captioning by groups of non-experts
-
ACM Press
-
W. Lasecki, C. Miller, A. Sadilek, A. Abumoussa, D. Borrello, R. S. Kushalnagar, and J. Bigham, "Real-time captioning by groups of non-experts," in Proceedings of the 25th annual ACM symposium on User interface software and technology-UIST '12. ACM Press, 2012, pp. 23-34.
-
(2012)
Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology-UIST '12
, pp. 23-34
-
-
Lasecki, W.1
Miller, C.2
Sadilek, A.3
Abumoussa, A.4
Borrello, D.5
Kushalnagar, R.S.6
Bigham, J.7
-
16
-
-
84938721908
-
A keyword search system using open source software
-
South Lake Tahoe, NV; USA: IEEE, Dec. To appear
-
J. Trmal, G. Chen, D. Povey, S. Khudanpur, P. Ghahremani, X. Zhang, V. Manohar, C. Liu, A. Jansen, D. Klakow, D. Yarowsky, and F. Metze, "A keyword search system using open source software," in Proc. IEEE Workshop on Spoken Language Technology. South Lake Tahoe, NV; USA: IEEE, Dec. 2014, to appear.
-
(2014)
Proc. IEEE Workshop on Spoken Language Technology
-
-
Trmal, J.1
Chen, G.2
Povey, D.3
Khudanpur, S.4
Ghahremani, P.5
Zhang, X.6
Manohar, V.7
Liu, C.8
Jansen, A.9
Klakow, D.10
Yarowsky, D.11
Metze, F.12
-
17
-
-
84976253431
-
Results of the 2006 spoken term detection evaluation
-
J. G. Fiscus, J. Ajot, J. S. Garofolo, and G. Doddington, "Results of the 2006 spoken term detection evaluation," in Proc. SIGIR, vol. 7, 2007, pp. 51-57.
-
(2007)
Proc. SIGIR
, vol.7
, pp. 51-57
-
-
Fiscus, J.G.1
Ajot, J.2
Garofolo, J.S.3
Doddington, G.4
-
18
-
-
80052042597
-
Lattice indexing for spoken term detection
-
Nov
-
D. Can and M. Saraclar, "Lattice indexing for spoken term detection," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 8, pp. 2338-2347, Nov 2011.
-
(2011)
Audio, Speech, and Language Processing, IEEE Transactions on
, vol.19
, Issue.8
, pp. 2338-2347
-
-
Can, D.1
Saraclar, M.2
-
19
-
-
4544257924
-
Vocabulary-independent search in spontaneous speech
-
IEEE
-
F. Seide, P. Yu, C. Ma, and E. Chang, "Vocabulary-independent search in spontaneous speech," in Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP'04). IEEE International Conference on, vol. 1. IEEE, 2004, pp. I-253.
-
(2004)
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP'04). IEEE International Conference on
, vol.1
, pp. I-253
-
-
Seide, F.1
Yu, P.2
Ma, C.3
Chang, E.4
-
20
-
-
36448941168
-
Vocabulary independent spoken term detection
-
J. Mamou, B. Ramabhadran, and O. Siohan, "Vocabulary independent spoken term detection," in Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2007, pp. 615-622.
-
(2007)
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM
, pp. 615-622
-
-
Mamou, J.1
Ramabhadran, B.2
Siohan, O.3
-
21
-
-
56149113962
-
Rapid and accurate spoken term detection
-
D. R. Miller, M. Kleber, C.-L. Kao, O. Kimball, T. Colthurst, S. A. Lowe, R. M. Schwartz, and H. Gish, "Rapid and accurate spoken term detection," in Eighth Annual Conference of the International Speech Communication Association, 2007.
-
(2007)
Eighth Annual Conference of the International Speech Communication Association
-
-
Miller, D.R.1
Kleber, M.2
Kao, C.-L.3
Kimball, O.4
Colthurst, T.5
Lowe, S.A.6
Schwartz, R.M.7
Gish, H.8
-
22
-
-
84946076428
-
TED-LIUM: An automatic speech recognition dedicated corpus
-
A. Rousseau, P. Deléglise, and Y. Estève, "TED-LIUM: An automatic speech recognition dedicated corpus," in LREC, 2012, pp. 125-129.
-
(2012)
LREC
, pp. 125-129
-
-
Rousseau, A.1
Deléglise, P.2
Estève, Y.3
-
23
-
-
84876795561
-
The Kaldi speech recognition toolkit
-
Dec
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motlicek, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The Kaldi speech recognition toolkit," in IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society, Dec. 2011.
-
(2011)
IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. IEEE Signal Processing Society
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlicek, P.8
Qian, Y.9
Schwarz, P.10
Silovsky, J.11
Stemmer, G.12
Vesely, K.13
-
25
-
-
84910087158
-
-
National Institute of Standards and Technology
-
National Institute of Standards and Technology, "NIST open keyword search 2014 evaluation (OpenKWS14)," http://www.nist.gov/itl/iad/mig/openkws14.cfm.
-
NIST Open Keyword Search 2014 Evaluation (OpenKWS14)
-
-