SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2013, Pages 6753-6757

Developing speech recognition systems for corpus indexing under the IARPA Babel program

(10) Cui, Jia a Cui, Xiaodong a Ramabhadran, Bhuvana a Kim, Janice a Kingsbury, Brian a Mamou, Jonathan b Mangu, Lidia a Picheny, Michael a Sainath, Tara N a Sethy, Abhinav a

a IBM T J WATSON RESEARCH CENTER (United States)

b IBM HAIFA RESEARCH LAB (Israel)

Author keywords

acoustic modeling; bootstrap; deep learning; keyword search; language modeling

Indexed keywords

ACOUSTIC MODEL; BOOTSTRAP; DEEP LEARNING; KEYWORD SEARCH; LANGUAGE MODEL;

COMPUTATIONAL LINGUISTICS; NEURAL NETWORKS; SIGNAL PROCESSING; SPEECH RECOGNITION;

SEARCH ENGINES;

EID: 84890507010 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2013.6638969 Document Type: Conference Paper

Times cited : (41)

References (22)

1
- 79951796005
- The IBM Attila speech recognition toolkit
- H. Soltau, G. Saon, and B. Kingsbury, "The IBM Attila speech recognition toolkit," in Proc. IEEE Workshop on Spoken Language Technology, 2010
- (2010) Proc. IEEE Workshop on Spoken Language Technology
- Soltau, H.¹ Saon, G.² Kingsbury, B.³

2
- 34047266376
- Advances in speech transcription at IBM under the DARPA EARS program
- S. Chen, B. Kingsbury, L. Mangu, D. Povey, G. Saon, H. Soltau, and G. Zweig, "Advances in speech transcription at IBM under the DARPA EARS program," IEEE Trans. on Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1596-1608, 2006
- (2006) IEEE Trans. on Audio, Speech and Language Processing , vol.14 , Issue.5 , pp. 1596-1608
- Chen, S.¹ Kingsbury, B.² Mangu, L.³ Povey, D.⁴ Saon, G.⁵ Soltau, H.⁶ Zweig, G.⁷

3
- 77949347726
- Dynamic network decoding revisited
- H. Soltau and G. Saon, "Dynamic network decoding revisited," in Proc. ASRU, 2009
- (2009) Proc. ASRU
- Soltau, H.¹ Saon, G.²

4
- 84865265602
- Hidden Markov acoustic modeling with bootstrap and restructuring for lowresourced languages
- X. Cui, J. Xue, X. Chen, P. A. Olsen, P. L. Dognin, U. V. Chaudhari, J. R. Hershey, and B. Zhou, "Hidden Markov acoustic modeling with bootstrap and restructuring for lowresourced languages," IEEE Trans. on Audio, Speech and Language Processing, vol. 20, no. 8, pp. 2252-2264, 2012
- (2012) IEEE Trans. on Audio, Speech and Language Processing , vol.20 , Issue.8 , pp. 2252-2264
- Cui, X.¹ Xue, J.² Chen, X.³ Olsen, P.A.⁴ Dognin, P.L.⁵ Chaudhari, U.V.⁶ Hershey, J.R.⁷ Zhou, B.⁸

5
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. ASRU, 2011
- (2011) Proc. ASRU
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

6
- 0038338085
- A continuous speech recognition system embedding MLP into HMM
- D. S. Touretzky, Ed
- H. Bourlard and N. Morgan, "A continuous speech recognition system embedding MLP into HMM," in Advanced in Neural Information Processing Systems 2, D. S. Touretzky, Ed., 1990, pp. 186-193
- (1990) Advanced in Neural Information Processing Systems 2 , pp. 186-193
- Bourlard, H.¹ Morgan, N.²

7
- 84878379108
- Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization
- B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization," in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Kingsbury, B.¹ Sainath, T.N.² Soltau, H.³

8
- 77956541496
- Deep learning via Hessian-free optimization
- J. Martens, "Deep learning via Hessian-free optimization," in Proc. Intl. Conf. on Machine Learning (ICML), 2010
- (2010) Proc. Intl. Conf. on Machine Learning (ICML)
- Martens, J.¹

9
- 84858972572
- Making deep belief networks effective for large vocabulary continuous speech recognition
- T. N. Sainath, B. Kingsbury, B. Ramabhadran, P. Fousek, P. Novk, and A. Mohamed, "Making deep belief networks effective for large vocabulary continuous speech recognition," in ASRU, 2011, pp. 30-35
- (2011) ASRU , pp. 30-35
- Sainath, T.N.¹ Kingsbury, B.² Ramabhadran, B.³ Fousek, P.⁴ Novk, P.⁵ Mohamed, A.⁶

10
- 84867593213
- Autoencoder bottleneck features using deep belief networks
- T. N. Sainath, B. Kingsbury, and B. Ramabhadran, "Autoencoder bottleneck features using deep belief networks," in Proc. ICASSP, 2012
- (2012) Proc. ICASSP
- Sainath, T.N.¹ Kingsbury, B.² Ramabhadran, B.³

11
- 34547548235
- Probabilistic and bottleneck features for LVCSR of meetings
- F. Grezl, M. Karafiat, S. Kontar, and J. Cernocky, "Probabilistic and bottleneck features for LVCSR of meetings," in Proc. ICASSP, 2007
- (2007) Proc. ICASSP
- Grezl, F.¹ Karafiat, M.² Kontar, S.³ Cernocky, J.⁴

12
- 0009577944
- A neural probabilistic language model
- Y. Bengio, R. Ducharme, and P. Vincent, "A neural probabilistic language model," in Proc. Neural Information Processing Systems (NIPS), 2000
- (2000) Proc. Neural Information Processing Systems (NIPS)
- Bengio, Y.¹ Ducharme, R.² Vincent, P.³

13
- 84878422162
- Large scale hierarchical neural network language models
- H. Kuo, E. Arisoy, A. Emami, and P. Vozila, "Large scale hierarchical neural network language models," in Proc. Interspeech, 2012
- (2012) Proc. Interspeech
- Kuo, H.¹ Arisoy, E.² Emami, A.³ Vozila, P.⁴

14
- 77949349100
- Scaling shrinkage-based language models
- S. F. Chen, L. Mangu, B. Ramabhadran, R. Sarikaya, and A. Sethy, "Scaling shrinkage-based language models," in Proceedings of ASRU, 2009
- (2009) Proceedings of ASRU
- Chen, S.F.¹ Mangu, L.² Ramabhadran, B.³ Sarikaya, R.⁴ Sethy, A.⁵

15
- 79951634009
- Results of the 2006 spoken term detection evaluation
- J. G. Fiscus, J. G. Ajot, J. Garofalo, and G. Doddington, "Results of the 2006 spoken term detection evaluation," in Proc. SIGIR Workshop on Searching Spontaneous Conversational Speech, 2007, pp. 51-57
- (2007) Proc. SIGIR Workshop on Searching Spontaneous Conversational Speech , pp. 51-57
- Fiscus, J.G.¹ Ajot, J.G.² Garofalo, J.³ Doddington, G.⁴

16
- 70349211775
- Effect of pronounciations on OOV queries in spoken term detection
- Dogan Can, Erica Cooper, Abhinav Sethy, Chris White, Bhuvana Ramabhadran, and Murat Saraclar, "Effect of pronounciations on OOV queries in spoken term detection," Proceedings of ICASSP, 2009
- (2009) Proceedings of ICASSP
- Can, D.¹ Cooper, E.² Sethy, A.³ White, C.⁴ Ramabhadran, B.⁵ Saraclar, M.⁶

17
- 85050187568
- Lattice-based search for spoken utterance retrieval
- Murat Saraclar and Richard W. Sproat, "Lattice-based search for spoken utterance retrieval," in HLT-NAACL, 2004
- (2004) HLT-NAACL
- Saraclar, M.¹ Sproat, R.W.²

18
- 36448941168
- Vocabulary independent spoken term detection
- Jonathan Mamou, Bhuvana Ramabhadran, and Olivier Siohan, "Vocabulary independent spoken term detection," in Proceedings of SIGIR, 2007
- (2007) Proceedings of SIGIR
- Mamou, J.¹ Ramabhadran, B.² Siohan, O.³

19
- 77949407432
- Query-by-example spoken term detection for OOV terms
- C. Parada, A. Sethy, and B. Ramabhadran, "Query-by-example spoken term detection for OOV terms," in ASRU, 2009
- (2009) ASRU
- Parada, C.¹ Sethy, A.² Ramabhadran, B.³

20
- 84890542302
- Exploiting diversity for spoken term detection
- To appear
- L. Mangu, H. Soltau, H.-K. Kuo, B. Kingsbury, and G. Saon, "Exploiting diversity for spoken term detection," in Proc. ICASSP, 2013. To appear
- (2013) Proc. ICASSP
- Mangu, L.¹ Soltau, H.² Kuo, H.-K.³ Kingsbury, B.⁴ Saon, G.⁵

21
- 85032751458
- Deep neural networks for acoustic modeling in speech recognition
- G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal Processing Magazine, 2012
- (2012) IEEE Signal Processing Magazine
- Hinton, G.¹ Deng, L.² Yu, D.³ Dahl, G.⁴ Mohamed, A.⁵ Jaitly, N.⁶ Senior, A.⁷ Vanhoucke, V.⁸ Nguyen, P.⁹ Sainath, T.¹⁰ Kingsbury, B.¹¹

22
- 84890489531
- System combination and score normalization for spoken term detection
- To appear
- J. Mamou, J. Cui, X. Cui, M. J. F. Gales, B. Kingsbury, K. Knill, L. Mangu, D. Nolden, M. Picheny, B. Ramabhadran, R. Schluter, A. Sethy, and P. C. Woodland, "System combination and score normalization for spoken term detection," in Proc. ICASSP, 2013. To appear.
- (2013) Proc. ICASSP
- Mamou, J.¹ Cui, J.² Cui, X.³ Gales, M.J.F.⁴ Kingsbury, B.⁵ Knill, K.⁶ Mangu, L.⁷ Nolden, D.⁸ Picheny, M.⁹ Ramabhadran, B.¹⁰ Schluter, R.¹¹ Sethy, A.¹² Woodland, P.C.¹³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.