SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume 2015-January, 2015, Pages 6-10

Architectures for deep neural network based acoustic models defined over windowed speech waveforms

(2) Bhargava, Mayank a; Rose, Richard a,b

a MCGILL UNIVERSITY (Canada)

b GOOGLE INC (United States)

Author keywords

Bottleneck features; Deep Neural Networks; Speech Recognition; Waveform Speech

Indexed keywords

NETWORK ARCHITECTURE; SPEECH; SPEECH COMMUNICATION;

AUTOMATIC SPEECH RECOGNITION; BOTTLENECK FEATURES; DEEP NEURAL NETWORKS; INTERNAL REPRESENTATION; SPEECH WAVEFORMS; TIME-DOMAIN SIGNAL; WALL STREET JOURNAL; WAVE FORMS;

SPEECH RECOGNITION;

EID: 84959098603 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited: (22)

References (13)

1
- 84906273908
- Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks
- Lyon, France, Aug.
- D. Palaz, R. Collobert, and M. Magimai.-Doss, "Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks," in Proc. Interspeech, Lyon, France, Aug. 2013, pp. 1766-1770.
- (2013) Proc. Interspeech , pp. 1766-1770
- Palaz, D.¹ Collobert, R.² Magimai.-Doss, M.³

2
- 84910065702
- Acoustic modeling with deep neural networks using raw time signal for LVCSR
- Singapore, Sept.
- Z. Tüske, P. Golik, R. Schlüter, and H. Ney, "Acoustic modeling with deep neural networks using raw time signal for LVCSR," in Interspeech, Singapore, Sept. 2014, pp. 890-894.
- (2014) Interspeech , pp. 890-894
- Tüske, Z.¹ Golik, P.² Schlüter, R.³ Ney, H.⁴

3
- 84959102679
- Speech acoustic modelling in raw multichannel waveforms
- Y. Hoshen, R. J. Weiss, and K. W. Wilson, "Speech acoustic modelling in raw multichannel waveforms," in Proc. ICASSP, 2015.
- (2015) Proc. ICASSP
- Hoshen, Y.¹ Weiss, R.J.² Wilson, K.W.³

4
- 0022548705
- On the role of spectral transition for speech perception
- S. Furui, "On the role of spectral transition for speech perception," J. Acoust. Soc. Am., vol. 80, no. 4, pp. 1016-1025, 1986.
- (1986) J. Acoust. Soc. Am , vol.80 , Issue.4 , pp. 1016-1025
- Furui, S.¹

5
- 3543081154
- Modulation spectrum in speech processing
- H. Hermansky, "Modulation spectrum in speech processing," in Signal Analysis and Prediction. Birkhäuser Boston, 1998, pp. 395-406.
- (1998) Signal Analysis and Prediction. Birkhäuser Boston , pp. 395-406
- Hermansky, H.¹

6
- 84867585919
- Understanding how deep belief networks perform acoustic modelling
- A. Mohamed, G. Hinton, and G. Penn, "Understanding how deep belief networks perform acoustic modelling," in ICASSP, 2012.
- (2012) ICASSP
- Mohamed, A.¹ Hinton, G.² Penn, G.³

7
- 84865785753
- Improved bottleneck features using pretrained deep neural networks
- D. Yu and M. L. Seltzer, "Improved bottleneck features using pretrained deep neural networks," in Proc. Interspeech, 2011, pp. 237-240.
- (2011) Proc. Interspeech , pp. 237-240
- Yu, D.¹ Seltzer, M.L.²

8
- 0012330750
- The design for the Wall Street Journal-based CSR corpus
- Association for Computational Linguistics
- D. B. Paul and J. M. Baker, "The design for the Wall Street Journal-based CSR corpus," in Proceedings of the Workshop on Speech and Natural Language. Association for Computational Linguistics, 1992, pp. 357-362.
- (1992) Proceedings of the Workshop on Speech and Natural Language , pp. 357-362
- Paul, D.B.¹ Baker, J.M.²

9
- 84858953642
- The Kaldi speech recognition toolkit
- D. Povey, A. Ghoshal, et al., "The Kaldi speech recognition toolkit," in Proc. ASRU, 2011.
- (2011) Proc. ASRU
- Povey, D.¹ Ghoshal, A.²

10
- 51449103447
- Optimizing bottle-neck features for LVCSR
- F. Grézl and P. Fousek, "Optimizing bottle-neck features for LVCSR," in Proc. ICASSP, 2008.
- (2008) Proc. ICASSP
- Grézl, F.¹ Fousek, P.²

11
- 79959811995
- Hierarchical neural net architectures for feature extraction in ASR
- F. Grézl and M. Karafiát, "Hierarchical neural net architectures for feature extraction in ASR," in Proc. INTERSPEECH, 2010, pp. 1201-1204.
- (2010) Proc. INTERSPEECH , pp. 1201-1204
- Grézl, F.¹ Karafiát, M.²

12
- 84906273176
- Modular combination of deep neural networks for acoustic modeling
- J. Gehring, W. Lee, K. Kilgour, I. Lane, Y. Miao, and A. Waibel, "Modular combination of deep neural networks for acoustic modeling," in Proc. Interspeech, 2013, pp. 94-98.
- (2013) Proc. Interspeech , pp. 94-98
- Gehring, J.¹ Lee, W.² Kilgour, K.³ Lane, I.⁴ Miao, Y.⁵ Waibel, A.⁶

13
- 84893688455
- Learning filter banks within a deep neural network framework
- T. N. Sainath, B. Kingsbury, A. Mohamed, and B. Ramabhadran, "Learning filter banks within a deep neural network framework," in Proc. of ASRU, 2013.
- (2013) Proc. of ASRU
- Sainath, T.N.¹ Kingsbury, B.² Mohamed, A.³ Ramabhadran, B.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.