Volume 08-12-September-2016, 2016, Pages 3434-3438

Acoustic modelling from the signal domain using CNNs

Author keywords

Network In Network nonlinearity; Raw waveform; Statistic extraction layer

Indexed keywords

SPEECH COMMUNICATION; SPEECH PROCESSING; SPEECH RECOGNITION

EID: 84994235770     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: 10.21437/Interspeech.2016-1495     Document Type: Conference Paper
Times cited: 88

References (25)
  • 1
    • 0038133939
    • Distance measures for speech recognition, psychological and instrumental
    • P. Mermelstein, "Distance measures for speech recognition, psychological and instrumental," Pattern Recognition and Artificial Intelligence, vol. 116, pp. 374-388, 1976.
    • (1976) Pattern Recognition and Artificial Intelligence, vol. 116, pp. 374-388
    • Mermelstein, P.
  • 2
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," Journal of the Acoustical Society of America, vol. 87, pp. 1738-1752, 1990.
    • (1990) Journal of the Acoustical Society of America, vol. 87, pp. 1738-1752
    • Hermansky, H.
  • 4
    • 84910065702
    • Acoustic modeling with deep neural networks using raw time signal for LVCSR
    • Z. Tüske, P. Golik, R. Schlüter, and H. Ney, "Acoustic modeling with deep neural networks using raw time signal for LVCSR," in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Tüske, Z.; Golik, P.; Schlüter, R.; Ney, H.
  • 10
    • 85016587886
    • Switchboard: Telephone speech corpus for research and development
    • J. J. Godfrey et al., "Switchboard: Telephone speech corpus for research and development," in ICASSP, 1992.
    • (1992) ICASSP
    • Godfrey, J.J.
  • 14
    • 0030263447
    • Mean and variance adaptation within the MLLR framework
    • M. J. F. Gales and P. C. Woodland, "Mean and Variance Adaptation Within the MLLR Framework," Computer Speech and Language, vol. 10, pp. 249-264, 1996.
    • (1996) Computer Speech and Language, vol. 10, pp. 249-264
    • Gales, M.J.F.; Woodland, P.C.
  • 16
    • 84893691530
    • Speaker adaptation of neural network acoustic models using i-vectors
    • G. Saon, H. Soltau, D. Nahamoo, and M. Picheny, "Speaker adaptation of neural network acoustic models using i-vectors," in ASRU, 2013, pp. 55-59.
    • (2013) ASRU, pp. 55-59
    • Saon, G.; Soltau, H.; Nahamoo, D.; Picheny, M.
  • 17
    • 84964483822
    • JHU ASpIRE system: Robust LVCSR with TDNNs, ivector Adaptation, and RNN-LMs
    • V. Peddinti, G. Chen, V. Manohar, T. Ko, D. Povey, and S. Khudanpur, "JHU ASpIRE system: Robust LVCSR with TDNNs, ivector Adaptation, and RNN-LMs," in ASRU, 2015.
    • (2015) ASRU
    • Peddinti, V.; Chen, G.; Manohar, V.; Ko, T.; Povey, D.; Khudanpur, S.
  • 23
    • 84959115289
    • A time delay neural network architecture for efficient modeling of long temporal contexts
    • V. Peddinti, D. Povey, and S. Khudanpur, "A time delay neural network architecture for efficient modeling of long temporal contexts," in Proceedings of INTERSPEECH, 2015.
    • (2015) Proceedings of INTERSPEECH
    • Peddinti, V.; Povey, D.; Khudanpur, S.
  • 24
    • 0012330750
    • The design for the Wall Street Journal-based CSR corpus
    • Association for Computational Linguistics
    • D. B. Paul and J. M. Baker, "The design for the Wall Street Journal-based CSR corpus," in Proceedings of the Workshop on Speech and Natural Language. Association for Computational Linguistics, 1992, pp. 357-362.
    • (1992) Proceedings of the Workshop on Speech and Natural Language, pp. 357-362
    • Paul, D.B.; Baker, J.M.
  • 25
    • 84858953642
    • The Kaldi speech recognition toolkit
    • D. Povey, A. Ghoshal et al., "The Kaldi Speech Recognition Toolkit," in Proc. ASRU, 2011.
    • (2011) Proc. ASRU
    • Povey, D.; Ghoshal, A.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.