메뉴 건너뛰기




Volumn , Issue , 2008, Pages 4417-4420

Hierarchical spectro-temporal features for robust speech recognition

Author keywords

Robust features; Speech recognition

Indexed keywords

ACOUSTICS; FREQUENCY RESPONSE; HIDDEN MARKOV MODELS; HIERARCHICAL SYSTEMS; MARKOV PROCESSES; SIGNAL PROCESSING; SPEECH;

EID: 51449087857     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2008.4518635     Document Type: Conference Paper
Times cited : (23)

References (16)
  • 1
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • R.P. Lippmann, "Speech recognition by machines and humans," Speech Communication, vol. 22, no. 1, pp. 1-15, 1997.
    • (1997) Speech Communication , vol.22 , Issue.1 , pp. 1-15
    • Lippmann, R.P.1
  • 2
    • 0035425442 scopus 로고    scopus 로고
    • On the role of space and time in auditory processing
    • S. Shamma, "On the role of space and time in auditory processing," Trends in Cognitive Sciences, vol. 5, no. 8, pp. 340-348, 2001.
    • (2001) Trends in Cognitive Sciences , vol.5 , Issue.8 , pp. 340-348
    • Shamma, S.1
  • 4
    • 33947657978 scopus 로고    scopus 로고
    • A bilogically-inspired approach to the cocktail party problem
    • M. Elhilali and S. Shamma, "A bilogically-inspired approach to the cocktail party problem," in Proc. ICASSP, 2006, vol. 5, pp. V-637-640.
    • (2006) Proc. ICASSP , vol.5
    • Elhilali, M.1    Shamma, S.2
  • 5
    • 34047272330 scopus 로고    scopus 로고
    • Discrimination of speech from non-speech based on multiscale spectro-temporal modulations
    • N. Mesgarani, M. Slaney, and S. Shamma, "Discrimination of speech from non-speech based on multiscale spectro-temporal modulations," IEEE Transactions on Speech and Audio Processing, vol. 14, no. 3, pp. 920-930, 2006.
    • (2006) IEEE Transactions on Speech and Audio Processing , vol.14 , Issue.3 , pp. 920-930
    • Mesgarani, N.1    Slaney, M.2    Shamma, S.3
  • 8
    • 0038159929 scopus 로고    scopus 로고
    • Learning optimized features for hierarchical models of invariant recognition
    • H. Wersing and E. Körner, "Learning optimized features for hierarchical models of invariant recognition," Neural Computation, vol. 15, no. 7, pp. 1559-1588, 2003.
    • (2003) Neural Computation , vol.15 , Issue.7 , pp. 1559-1588
    • Wersing, H.1    Körner, E.2
  • 9
    • 0003913694 scopus 로고
    • An efficient implementation of the Patterson-Holdsworth auditory filterbank
    • Tech. Rep, Apple Computer Co, Technical report #35
    • M. Slaney, "An efficient implementation of the Patterson-Holdsworth auditory filterbank," Tech. Rep., Apple Computer Co., 1993, Technical report #35.
    • (1993)
    • Slaney, M.1
  • 10
    • 84900510076 scopus 로고    scopus 로고
    • Non-negative matrix factorization with sparseness constraints
    • P. O. Hoyer, "Non-negative matrix factorization with sparseness constraints," Journal of Machine Learning Research, vol. 5, pp. 1457-1469, 2004.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 1457-1469
    • Hoyer, P.O.1
  • 11
    • 0038669544 scopus 로고    scopus 로고
    • The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France
    • H.-G. Hirsch and D. Pearce, "The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in ASR 2000, Paris, France, 2000, pp. 181-188.
    • (2000) ASR 2000 , pp. 181-188
    • Hirsch, H.-G.1    Pearce, D.2
  • 12
    • 0021226391 scopus 로고    scopus 로고
    • R.G. Leonard, A database for speaker independent digit recognition, in Proc. ICASSP, 1984, 3, p. 42.11.
    • R.G. Leonard, "A database for speaker independent digit recognition," in Proc. ICASSP, 1984, vol. 3, p. 42.11.
  • 14
    • 0027623210 scopus 로고
    • Assessment for automatic speech recognition: Ii. noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems
    • A. Varga and H.J.M. Steeneken, "Assessment for automatic speech recognition: Ii. noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Communication, vol. 12, no. 3, pp. 247-252, 1993.
    • (1993) Speech Communication , vol.12 , Issue.3 , pp. 247-252
    • Varga, A.1    Steeneken, H.J.M.2
  • 15
    • 0003822743 scopus 로고    scopus 로고
    • Cambridge, December
    • S. Young et al., "The htk book," Cambridge, December 2006.
    • (2006) The htk book
    • Young, S.1
  • 16
    • 85009265586 scopus 로고    scopus 로고
    • Frontend postprocessing and backend model enhancement on the aurora 2.0/3.0 databases
    • C.-P. Chen, K. Filali, and J. Bilmes, "Frontend postprocessing and backend model enhancement on the aurora 2.0/3.0 databases," in ICSLP, 2002.
    • (2002) ICSLP
    • Chen, C.-P.1    Filali, K.2    Bilmes, J.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.