메뉴 건너뛰기




Volumn , Issue , 2003, Pages 255-260

TRAP-TANDEM: Data-driven extraction of temporal features from speech

Author keywords

[No Author keywords available]

Indexed keywords

SPECTRAL DENSITY; SPEECH;

EID: 84946730259     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2003.1318450     Document Type: Conference Paper
Times cited : (32)

References (32)
  • 2
    • 84863773378 scopus 로고    scopus 로고
    • Frequency-domain linear prediction for temporal features
    • St. Thomas, US Virgin Islands, (this proceedings)
    • M. Athineos and D. Ellis, "Frequency-domain linear prediction for temporal features", Proc. IEEE ASRU-2003 Workshop, St. Thomas, US Virgin Islands, 2003 (this proceedings)
    • (2003) Proc. IEEE ASRU-2003 Workshop
    • Athineos, M.1    Ellis, D.2
  • 3
    • 34548281969 scopus 로고
    • Could information theory provide an ecological theory of sensory processing?
    • J.J. Atick, "Could information theory provide an ecological theory of sensory processing?", in Network: Computation in Neural Systems, Vol. 3, pp. 213-251, 1992
    • (1992) Network: Computation in Neural Systems , vol.3 , pp. 213-251
    • Atick, J.J.1
  • 4
    • 0031619381 scopus 로고    scopus 로고
    • Maximal mutual information based reduction strategies for cross-correlation based joint distributional modeling
    • SP14.6, Seattle
    • J. Bilmes, "Maximal mutual information based reduction strategies for cross-correlation based joint distributional modeling", Proc. ICASSP98, SP14.6, Seattle, 1998
    • (1998) Proc. ICASSP98
    • Bilmes, J.1
  • 6
    • 0035097825 scopus 로고    scopus 로고
    • Spectro-temporal response fields characterization with dynamic ripples in ferret primary auditory cortex
    • D.D. Depireux, J.Z. Simon, D.J. Klein, S.S. Shamma, "Spectro-Temporal Response Fields Characterization with Dynamic Ripples in Ferret Primary Auditory Cortex", in J. Neurophysiology, Vol. 85, pp. 1220-1234, 2001
    • (2001) J. Neurophysiology , vol.85 , pp. 1220-1234
    • Depireux, D.D.1    Simon, J.Z.2    Klein, D.J.3    Shamma, S.S.4
  • 7
    • 0003549684 scopus 로고    scopus 로고
    • The ASA edition, edited by J.B. Allen, Acoust. Soc. Am., reissue of the original edition from 1953
    • H. Fletcher, Speech and hearing in communication, The ASA edition, edited by J.B. Allen, Acoust. Soc. Am., reissue of the original edition from 1953
    • Speech and Hearing in Communication
    • Fletcher, H.1
  • 9
    • 84890476103 scopus 로고    scopus 로고
    • Local averaging and differentiating of spectral plane for TRAP-based ASR
    • Geneva
    • F. Grezl and H. Hermansky, "Local averaging and differentiating of spectral plane for TRAP-based ASR", Proc. Eurospeech 2003, Geneva 2003
    • (2003) Proc. Eurospeech 2003
    • Grezl, F.1    Hermansky, H.2
  • 10
    • 0025041264 scopus 로고    scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • H. Hermansky, Perceptual linear predictive (PLP) analysis of speech, J. Acoust. Soc. Am., vol. 87, no. 4, pp. 1738-1752.
    • J. Acoust. Soc. Am. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 11
    • 0030638046 scopus 로고    scopus 로고
    • The modulation spectrum in automatic recognition of speech
    • H. Hermansky, "The Modulation Spectrum in Automatic Recognition of Speech", in 1997 IEEE Workshop on Automatic
    • 1997 IEEE Workshop on Automatic
    • Hermansky, H.1
  • 12
    • 85009254284 scopus 로고    scopus 로고
    • TRAPS classifiers of temporal patterns
    • Sydney, Australia
    • H. Hermansky and S. Sharma, "TRAPS Classifiers of Temporal Patterns", in ICSLP'98, Sydney, Australia, 1998,
    • (1998) ICSLP'98
    • Hermansky, H.1    Sharma, S.2
  • 13
    • 85009210577 scopus 로고    scopus 로고
    • Band-independent speech event categories for TRAP based ASR
    • Geneva
    • H. Hermansky and P. Jain, "Band-independent speech event categories for TRAP based ASR", Proc. Eurospeech 2003, Geneva 2003
    • (2003) Proc. Eurospeech 2003
    • Hermansky, H.1    Jain, P.2
  • 14
    • 0032139768 scopus 로고    scopus 로고
    • Should recognizers have ears?
    • 27
    • H. Hermansky, "Should recognizers have ears?", in Speech Communication, vol. 25, num. 3-27, 1998
    • (1998) Speech Communication , vol.25 , Issue.3
    • Hermansky, H.1
  • 15
    • 0007802346 scopus 로고    scopus 로고
    • Connectionist feature extraction for conventional HMM systems
    • Istanbul, Turkey
    • H. Hermansky and D.P.W. Ellis and S. Sharma, "Connectionist Feature Extraction for Conventional HMM Systems", in Proc. ICASSP'00, Istanbul, Turkey, 2000
    • (2000) Proc. ICASSP'00
    • Hermansky, H.1    Ellis, D.P.W.2    Sharma, S.3
  • 16
    • 0032658253 scopus 로고    scopus 로고
    • Temporal patterns (TRAPS) in ASR of noisy speech
    • Phoenix, Arizona, USA, Mar
    • H. Hermansky and S. Sharma, "Temporal Patterns (TRAPS) in ASR of Noisy Speech", in Proc. ICASSP'99, Phoenix, Arizona, USA, Mar, 1999
    • (1999) Proc. ICASSP'99
    • Hermansky, H.1    Sharma, S.2
  • 17
    • 85128398195 scopus 로고    scopus 로고
    • Spectral basis functions from discriminant analysis
    • Sydney, Australia
    • H. Hermansky and N. Malayath, "Spectral Basis Functions from Discriminant Analysis", in Proc. ICSLP'98, Sydney, Australia, 1998
    • (1998) Proc. ICSLP'98
    • Hermansky, H.1    Malayath, N.2
  • 18
    • 0038133932 scopus 로고
    • A statistical approach to metrics for word and syllable recognition
    • S35(A)
    • M.J. Hunt, "A statistical approach to metrics for word and syllable recognition", J. Acoust. Soc. Am., 66(S1), S35(A), 1979
    • (1979) J. Acoust. Soc. Am. , vol.66 , Issue.S1
    • Hunt, M.J.1
  • 19
    • 84946816154 scopus 로고    scopus 로고
    • www.icsi.berkeley.edu/speech/faq/ICSI-SPEECH-FAQ
  • 20
    • 84946816155 scopus 로고    scopus 로고
    • PhD. thesis, Department of Electrical and Computer Engineering, OGI School of Oregon Health & Sciences University, Portland, Oregon
    • P. Jain, PhD. thesis, Department of Electrical and Computer Engineering, OGI School of Oregon Health & Sciences University, Portland, Oregon, 2003
    • (2003)
    • Jain, P.1
  • 21
    • 84946816156 scopus 로고    scopus 로고
    • Effect of combining temporal patterns from critical-bands on ASR
    • Geneva
    • P. Jain and H. Hermansky, "Effect of combining temporal patterns from critical-bands on ASR ", Proc. Eurospeech 2003, Geneva 2003
    • (2003) Proc. Eurospeech 2003
    • Jain, P.1    Hermansky, H.2
  • 23
    • 0033911188 scopus 로고    scopus 로고
    • Robust spectro-temporal reverse correlation for auditory system: Optimizing stimulus design
    • D.J. Klein, D.A. Depireux, J.Z. Simon, S.S. Shamma," Robust spectro-temporal reverse correlation for auditory system: Optimizing stimulus design", in J. Comp. Neuroscience, Vol. 9, pp. 85-111, 2000
    • (2000) J. Comp. Neuroscience , vol.9 , pp. 85-111
    • Klein, D.J.1    Depireux, D.A.2    Simon, J.Z.3    Shamma, S.S.4
  • 24
    • 84946816157 scopus 로고    scopus 로고
    • Analysis of information in speech and its application in speech recognition
    • Brno, Czech Republic, Springer-Verlag
    • S. Kajarekar, and H. Hermansky (2000), " Analysis of information in speech and its application in speech recognition", in Proceedings of Workshop in Text, Speech and Dialogue 2000, Brno, Czech Republic, Springer-Verlag.
    • (2000) Proceedings of Workshop in Text, Speech and Dialogue 2000
    • Kajarekar, S.1    Hermansky, H.2
  • 25
    • 0032676337 scopus 로고    scopus 로고
    • On the relative importance of various components of modulation spectrum for automatic speech recognition
    • Elsevier
    • N. Kanedera, T. Arai, H. Hermansky and M. Pavel, "On the relative importance of various components of modulation spectrum for automatic speech recognition", Speech Communication, 28, (43-55), Elsevier 1999
    • (1999) Speech Communication , vol.28 , Issue.43-55
    • Kanedera, N.1    Arai, T.2    Hermansky, H.3    Pavel, M.4
  • 26
    • 0036212160 scopus 로고    scopus 로고
    • Efficient coding of natural sounds
    • M.S. Lewicki, "Efficient coding of natural sounds", Nature Neuroscience, 5(4), pp. 356-363, 2002
    • (2002) Nature Neuroscience , vol.5 , Issue.4 , pp. 356-363
    • Lewicki, M.S.1
  • 28
    • 0018617277 scopus 로고
    • Encoding of steady state vowels in the auditory nerve: Representation in terms of discharge rate
    • M. Sachs and E. Young, "Encoding of steady state vowels in the auditory nerve: representation in terms of discharge rate", J. Acoust. Soc. Am. 66, pp. 470-479, 1979
    • (1979) J. Acoust. Soc. Am , vol.66 , pp. 470-479
    • Sachs, M.1    Young, E.2
  • 30
    • 0141703361 scopus 로고    scopus 로고
    • Hierarchical tandem feature extraction
    • Orlando, Florida, USA, May
    • S. Sivadas and H. Hermansky, "Hierarchical Tandem Feature Extraction", in Proceedings ICASSP 2002, Orlando, Florida, USA, May, 2002
    • (2002) Proceedings ICASSP 2002
    • Sivadas, S.1    Hermansky, H.2
  • 31
    • 0002915083 scopus 로고    scopus 로고
    • Relevance of timefrequency features for phonetic and speaker/channel classification
    • Aug
    • H. H. Yang and S. Sharma and S. van Vuuren and H. Hermansky, "Relevance of TimeFrequency Features for Phonetic and Speaker/Channel Classification", in Speech Communication, Aug, 2000
    • (2000) Speech Communication
    • Yang, H.H.1    Sharma, S.2    Van Vuuren, S.3    Hermansky, H.4
  • 32
    • 0018606571 scopus 로고
    • Representation of steady-state vowels in the temporal aspects of the discharge patterns of population of auditory nerve fibers
    • E. Young and M. Sachs, "Representation of steady-state vowels in the temporal aspects of the discharge patterns of population of auditory nerve fibers, J. Acoust. Soc. Am 66, pp. 1381-1403, 1979.
    • (1979) J. Acoust. Soc. Am , vol.66 , pp. 1381-1403
    • Young, E.1    Sachs, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.