SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 1781-1785

Evaluating speech features with the minimal-pair ABX task: Analysis of the classical MFC/PLP pipeline

(6) Schatz, Thomas a,b Peddinti, Vijayaditya c Bach, Francis b Jansen, Aren c Hermansky, Hynek c Dupoux, Emmanuel a

a PSL RESEARCH UNIVERSITY (France)

b ECOLE NORMALE SUPÉRIEURE (France)

c JOHNS HOPKINS UNIVERSITY (United States)

Author keywords

Evaluation framework; Minimal pair ABX task; Speech representations; Zero resource

Indexed keywords

SIGNAL PROCESSING;

DISCRIMINATION TASKS; EVALUATION FRAMEWORK; MINIMAL-PAIR ABX TASK; SPEECH FEATURES; ZERO-RESOURCE;

PIPELINES;

EID: 84906230757 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (141)

References (19)

1
- 84865767134
- Rapid evaluation of speech representations for spoken term discovery
- M. A. Carlin, S. Thomas, A. Jansen, and H. Hermansky, "Rapid evaluation of speech representations for spoken term discovery, " in Proceedings of Interspeech, 2011.
- (2011) Proceedings of Interspeech
- Carlin, M.A.¹ Thomas, S.² Jansen, A.³ Hermansky, H.⁴

2
- 84055212007
- Sparse multilayer perceptron for phoneme recognition
- G. Sivaram and H. Hermansky, "Sparse multilayer perceptron for phoneme recognition, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, pp. 23-29, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 23-29
- Sivaram, G.¹ Hermansky, H.²

3
- 84855980817
- New nonsense syllables database analyses and preliminary ASR experiments
- P. Fousek, P. Svojanovsky, F. Grezl, and H. Hermansky, "New nonsense syllables database analyses and preliminary asr experiments, " in Proceedings of the International Conference on Spoken Language Processing (ICSLP), 2004, pp. 2004-29.
- (2004) Proceedings of the International Conference on Spoken Language Processing (ICSLP) , pp. 2004-2029
- Fousek, P.¹ Svojanovsky, P.² Grezl, F.³ Hermansky, H.⁴

4
- 84918876952
- Lawrence Erlbaum
- N. A. Macmillan and C. D. Creelman, Detection theory: A user's guide. Lawrence Erlbaum, 2004.
- (2004) Detection Theory: A User's Guide
- MacMillan, N.A.¹ Creelman, C.D.²

5
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech, " The Journal of the Acoustical Society of America, vol. 87, pp. 1738-1752, 1990.
- (1990) The Journal of the Acoustical Society of America , vol.87 , pp. 1738-1752
- Hermansky, H.¹

6
- 0038133939
- Distance measures for speech recognition, psychological and instrumental
- P. Mermelstein, "Distance measures for speech recognition, psychological and instrumental, " Pattern recognition and artificial intelligence, vol. 116, pp. 91-103, 1976.
- (1976) Pattern Recognition and Artificial Intelligence , vol.116 , pp. 91-103
- Mermelstein, P.¹

7
- 79955978656
- ASR systems in noisy environment: Analysis and solutions for increasing noise robustness
- J. Rajnoha and P. Pollak, "ASR systems in noisy environment: Analysis and solutions for increasing noise robustness, " Radioengineering, vol. 20, no. 1, pp. 74-84, 2011.
- (2011) Radioengineering , vol.20 , Issue.1 , pp. 74-84
- Rajnoha, J.¹ Pollak, P.²

8
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech, " IEEE Transactions on Speech and Audio Processing, vol. 2, no. 4, pp. 578-589, 1994.
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

9
- 84867214871
- (and MFCC, and inversion) in Matlab, [Online]. Available
- D. P. W. Ellis, "PLP and RASTA (and MFCC, and inversion) in Matlab, " 2005. [Online]. Available: http://www.ee.columbia.edu/dpwe/resources/ matlab/rastamat/.
- (2005) PLP and RASTA
- Ellis, D.P.W.¹

10
- 84890488932
- A summary of the 2012 JH CLSP workshop on zero resource speech technologies and models of early language acquisition
- A. Jansen, E. Dupoux, S. Goldwater, M. Johnson, S. Khudanpur, K. Church, N. Feldman, H. Hermansky, F. Metze, R. Rose, M. Seltzer, P. Clark, I. McGraw, B. Varadarajan, E. Bennett, B. Borschinger, J. Chiu, E. Dunbar, A. Fourtassi, D. Harwath, C.-y. Lee, K. Levin, A. Norouzian, V. Peddinti, R. Richardson, T. Schatz, and S. Thomas, "A summary of the 2012 JH CLSP workshop on zero resource speech technologies and models of early language acquisition, " in Proceedings of ICASSP 2013, 2013.
- (2013) Proceedings of ICASSP 2013
- Jansen, A.¹ Dupoux, E.² Goldwater, S.³ Johnson, M.⁴ Khudanpur, S.⁵ Church, K.⁶ Feldman, N.⁷ Hermansky, H.⁸ Metze, F.⁹ Rose, R.¹⁰ Seltzer, M.¹¹ Clark, P.¹² McGraw, I.¹³ Varadarajan, B.¹⁴ Bennett, E.¹⁵ Borschinger, B.¹⁶ Chiu, J.¹⁷ Dunbar, E.¹⁸ Fourtassi, A.¹⁹ Harwath, D.²⁰ more..

11
- 70450218182
- Static and dynamic modulation spectrum for speech recognition
- S. Ganapathy, S. Thomas, and H. Hermansky, "Static and dynamic modulation spectrum for speech recognition, " in Proceedings of Interspeech, 2009.
- (2009) Proceedings of Interspeech
- Ganapathy, S.¹ Thomas, S.² Hermansky, H.³

12
- 77950593007
- Unsupervised learning of acoustic sub-word units
- B. Varadarajan, S. Khudanpur, and E. Dupoux, "Unsupervised learning of acoustic sub-word units, " in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, 2008, pp. 165- 168.
- (2008) Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers , pp. 165-168
- Varadarajan, B.¹ Khudanpur, S.² Dupoux, E.³

13
- 84878421054
- Intrinsic spectral analysis for zero and high resource speech recognition
- A. Jansen, S. Thomas, and H. Hermansky, "Intrinsic spectral analysis for zero and high resource speech recognition, " in Proceedings of Interspeech, 2012.
- (2012) Proceedings of Interspeech
- Jansen, A.¹ Thomas, S.² Hermansky, H.³

14
- 84867809023
- A nonparametric Bayesian approach to acoustic model discovery
- C.-y. Lee and J. Glass, "A nonparametric bayesian approach to acoustic model discovery, " in Proceedings of ACL, 2012.
- (2012) Proceedings of ACL
- Lee, C.-Y.¹ Glass, J.²

15
- 25444478852
- A functional model of neural activity patterns and auditory images
- R. D. Patterson and J. Holdsworth, "A functional model of neural activity patterns and auditory images, " Advances in speech, hearing and language processing, vol. 3, pp. 547-563, 1996.
- (1996) Advances in Speech, Hearing and Language Processing , vol.3 , pp. 547-563
- Patterson, R.D.¹ Holdsworth, J.²

16
- 79251542316
- A computational model of filtering, detection, and compression in the cochlea
- R. Lyon, "A computational model of filtering, detection, and compression in the cochlea, " in IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP'82., vol. 7, 1982, pp. 1282-1285.
- (1982) IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP'82 , vol.7 , pp. 1282-1285
- Lyon, R.¹

17
- 30244474384
- A joint synchrony/mean-rate model of auditory speech processing
- S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing, " in Readings in speech recognition, 1990, pp. 101- 111.
- (1990) Readings in Speech Recognition , pp. 101-111
- Seneff, S.¹

18
- 23744508888
- Multiresolution spectrotemporal analysis of complex sounds
- T. Chi, P. Ru, and S. A. Shamma, "Multiresolution spectrotemporal analysis of complex sounds, " The Journal of the Acoustical Society of America, vol. 118, pp. 887-906, 2005.
- (2005) The Journal of the Acoustical Society of America , vol.118 , pp. 887-906
- Chi, T.¹ Ru, P.² Shamma, S.A.³

19
- 26244461684
- Clustering with Bregman divergences
- A. Banerjee, S. Merugu, I. S. Dhillon, and J. Ghosh, "Clustering with bregman divergences, " The Journal of Machine Learning Research, vol. 6, pp. 1705-1749, 2005.
- (2005) The Journal of Machine Learning Research , vol.6 , pp. 1705-1749
- Banerjee, A.¹ Merugu, S.² Dhillon, I.S.³ Ghosh, J.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.