메뉴 건너뛰기




Volumn 11, Issue 2, 2011, Pages 23-61

An overview of speaker identification: Accuracy and robustness issues

Author keywords

[No Author keywords available]

Indexed keywords

CLEAN SPEECH; ENVIRONMENTAL NOISE; MISSING DATA METHODS; RAPID DEGRADATION; ROBUST SPEAKER IDENTIFICATION; ROBUSTNESS ISSUES; SPEAKER IDENTIFICATION; SPEAKER IDENTIFICATION PERFORMANCE; SPEAKER MODELING; SYSTEM CLASSIFICATION; TOPDOWN;

EID: 79958818321     PISSN: 1531636X     EISSN: None     Source Type: Journal    
DOI: 10.1109/MCAS.2011.941079     Document Type: Review
Times cited : (263)

References (115)
  • 3
    • 0031233424 scopus 로고    scopus 로고
    • Speaker recognition: A tutorial
    • PII S0018921997069478
    • J. Campbell, "Speaker recognition: A tutorial", Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, 1997. (Pubitemid 127745630)
    • (1997) Proceedings of the IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 4
    • 0028516097 scopus 로고
    • Text-independent speaker identification
    • H. Gish and M. Schmidt, "Text-independent speaker identification", IEEE Signal Processing Mag., vol. 11, no. 4, pp. 18-32, 1994.
    • (1994) IEEE Signal Processing Mag. , vol.11 , Issue.4 , pp. 18-32
    • Gish, H.1    Schmidt, M.2
  • 5
    • 0031223555 scopus 로고    scopus 로고
    • Recent advances in speaker recognition
    • PII S0167865597000731
    • S. Furui, "Recent advances in speaker recognition", Pattern Recognit. Lett., vol. 18, no. 9, pp. 859-872, 1997. (Pubitemid 127411229)
    • (1997) Pattern Recognition Letters , vol.18 , Issue.9 , pp. 859-872
    • Furui, S.1
  • 6
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: From features to supervectors", Speech Commun., vol. 52, no. 1, pp. 12-40, 2010.
    • (2010) Speech Commun. , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 9
    • 79958852956 scopus 로고    scopus 로고
    • Special issue on speaker recognition
    • "Special issue on speaker recognition", Digital Signal Process., vol. 10, no. 1-3, pp. 1-266, 2000.
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 1-266
  • 10
    • 79958846357 scopus 로고    scopus 로고
    • Special section on speaker and language recognition
    • "Special section on speaker and language recognition", IEEE Trans. Audio Speech Language Process., vol. 15, no. 7, pp. 1951-2115, 2007.
    • (2007) IEEE Trans. Audio Speech Language Process. , vol.15 , Issue.7 , pp. 1951-2115
  • 14
    • 85032751474 scopus 로고
    • Signal processing with higherorder spectra
    • C. L. Nikias and J. M. Mendel, "Signal processing with higherorder spectra", IEEE Signal Processing Mag., vol. 10, no. 3, pp. 10-37, 1993.
    • (1993) IEEE Signal Processing Mag. , vol.10 , Issue.3 , pp. 10-37
    • Nikias, C.L.1    Mendel, J.M.2
  • 17
    • 0030247355 scopus 로고    scopus 로고
    • Robust speaker recognition: A feature-based approach
    • R. Mammone, X. Zhang, and R. Ramachandran, "Robust speaker recognition: A feature-based approach", IEEE Signal Processing Mag., vol. 13, no. 5, 1996, pp. 58-71. (Pubitemid 126527122)
    • (1996) IEEE Signal Processing Magazine , vol.13 , Issue.5 , pp. 58-71
    • Mammone, R.J.1    Zhang, X.2    Ramachandran, R.P.3
  • 18
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification", J. Acoustic. Soc. Amer., vol. 55, p. 1304, 1974.
    • (1974) J. Acoustic. Soc. Amer. , vol.55 , pp. 1304
    • Atal, B.1
  • 19
    • 0028515984 scopus 로고
    • Experimental evaluation of features for robust speaker identification
    • D. Reynolds, "Experimental evaluation of features for robust speaker identification", IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639-643, 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 639-643
    • Reynolds, D.1
  • 21
    • 0032141206 scopus 로고    scopus 로고
    • Cepstral domain segmental feature vector normalization for noise robust speech recognition
    • PII S0167639398000338
    • O. Viikki and K. Laurila, "Cepstral domain segmental feature vector normalization for noise robust speech recognition", Speech Commun., vol. 25, no. 1-3, pp. 133-147, 1998. (Pubitemid 128413638)
    • (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 133-147
    • Viikki, O.1    Laurila, K.2
  • 22
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • DOI 10.1006/dspr.1999.0361
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models", Digital Signal Process., vol. 10, no. 1-3, pp. 19-41, 2000. (Pubitemid 30592166)
    • (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 24
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. Reynolds and R. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models", IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 25
    • 0029355999 scopus 로고
    • Speaker identification and verification using Gaussian mixture speaker models
    • D. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models", Speech Commun., vol. 17, no. 1-2, pp. 91-108, 1995.
    • (1995) Speech Commun. , vol.17 , Issue.1-2 , pp. 91-108
    • Reynolds, D.1
  • 26
    • 33645887246 scopus 로고    scopus 로고
    • Support vector machines using GMM supervectors for speaker verification
    • W. Campbell, D. Sturim, and D. Reynolds, "Support vector machines using GMM supervectors for speaker verification", IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-311, 2006.
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.5 , pp. 308-311
    • Campbell, W.1    Sturim, D.2    Reynolds, D.3
  • 30
    • 0036753895 scopus 로고    scopus 로고
    • Text-independent speaker verification using utterance level scoring and covariance modeling
    • DOI 10.1109/TSA.2002.803419
    • R. Zilca, "Text-independent speaker verification using utterance level scoring and covariance modeling", IEEE Trans. Speech Audio Process., vol. 10, no. 6, pp. 363-370, 2002. (Pubitemid 35311930)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.6 , pp. 363-370
    • Zilca, R.D.1
  • 31
    • 27144489164 scopus 로고    scopus 로고
    • A tutorial on support vector machines for pattern recognition
    • C. Burges, "A tutorial on support vector machines for pattern recognition", Data Mining Knowl. Discov., vol. 2, no. 2, pp. 121-167, 1998. (Pubitemid 128695475)
    • (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.2 , pp. 121-167
    • Burges, C.J.C.1
  • 33
    • 29044444825 scopus 로고    scopus 로고
    • Support vector machines for speaker and language recognition
    • DOI 10.1016/j.csl.2005.06.003, PII S0885230805000318, Odyssey 2004: The Speaker and Language Recognition Workshop Odyssey-04
    • W. Campbell, J. Campbell, D. Reynolds, E. Singer, and P. Torres-Carrasquillo, "Support vector machines for speaker and language recognition", Comput. Speech Lang., vol. 20, no. 2-3, pp. 210-229, 2006. (Pubitemid 41787537)
    • (2006) Computer Speech and Language , vol.20 , Issue.SPEC. ISS. , pp. 210-229
    • Campbell, W.M.1    Campbell, J.P.2    Reynolds, D.A.3    Singer, E.4    Torres-Carrasquillo, P.A.5
  • 36
    • 33746593716 scopus 로고    scopus 로고
    • SVM and kernel methods Matlab toolbox
    • INSA de Rouen, Rouen, France Online. Available
    • S. Canu, Y. Grandvalet, V. Guigue, and A. Rakotomamonjy. (2005). SVM and kernel methods Matlab toolbox. Perception Systèmes et Information, INSA de Rouen, Rouen, France [Online]. Available: http://asi.insa-rouen.fr/ enseignants/arakotom/toolbox/
    • (2005) Perception Systèmes et Information
    • Canu, S.1    Grandvalet, Y.2    Guigue, V.3    Rakotomamonjy, A.4
  • 40
    • 0016939145 scopus 로고
    • Automatic recognition of speakers from their voices
    • B. S. Atal, "Automatic recognition of speakers from their voices", Proc. IEEE, vol. 64, no. 4, pp. 460-475, 1976. (Pubitemid 8019233)
    • (1976) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 460-475
    • Atal, B.S.1
  • 45
    • 0029269512 scopus 로고
    • A comparative study of robust linear predictive a nalysis methods with applications to speaker identification
    • R. P. Ramachandran, M. S. Zilov ic, and R. J. Mammone, "A comparative study of robust linear predictive a nalysis methods with applications to speaker identification", IEEE Trans. Speech Audio Process., vol. 3, no. 2, pp. 117-125, 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.2 , pp. 117-125
    • Ramachandran, R.P.1    Ic, M.S.Z.2    Mammone, R.J.3
  • 46
    • 0028517648 scopus 로고
    • New lp-derived features for speaker identification
    • K. T. Assaleh and R. J. Mammone, "New lp-derived features for speaker identification", IEEE Trans. Speec h Audio Process., vol. 2, no. 4, pp. 630-638, 1994.
    • (1994) IEEE Trans. Speec H Audio Process. , vol.2 , Issue.4 , pp. 630-638
    • Assaleh, K.T.1    Mammone, R.J.2
  • 47
    • 0032075135 scopus 로고    scopus 로고
    • Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer functions
    • PII S1063667698029010
    • M. S. Zilovic, R. P. Ramachandran, and R. J. Mammone, "Speaker identification based on the use of robust cepstral features obtained from pole-zero transfer functions", IEEE Trans. Speech Audio Process., vol. 6, no. 3, 1998, pp. 260-267. (Pubitemid 128720651)
    • (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.3 , pp. 260-267
    • Zilovic, M.S.1    Ramachandran, R.P.2    Mammone, R.J.3
  • 50
  • 51
    • 0033746018 scopus 로고    scopus 로고
    • Robustness to telephone handset distortion in spea ker recognition by discriminative feature design
    • L. P. Heck, Y. Konig, M. K. Snmez, and M. Weintraub, "Robustness to telephone handset distortion in spea ker recognition by discriminative feature design", Speech Commun., vol. 31, no. 2-3, pp. 181-192, 2000.
    • (2000) Speech Commun. , vol.31 , Issue.2-3 , pp. 181-192
    • Heck, L.P.1    Konig, Y.2    Snmez, M.K.3    Weintraub, M.4
  • 52
    • 79958833298 scopus 로고    scopus 로고
    • Speaker identification using higher order spectral phase featur es and their effectiveness vis-avis mel-cepstral features
    • Berlin: Springer-Verlag
    • V. Chandran, D. Ning, and S. Sridharan, "Speaker identification using higher order spectral phase featur es and their effectiveness vis-avis mel-cepstral features", in Biometric Authentication. Berlin: Springer-Verlag, 2004, vol. 3072, pp. 1-20.
    • (2004) Biometric Authentication , vol.3072 , pp. 1-20
    • Chandran, V.1    Ning, D.2    Sridharan, S.3
  • 54
    • 85075924869 scopus 로고    scopus 로고
    • Comparison of background normalization methods for text-independent speaker verificatio n
    • Rhodes, Greece
    • D. A. Reynolds, "Comparison of background normalization methods for text-independent speaker verificatio n", in Proc. European Conf. Speech Communication Technology (Eurospeech), Rhodes, Greece, 1997, pp. 963-966.
    • (1997) Proc. European Conf. Speech Communication Technology (Eurospeech) , pp. 963-966
    • Reynolds, D.A.1
  • 55
    • 0033884857 scopus 로고    scopus 로고
    • Score normalization for text-independent speaker verification systems
    • DOI 10.1006/dspr.1999.0360
    • R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems", Digital Signal Process., vol. 10, no. 1-3, pp. 42-54, 2000. (Pubitemid 30592165)
    • (2000) Digital Signal Processing: A Review Journal , vol.10 , Issue.1 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 57
    • 0030366901 scopus 로고    scopus 로고
    • Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models
    • Philadelphia, PA
    • K. P. Markov and S. Nakagawa, "Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture models", in Proc. Int. Conf. Spoken Language Processing (ICSLP), Philadelphia, PA, 1996, pp. 1764-1767.
    • (1996) Proc. Int. Conf. Spoken Language Processing (ICSLP) , pp. 1764-1767
    • Markov, K.P.1    Nakagawa, S.2
  • 58
    • 21444457842 scopus 로고    scopus 로고
    • Text-independent speaker identification using gmm-ubm and frame level lik elihood normalization
    • Z. Rong, Z. Shuwu, and X. Bo, "Text-independent speaker identification using gmm-ubm and frame level lik elihood normalization", in Proc. Int. Symp. Chinese Spoken Language Process., 2004, pp. 289-292.
    • (2004) Proc. Int. Symp. Chinese Spoken Language Process. , pp. 289-292
    • Rong, Z.1    Shuwu, Z.2    Bo, X.3
  • 64
    • 0032595177 scopus 로고    scopus 로고
    • Robust text-independent speaker identification over telephone channels
    • H. A. Murthy, F. Beaufays, L. P. Heck, and M. Weintraub, "Robust text-independent speaker identification over telephone channels", IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 554-568, 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.5 , pp. 554-568
    • Murthy, H.A.1    Beaufays, F.2    Heck, L.P.3    Weintraub, M.4
  • 66
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speech recognition using parallel model combination
    • PII S1063667696067120
    • M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination", IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352-359, 1996. (Pubitemid 126753023)
    • (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 69
    • 0000652102 scopus 로고
    • Some solutions to the missing feature problem in vision
    • San Mateo, CA: Morgan Kaufmann
    • S. Ahmed and V. Tresp, "Some solutions to the missing feature problem in vision", in Advances in Neural Information Processing Systems 5. San Mateo, CA: Morgan Kaufmann, 1993, pp. 393-400.
    • (1993) Advances in Neural. Information Processing Systems 5 , pp. 393-400
    • Ahmed, S.1    Tresp, V.2
  • 71
    • 0035478859 scopus 로고    scopus 로고
    • The auditory organization of speech and other sources in listeners and computational models
    • DOI 10.1016/S0167-6393(00)00078-9, PII S0167639300000789
    • M. Cooke and D. P. W. Ellis, "The auditory organization of speech and other sources in listeners and com putational models", Speech Commun., vol. 35, no. 3-4, pp. 141-177, 2001. (Pubitemid 32922990)
    • (2001) Speech Communication , vol.35 , Issue.3-4 , pp. 141-177
    • Cooke, M.1    Ellis, D.P.W.2
  • 72
    • 85032752225 scopus 로고    scopus 로고
    • Missing-feature approaches in speech recognition
    • DOI 10.1109/MSP.2005.1511828
    • B. Raj and R. M. Stern, "Missing-feature approaches in speech recognition", IEEE Signal Processing Mag., vol. 22, no. 5, pp. 101-116, 2005. (Pubitemid 41488524)
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 101-116
    • Raj, B.1    Stern, R.M.2
  • 74
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data", Speech Commun., vol. 34, no. 3, pp. 267-285, 2001. (Pubitemid 32284867)
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 76
    • 85009104896 scopus 로고    scopus 로고
    • Reconstruction of damaged spectrographic features for robust sp eech recognition
    • B. Raj, M. L. Seltzer, and R. M. Stern, "Reconstruction of damaged spectrographic features for robust sp eech recognition", in Proc. Int. Conf. Spoken Language Processing (ICSLP), 2000, vol. 1, pp. 357-360.
    • (2000) Proc. Int. Conf. Spoken Language Processing (ICSLP) , vol.1 , pp. 357-360
    • Raj, B.1    Seltzer, M.L.2    Stern, R.M.3
  • 79
    • 84892151303 scopus 로고    scopus 로고
    • Some solutions to the missing feature problem in data class ification, with application to noise robust asr
    • A. C. Morris, M. P. Cooke, and P. D. Green, "Some solutions to the missing feature problem in data class ification, with application to noise robust asr", in Proc. IEEE Int. Conf. Acoustics Speech Signal Processing (ICASSP), 1998, vol. 2, pp. 737-740.
    • (1998) Proc. IEEE Int. Conf. Acoustics Speech Signal Processing (ICASSP) , vol.2 , pp. 737-740
    • Morris, A.C.1    Cooke, M.P.2    Green, P.D.3
  • 83
    • 79958820946 scopus 로고    scopus 로고
    • Sub-band partitioning for full covariance based missing data speaker recognition
    • Advances in Information and Systems Sciences Series
    • D. Pullella, M. Kuhne, and R. Togneri, "Sub-band partitioning for full covariance based missing data speaker recognition", Int. J. Inform. Syst. Sci., (Advances in Information and Systems Sciences Series), vol. 3, no. 3-4, pp. 641-648, 2009.
    • (2009) Int. J. Inform. Syst. Sci. , vol.3 , Issue.3-4 , pp. 641-648
    • Pullella, D.1    Kuhne, M.2    Togneri, R.3
  • 84
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction", IEEE Trans. Acoust. Speech Sig nal Process., vol. 27, no. 2, pp. 113-120, 1979. (Pubitemid 9467471)
    • (1979) IEEE Trans Acoust Speech Signal Process , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll Steven, F.1
  • 87
    • 16344396527 scopus 로고    scopus 로고
    • Using missing feature theory to actively select features for robust speech re cognition with interruptions, filtering and noise
    • Rhodes, Greece, 1997
    • R. Lippmann and B. A. Carlson, "Using missing feature theory to actively select features for robust speech re cognition with interruptions, filtering and noise", in Proc. European Conf. Speech Communication Technology (Eurospeech), Rhodes, Greece, 1997, pp. 37-40.
    • Proc. European Conf. Speech Communication Technology (Eurospeech) , pp. 37-40
    • Lippmann, R.1    Carlson, B.A.2
  • 88
    • 11144343436 scopus 로고    scopus 로고
    • Detection of reliable features for speech recognition in noisy conditions using a statistical criterion
    • Aalborg, Denmark
    • P. Renevey and A. Drygajlo, "Detection of reliable features for speech recognition in noisy conditions using a statistical criterion", in Proc. Consistent and Reliable Acoustic Cues (CRAC) Workshop, Aalborg, Denmark, 2001.
    • (2001) Proc. Consistent and Reliable Acoustic Cues (CRAC) Workshop
    • Renevey, P.1    Drygajlo, A.2
  • 92
    • 85009106519 scopus 로고    scopus 로고
    • Robust asr based on clean speech models: An evaluation of missing data
    • Aalborg, Denmark
    • J. Barker, M. Cooke, and P. Green, "Robust asr based on clean speech models: An evaluation of missing data", in Proc. European Signal Process. Conf. (EUSIPCO), Aalborg, Denmark, 2001, pp. 213-216.
    • (2001) Proc. European Signal Process. Conf. (EUSIPCO) , pp. 213-216
    • Barker, J.1    Cooke, M.2    Green, P.3
  • 94
    • 4644265990 scopus 로고    scopus 로고
    • Monaural speech segregation based on pitch tracking and amplitude modulation
    • G. Hu and D. Wang, "Monaural speech segregation based on pitch tracking and amplitude modulation", IEEE Trans. Neural Networks, vol. 15, no. 5, pp. 1135-1150, 2004.
    • (2004) IEEE Trans. Neural. Networks , vol.15 , Issue.5 , pp. 1135-1150
    • Hu, G.1    Wang, D.2
  • 96
    • 33947274099 scopus 로고    scopus 로고
    • Estimation of voicing-character of speech spectra based on spectral shape
    • DOI 10.1109/LSP.2006.881517
    • P. Jančovič and M. Köküer, "Estimation of voicing-character of speech spectra based on spectral shape", IEEE Signal Process. Lett., vol. 14, no. 1, pp. 66-69, 2007. (Pubitemid 46431336)
    • (2007) IEEE Signal Processing Letters , vol.14 , Issue.1 , pp. 66-69
    • Jancovic, P.1    Kokuer, M.2
  • 97
    • 84863769359 scopus 로고    scopus 로고
    • Employment of voicing information of speech spectra for noise-robust speaker identification
    • Poznan, Poland
    • P. Jancovič and M. Köküer, "Employment of voicing information of speech spectra for noise-robust speaker identification", in Proc. European Signal Process. Conf. (EUSIPCO), Poznan, Poland, 2007.
    • (2007) Proc. European Signal Process. Conf. (EUSIPCO)
    • Jancovič, P.1    Köküer, M.2
  • 98
    • 2942539074 scopus 로고    scopus 로고
    • Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
    • K. J. Palomäki, G. J. Brown, and J. P. Barker, "Techniques for handling convolutional distortion with 'missing data' automatic speech recognition", S peech Commun., vol. 43, no. 1-2, pp. 123-142, 2004.
    • (2004) S Peech Commun. , vol.43 , Issue.1-2 , pp. 123-142
    • Palomäki, K.J.1    Brown, G.J.2    Barker, J.P.3
  • 99
    • 4644304197 scopus 로고    scopus 로고
    • A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation
    • K. J. Palomäki, G. J. Brown, and D. Wang, "A binaural processor for missing data speech recognition in the presence of noise and small-room reverberation", S peech Commun., vol. 43, no. 4, pp. 361-378, 2004.
    • (2004) S Peech Commun. , vol.43 , Issue.4 , pp. 361-378
    • Palomäki, K.J.1    Brown, G.J.2    Wang, D.3
  • 100
    • 4644317224 scopus 로고    scopus 로고
    • A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition
    • M. L. Seltzer, B. Raj, and R. M. Stern, "A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition", Speech Commun., vol. 43, no. 4, pp. 379-393, 2004.
    • (2004) Speech Commun. , vol.43 , Issue.4 , pp. 379-393
    • Seltzer, M.L.1    Raj, B.2    Stern, R.M.3
  • 101
    • 85009089485 scopus 로고    scopus 로고
    • Classifier-based mask estimation for missing feature methods of robust speech recognition
    • Beijing, China
    • M. L. Seltzer, B. Raj, and R. M. Stern, "Classifier-based mask estimation for missing feature methods of robust speech recognition", in Proc. Int. Conf. Spoken Langu age Processing (ICSLP), Beijing, China, 2000, vol. 3, pp. 538-541.
    • (2000) Proc. Int. Conf. Spoken Langu Age Processing (ICSLP) , vol.3 , pp. 538-541
    • Seltzer, M.L.1    Raj, B.2    Stern, R.M.3
  • 102
    • 33745200501 scopus 로고    scopus 로고
    • Environment-independent mask estimation for missing-feature reconstruction
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • W. Kim, R. M. Stern, and H. Ko, "Environment-independent mask estimation for missing-feature reconstructi on", in Proc. European Conf. Speech Communication Technology (Interspeech), 2005, pp. 2637-2640. (Pubitemid 43908637)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 2637-2640
    • Kim, W.1    Stern, R.M.2    Ko, H.3
  • 103
    • 33947703708 scopus 로고    scopus 로고
    • Band-independent mask estimation for missing-feature reconstruction in the pr esence of unknown background noise
    • W. Kim and R. M. Stern, "Band-independent mask estimation for missing-feature reconstruction in the pr esence of unknown background noise", in Proc. IEEE Int. Conf. Acoustics Speech Signal Processing (ICASSP), 2006, vol. 1, pp. 305-308.
    • (2006) Proc. IEEE Int. Conf. Acoustics Speech Signal Processing (ICASSP) , vol.1 , pp. 305-308
    • Kim, W.1    Stern, R.M.2
  • 104
    • 48149090146 scopus 로고    scopus 로고
    • Estimating single-channel source separation masks: Relevance vector machine cl assifiers vs. pitch-based masking
    • R. Weiss and D. Ellis, "Estimating single-channel source separation masks: Relevance vector machine cl assifiers vs. pitch-based masking", in Proc. Workshop Statistical Perceptual Audition (SAPA), 2006, pp. 31-36.
    • (2006) Proc. Workshop Statistical Perceptual Audition (SAPA) , pp. 31-36
    • Weiss, R.1    Ellis, D.2
  • 105
    • 0037767686 scopus 로고    scopus 로고
    • A multipitch tracking algorithm for noisy speech
    • M. Wu, D. Wang, and G. J. Brown, "A multipitch tracking algorithm for noisy speech", IEEE Trans. Speec h Audio Process., vol. 11, no. 3, pp. 229-241, 2003.
    • (2003) IEEE Trans. Speec H Audio Process. , vol.11 , Issue.3 , pp. 229-241
    • Wu, M.1    Wang, D.2    Brown, G.J.3
  • 106
    • 33847629729 scopus 로고    scopus 로고
    • On noise masking for automatic missing data speech recognition: A survey and discussion
    • DOI 10.1016/j.csl.2006.08.001, PII S0885230806000301
    • C. Cerisara, S. Demange, and J. Haton, "On noise masking for automatic missing data speech recognition: A survey and discussion", Comput. Speech Lang., vol. 21, no. 3, pp. 443-457, 2007. (Pubitemid 46367508)
    • (2007) Computer Speech and Language , vol.21 , Issue.3 , pp. 443-457
    • Cerisara, C.1    Demange, S.2    Haton, J.-P.3
  • 107
    • 85009113185 scopus 로고    scopus 로고
    • Active perception: Using a priori knowledge from clean speech models to ign ore non-target features
    • B. Cranen and J. de Veth, "Active perception: Using a priori knowledge from clean speech models to ign ore non-target features", in Proc. Int. Conf. Spoken Language Process. (ICSLP), 2004.
    • (2004) Proc. Int. Conf. Spoken Language Process. (ICSLP)
    • Cranen, B.1    De Veth, J.2
  • 109
    • 0036754943 scopus 로고    scopus 로고
    • Robust speech recognition using probabilistic union models
    • DOI 10.1109/TSA.2002.803439
    • J. Ming, P. Jančovič, and F. J. Smith, "Robust speech recognition using probabilistic union models", I EEE Trans. Speech Audio Process., vol. 10, no. 6, pp. 403-414, 2002. (Pubitemid 35311934)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.6 , pp. 403-414
    • Ming, J.1    Jancovic, P.2    Smith, F.J.3
  • 112
    • 79958841880 scopus 로고    scopus 로고
    • On the mask modeling and feature representation in the missing-feature asr: Evaluation on the consonant challenge
    • Brisbane, Australia
    • P. Jančovič and M. Köküer, "On the mask modeling and feature representation in the missing-feature asr: Evaluation on the consonant challenge", in Proc. European Conf. Speech Communication Technology (Interspeech), Brisbane, Australia, 2008, pp. 1777-1780.
    • (2008) Proc. European Conf. Speech Communication Technology (Interspeech) , pp. 1777-1780
    • Jančovič, P.1    Köküer, M.2
  • 114
    • 51449093807 scopus 로고    scopus 로고
    • Integrating bottom-up and topdown constraints to achieve robust asr: The multisource decoder
    • Aalborg, Denmark
    • J. Barker, M. Cooke, and D. Ellis, "Integrating bottom-up and topdown constraints to achieve robust asr: The multisource decoder", in Proc. Consistent and Reliable Acoustic Cues (CRAC) Workshop, Aalborg, Denmark, 2001, pp. 63-66.
    • (2001) Proc. Consistent and Reliable Acoustic Cues (CRAC) Workshop , pp. 63-66
    • Barker, J.1    Cooke, M.2    Ellis, D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.