메뉴 건너뛰기




Volumn 42, Issue 3, 2009, Pages 487-494

Investigation on LP-residual representations for speaker identification

Author keywords

Feature extraction; LP residue; Non linear speech processing; Speaker identification

Indexed keywords

ARGON; DATABASE SYSTEMS; FILTER BANKS; FUSION REACTIONS; KETONES; LOUDSPEAKERS; PROGRAMMING THEORY; SPEECH; SPEECH PROCESSING; SPEECH RECOGNITION; STATISTICAL METHODS;

EID: 54549099008     PISSN: 00313203     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patcog.2008.08.008     Document Type: Article
Times cited : (34)

References (36)
  • 1
    • 0036947697 scopus 로고    scopus 로고
    • Learning statistically efficient features for speaker recognition
    • Jang G.J., Lee T.L., and Oh Y.H. Learning statistically efficient features for speaker recognition. Neurocomputing 49 (2002) 329-348
    • (2002) Neurocomputing , vol.49 , pp. 329-348
    • Jang, G.J.1    Lee, T.L.2    Oh, Y.H.3
  • 2
    • 54549086860 scopus 로고    scopus 로고
    • R.E. Slyh, E.G. Hansen, T.R. Anderson, Glottal modeling and closed-phase analysis for speaker recognition, in: Proceedings of the ISCA Tutorial and Research Workshop on Speaker and Language Recognition (Odyssey'04), 2004, pp. 315-322.
    • R.E. Slyh, E.G. Hansen, T.R. Anderson, Glottal modeling and closed-phase analysis for speaker recognition, in: Proceedings of the ISCA Tutorial and Research Workshop on Speaker and Language Recognition (Odyssey'04), 2004, pp. 315-322.
  • 3
    • 54549083512 scopus 로고    scopus 로고
    • L. Mary, K. Sri Rama Murty, S.R. Mahadeva Prasanna, B. Yegnanaraya, Features for speaker and language identification, in: Proceedings of the ISCA Tutorial and Research Workshop on Speaker and Language Recognition (Odyssey'04), 2004, pp. 323-328.
    • L. Mary, K. Sri Rama Murty, S.R. Mahadeva Prasanna, B. Yegnanaraya, Features for speaker and language identification, in: Proceedings of the ISCA Tutorial and Research Workshop on Speaker and Language Recognition (Odyssey'04), 2004, pp. 323-328.
  • 4
    • 26844446463 scopus 로고    scopus 로고
    • J. Ortega, et al., Ahumada: a large speech corpus in Spanish for speaker identification and verification, in: Proceedings of the IEEE ICASSP'98, vol. 2, 1998, pp. 773-775.
    • J. Ortega, et al., Ahumada: a large speech corpus in Spanish for speaker identification and verification, in: Proceedings of the IEEE ICASSP'98, vol. 2, 1998, pp. 773-775.
  • 5
    • 0015112070 scopus 로고
    • Speech analysis and synthesis by linear prediction of speech wave
    • Atal B.S., and Hanauer S.L. Speech analysis and synthesis by linear prediction of speech wave. J. Acoust. Soc. Am. 50 (1971) 637-655
    • (1971) J. Acoust. Soc. Am. , vol.50 , pp. 637-655
    • Atal, B.S.1    Hanauer, S.L.2
  • 7
    • 54549124817 scopus 로고    scopus 로고
    • G. Kubin, Nonlinear processing of speech, in: W.B. Kleijn, K.K. Paliwal (Eds.), Speech Coding and Synthesis, 1995, pp. 557-610.
    • G. Kubin, Nonlinear processing of speech, in: W.B. Kleijn, K.K. Paliwal (Eds.), Speech Coding and Synthesis, 1995, pp. 557-610.
  • 8
    • 0029356550 scopus 로고
    • Usefulness of the LPC-residue in text-independent speaker verification
    • Thevenaz P., and Hügli H. Usefulness of the LPC-residue in text-independent speaker verification. Speech Commun. 17 1-2 (1995) 145-157
    • (1995) Speech Commun. , vol.17 , Issue.1-2 , pp. 145-157
    • Thevenaz, P.1    Hügli, H.2
  • 9
    • 85062118122 scopus 로고    scopus 로고
    • Speaker recognition using residual signal of linear and nonlinear prediction models
    • Faundez M., and Rodriguez D. Speaker recognition using residual signal of linear and nonlinear prediction models. ICSLP 2 (1998) 121-124
    • (1998) ICSLP , vol.2 , pp. 121-124
    • Faundez, M.1    Rodriguez, D.2
  • 10
    • 0034856452 scopus 로고    scopus 로고
    • B. Yegnanaraya, K.S. Reddy, S.P. Kishore, Source and system features for speaker recognition using AANN models, in: Proceedings of the IEEE ICASSP, 2001, pp. 409-412.
    • B. Yegnanaraya, K.S. Reddy, S.P. Kishore, Source and system features for speaker recognition using AANN models, in: Proceedings of the IEEE ICASSP, 2001, pp. 409-412.
  • 11
    • 33748443739 scopus 로고    scopus 로고
    • Extraction of speaker-specific excitation from linear prediction residual of speech
    • Mahadeva Prasanna S.R., Gupta C.S., and Yegnanaraya B. Extraction of speaker-specific excitation from linear prediction residual of speech. Speech Commun. 48 (2006) 1243-1261
    • (2006) Speech Commun. , vol.48 , pp. 1243-1261
    • Mahadeva Prasanna, S.R.1    Gupta, C.S.2    Yegnanaraya, B.3
  • 12
    • 54549105576 scopus 로고    scopus 로고
    • N. Zheng, T. Lee, P.C. Ching, Integration of complementary acoustic features for speaker recognition, IEEE Signal Process. Lett., 2006.
    • N. Zheng, T. Lee, P.C. Ching, Integration of complementary acoustic features for speaker recognition, IEEE Signal Process. Lett., 2006.
  • 13
    • 26844491755 scopus 로고    scopus 로고
    • A. Esposito, M. Marinaro, Some notes on nonlinearities of speech, in: G. Chollet, et al. (Eds.), Nonlinear Speech Modeling, Lecture Notes in Artificial Intelligence, vol. 3445, 2005, pp. 1-4.
    • A. Esposito, M. Marinaro, Some notes on nonlinearities of speech, in: G. Chollet, et al. (Eds.), Nonlinear Speech Modeling, Lecture Notes in Artificial Intelligence, vol. 3445, 2005, pp. 1-4.
  • 14
    • 54549109198 scopus 로고    scopus 로고
    • S. McLaughlin, S. Hovell, A. Lowry, Identification of nonlinearities in vowel generation, in: Proceedings of the EUSIPCO, 1988, pp. 1133-1136.
    • S. McLaughlin, S. Hovell, A. Lowry, Identification of nonlinearities in vowel generation, in: Proceedings of the EUSIPCO, 1988, pp. 1133-1136.
  • 15
    • 54549103582 scopus 로고    scopus 로고
    • H. Teager, S. Teager, Evidence for nonlinear sound production mechanisms in the vocal tract, in: Proceedings of the NATO ASI on Speech Production and Speech Modeling, vol. II, 1989, pp. 241-261.
    • H. Teager, S. Teager, Evidence for nonlinear sound production mechanisms in the vocal tract, in: Proceedings of the NATO ASI on Speech Production and Speech Modeling, vol. II, 1989, pp. 241-261.
  • 16
    • 0038610905 scopus 로고    scopus 로고
    • Speech probability distribution
    • Gazor S., and Zhang W. Speech probability distribution. IEEE Signal Process. Lett. 10 7 (2003) 204-207
    • (2003) IEEE Signal Process. Lett. , vol.10 , Issue.7 , pp. 204-207
    • Gazor, S.1    Zhang, W.2
  • 17
    • 33645790677 scopus 로고    scopus 로고
    • G. Chollet, A. Esposito, M. Faundez-Zanuy, M. Marinaro, Nonlinear speech modeling and applications, in: Lecture Notes in Artificial Intelligence, vol. 3445, 2005.
    • G. Chollet, A. Esposito, M. Faundez-Zanuy, M. Marinaro, Nonlinear speech modeling and applications, in: Lecture Notes in Artificial Intelligence, vol. 3445, 2005.
  • 18
    • 54549113463 scopus 로고    scopus 로고
    • M. Faundez, D. Rodriguez, Speaker recognition by means of a combination of linear and nonlinear predictive models, in: Proceedings of the IEEE ICASSP'99, 1999.
    • M. Faundez, D. Rodriguez, Speaker recognition by means of a combination of linear and nonlinear predictive models, in: Proceedings of the IEEE ICASSP'99, 1999.
  • 19
    • 54549117124 scopus 로고    scopus 로고
    • M. Chetouani, M. Faundez-Zanuy, B. Gas, J.L. Zarader, A new nonlinear speaker parameterization algorithm for speaker identification, in: Proceedings of the ISCA Tutorial and Research Workshop on Speaker and Language Recognition (Odyssey'04), 2004, pp. 309-314.
    • M. Chetouani, M. Faundez-Zanuy, B. Gas, J.L. Zarader, A new nonlinear speaker parameterization algorithm for speaker identification, in: Proceedings of the ISCA Tutorial and Research Workshop on Speaker and Language Recognition (Odyssey'04), 2004, pp. 309-314.
  • 20
    • 0344145224 scopus 로고    scopus 로고
    • E. Rank, G. Kubin, Nonlinear synthesis of vowels in the LP residual domain with a regularized RBF network, in: Proceedings of the IWANN, vol. 2085(II), 2001, pp. 746-753.
    • E. Rank, G. Kubin, Nonlinear synthesis of vowels in the LP residual domain with a regularized RBF network, in: Proceedings of the IWANN, vol. 2085(II), 2001, pp. 746-753.
  • 21
    • 84985763086 scopus 로고    scopus 로고
    • J. Thyssen, H. Nielsen, S.D. Hansen, Non-linearities short-term prediction in speech coding, in: Proceedings of the IEEE ICASSP'94, vol. 1, 1994, pp. 185-188.
    • J. Thyssen, H. Nielsen, S.D. Hansen, Non-linearities short-term prediction in speech coding, in: Proceedings of the IEEE ICASSP'94, vol. 1, 1994, pp. 185-188.
  • 22
    • 1842683867 scopus 로고    scopus 로고
    • Chaotic characteristics of speech signal and its LPC residual
    • Tao C., Mu J., Xu X., and Du G. Chaotic characteristics of speech signal and its LPC residual. Acoust. Sci. Technol. 25 1 (2004) 50-53
    • (2004) Acoust. Sci. Technol. , vol.25 , Issue.1 , pp. 50-53
    • Tao, C.1    Mu, J.2    Xu, X.3    Du, G.4
  • 23
    • 4544352735 scopus 로고    scopus 로고
    • S.H. Chen, H.C. Wang, Improvement of speaker recognition by combining residual and prosodic features with acoustic features, in: Proceedings of the IEEE ICASSP'04, vol. 1, 2004, pp. 93-96.
    • S.H. Chen, H.C. Wang, Improvement of speaker recognition by combining residual and prosodic features with acoustic features, in: Proceedings of the IEEE ICASSP'04, vol. 1, 2004, pp. 93-96.
  • 24
    • 0026370634 scopus 로고    scopus 로고
    • K.K. Paliwal, M.M. Sondhi, Recognition of noisy speech using cumulant-based linear prediction analysis, in: Proceedings of the IEEE ICASSP'91, vol. 1, 1991, pp. 429-432.
    • K.K. Paliwal, M.M. Sondhi, Recognition of noisy speech using cumulant-based linear prediction analysis, in: Proceedings of the IEEE ICASSP'91, vol. 1, 1991, pp. 429-432.
  • 25
    • 84947932996 scopus 로고    scopus 로고
    • S. Hayakawa, K. Takeda, F. Itakura, Speaker identification using harmonic structure of LP-residual spectrum, in: Audio Video Biometric Personal Authentification, Lecture Notes in Computer Science, vol. 1206, Springer, Berlin, 1997, pp. 253-260.
    • S. Hayakawa, K. Takeda, F. Itakura, Speaker identification using harmonic structure of LP-residual spectrum, in: Audio Video Biometric Personal Authentification, Lecture Notes in Computer Science, vol. 1206, Springer, Berlin, 1997, pp. 253-260.
  • 26
    • 0029725598 scopus 로고    scopus 로고
    • J. He, L. Liu, G. Palm, On the use of residual cepstrum in speech recognition, in: Proceedings of the IEEE ICASSP'96, vol. 1, 1991, pp. 5-8.
    • J. He, L. Liu, G. Palm, On the use of residual cepstrum in speech recognition, in: Proceedings of the IEEE ICASSP'96, vol. 1, 1991, pp. 5-8.
  • 27
    • 54549113462 scopus 로고    scopus 로고
    • A. Satue-Villar, M. Faundez-Zanuy, On the relevance of language in speaker recognition, in: Proceedings of the EUROSPEECH'99, vol. 3, 1999, pp. 1231-1234.
    • A. Satue-Villar, M. Faundez-Zanuy, On the relevance of language in speaker recognition, in: Proceedings of the EUROSPEECH'99, vol. 3, 1999, pp. 1231-1234.
  • 28
    • 0025680225 scopus 로고    scopus 로고
    • C. Jankowski, A. Kalyanswamy, S. Basson, J. Spitz, NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database, in: Proceedings of the IEEE ICASSP, vol. 1, 1990, pp. 109-112.
    • C. Jankowski, A. Kalyanswamy, S. Basson, J. Spitz, NTIMIT: a phonetically balanced, continuous speech, telephone bandwidth speech database, in: Proceedings of the IEEE ICASSP, vol. 1, 1990, pp. 109-112.
  • 29
    • 0029352294 scopus 로고
    • Second-order statistical measures for text-independent speaker identification
    • Bimbot F., Magrin-Chagnolleau I., and Mathan L. Second-order statistical measures for text-independent speaker identification. Speech Commun. 17 (1995) 177-192
    • (1995) Speech Commun. , vol.17 , pp. 177-192
    • Bimbot, F.1    Magrin-Chagnolleau, I.2    Mathan, L.3
  • 30
    • 0029355999 scopus 로고
    • Speaker identification and verification using Gaussian mixture speaker models
    • Reynolds D.A. Speaker identification and verification using Gaussian mixture speaker models. Speech Commun. 17 (1995) 91-108
    • (1995) Speech Commun. , vol.17 , pp. 91-108
    • Reynolds, D.A.1
  • 31
    • 0034227991 scopus 로고    scopus 로고
    • Subband architecture for automatic speaker recognition
    • Besacier L., and Bonastre J.F. Subband architecture for automatic speaker recognition. Signal Process. 80 (2000) 1245-1259
    • (2000) Signal Process. , vol.80 , pp. 1245-1259
    • Besacier, L.1    Bonastre, J.F.2
  • 32
    • 54549118313 scopus 로고    scopus 로고
    • F. Bimbot, L. Mathan, Text-free speaker recognition using an arithmetic-harmonic sphericity measure, in: Proceedings of the EUROSPEECH'91, 1999, pp. 169-172.
    • F. Bimbot, L. Mathan, Text-free speaker recognition using an arithmetic-harmonic sphericity measure, in: Proceedings of the EUROSPEECH'91, 1999, pp. 169-172.
  • 35
    • 54549112288 scopus 로고    scopus 로고
    • C. Sanderson, Information fusion and person verification using speech and face information, IDIAP Research Report 02-33, 1-37, September 2002.
    • C. Sanderson, Information fusion and person verification using speech and face information, IDIAP Research Report 02-33, 1-37, September 2002.
  • 36
    • 26844493397 scopus 로고    scopus 로고
    • M. Chetouani, M. Faundez-Zanuy, B. Gas, J.L. Zarader, Non-linear speech feature extraction for phoneme classification and speaker recognition, in: G. Chollet et al. (Eds.), Nonlinear Speech Modeling, Lecture Notes in Artificial Intelligence, vol. 3445, 2005, pp. 344-350.
    • M. Chetouani, M. Faundez-Zanuy, B. Gas, J.L. Zarader, Non-linear speech feature extraction for phoneme classification and speaker recognition, in: G. Chollet et al. (Eds.), Nonlinear Speech Modeling, Lecture Notes in Artificial Intelligence, vol. 3445, 2005, pp. 344-350.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.