



Volume 36, Issue 5, 2011, Pages 745-782

Group delay functions and its applications in speech technology

Author keywords

feature extraction from phase; feature switching; Fourier transform phase; group delay functions; K L divergence; mutual information

Indexed keywords

GROUP DELAY FUNCTIONS; HIGH RESOLUTION; K-L DIVERGENCE; LANGUAGE IDENTIFICATION; MODIFIED GROUP DELAY; MUTUAL INFORMATIONS; PHASE COMPONENT; PHASE FUNCTIONS; SPEAKER RECOGNITION; SPEECH SIGNALS; SPEECH SYSTEMS; SPEECH TECHNOLOGY; SYLLABLE BOUNDARIES; VOCAL-TRACTS;

EID: 84856259774     PISSN: 02562499     EISSN: 09737677     Source Type: Journal    
DOI: 10.1007/s12046-011-0045-1     Document Type: Article
Times cited: 109

References (62)
  • 2
    • 33646255447
    • Further intelligibility results from human listening tests using the short-time phase spectrum
    • Alsteris L D, Paliwal K K 2006 Further intelligibility results from human listening tests using the short-time phase spectrum. Speech Commun. 48: 727-736.
    • (2006) Speech Commun. , vol.48 , pp. 727-736
    • Alsteris, L.D.1    Paliwal, K.K.2
  • 3
    • 0033884857
    • Score normalisation for text-independent speaker verification systems
    • Auckenthaler R, Carey M, Lloyd-Thomas H 2000 Score normalisation for text-independent speaker verification systems. Digital Signal Process. 10: 42-54.
    • (2000) Digital Signal Process. , vol.10 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 5
    • 33947159989
    • Chirp group delay analysis of speech signals
    • Bozkurt B, Couvreur L, Dutoit T 2007 Chirp group delay analysis of speech signals. Speech Commun. 49(3): 159-176.
    • (2007) Speech Commun. , vol.49 , Issue.3 , pp. 159-176
    • Bozkurt, B.1    Couvreur, L.2    Dutoit, T.3
  • 8
    • 0017542202
    • The cepstrum: A guide to processing
    • Childers D G 1977 The cepstrum: A guide to processing. Proc. IEEE 68: 1428-1443.
    • (1977) Proc. IEEE , vol.68 , pp. 1428-1443
    • Childers, D.G.1
  • 9
    • 84856254879
    • CUED 2002 HTK Speech Recognition Toolkit
    • CUED 2002 HTK Speech Recognition Toolkit. http://htk.eng.cam.ac.uk.
  • 10
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S, Mermelstein P 1980 Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech, Signal Process 28: 357-366.
    • (1980) IEEE Trans. Acoust. Speech, Signal Process , vol.28 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 11
    • 28244462378
    • DDNews, India, Speech and Vision Lab, IIT Madras, Chennai
    • DDNews 2001 Database for Indian languages. India, Speech and Vision Lab, IIT Madras, Chennai.
    • (2001) Database For Indian Languages
  • 12
    • 0034270644
    • Audio-visual speech modeling for continuous speech recognition
    • Dupont S, Luettin J 2000 Audio-visual speech modeling for continuous speech recognition. IEEE Trans. Multimedia 2(3): 141-151.
    • (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 14
    • 0033357399
    • Speaking in shorthand - A syllable centric perspective for understanding pronunciation variation
    • Greenberg S 1999 Speaking in shorthand - A syllable centric perspective for understanding pronunciation variation. Speech Commun. 29: 159-176.
    • (1999) Speech Commun. , vol.29 , pp. 159-176
    • Greenberg, S.1
  • 19
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • Hermansky H 1990 Perceptual linear predictive (PLP) analysis of speech. J. of the Acoust. Soc. of Am. 87: 1738-1752.
    • (1990) J. of the Acoust. Soc. of Am. , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 23
    • 1842475640
    • Automatic segmentation of continuous speech using minimum phase group delay functions
    • Kamakshi Prasad V, Nagarajan T, Murthy H A 2004 Automatic segmentation of continuous speech using minimum phase group delay functions. Speech Commun. 42: 429-446.
    • (2004) Speech Commun. , vol.42 , pp. 429-446
    • Kamakshi Prasad, V.1    Nagarajan, T.2    Murthy, H.A.3
  • 30
    • 0141590458
    • Training of stream weights for the decoding of speech using parallel feature streams
    • Li X, Stern R 2003 Training of stream weights for the decoding of speech using parallel feature streams. Proc. IEEE Int. Conf. Acoust. Speech Signal Process, 1: 832-835.
    • (2003) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.1 , pp. 832-835
    • Li, X.1    Stern, R.2
  • 31
    • 0018478297
    • Spectral root homomorphic deconvolution system
    • Lim J 1979 Spectral root homomorphic deconvolution system. IEEE Trans. Acoust. Speech Signal Process 27: 223-233.
    • (1979) IEEE Trans. Acoust. Speech Signal Process , vol.27 , pp. 223-233
    • Lim, J.1
  • 34
    • 0026204672
    • Formant extraction from minimum phase group delay function
    • Murthy H A, Yegnanarayana B 1991 Formant extraction from minimum phase group delay function. Speech Commun. 10: 209-221.
    • (1991) Speech Commun. , vol.10 , pp. 209-221
    • Murthy, H.A.1    Yegnanarayana, B.2
  • 35
    • 0024681756
    • Effectiveness of representation of signals through group delay functions
    • Murthy K V M, Yegnanarayana B 1989 Effectiveness of representation of signals through group delay functions. Elsevier Signal Process. 17: 141-150.
    • (1989) Elsevier Signal Process. , vol.17 , pp. 141-150
    • Murthy, K.V.M.1    Yegnanarayana, B.2
  • 39
    • 84856279105
    • NIST 2003 The NIST year 2003 speaker recognition evaluation plan
    • NIST 2003 The NIST year 2003 speaker recognition evaluation plan. http://www.itl.nist.gov/iad/mig/tests/sre/2003/index.html.
  • 40
    • 0014055288
    • Cepstrum pitch determination
    • Noll AM 1967 Cepstrum pitch determination. J. Acoust. Soc. Am. 41(2): 179-195.
    • (1967) J. Acoust. Soc. Am , vol.41 , Issue.2 , pp. 179-195
    • Noll, A.M.1
  • 41
    • 84856254875
    • The OGI multi-language telephone speech corpus
    • OGI, Proc. Int. Conf. Spoken Lang., Banff, Alberta
    • OGI 1992 The OGI multi-language telephone speech corpus. Proc. Int. Conf. Spoken Lang., Banff, Alberta.
    • (1992)
  • 45
    • 13544259544
    • On the usefulness of STFT phase spectrum in human listening tests
    • Paliwal K K, Alsteris L D 2005 On the usefulness of STFT phase spectrum in human listening tests. Speech Commun. 45: 153-170.
    • (2005) Speech Commun. , vol.45 , pp. 153-170
    • Paliwal, K.K.1    Alsteris, L.D.2
  • 49
    • 65249112285
    • Vowel onset point detection using source, spectral peaks and modulation spectrum energies
    • Prasanna S, Reddy S B, Krishnamoorthy P 2009 Vowel onset point detection using source, spectral peaks and modulation spectrum energies. IEEE Trans. Audio Speech Language Process. 17(4): 556-565.
    • (2009) IEEE Trans. Audio Speech Language Process. , vol.17 , Issue.4 , pp. 556-565
    • Prasanna, S.1    Reddy, S.B.2    Krishnamoorthy, P.3
  • 50
    • 3943055955
    • The chirp z-transform algorithm and its application
    • Rabiner L R, Schafer R W 1969 The chirp z-transform algorithm and its application. Bell Syst. Tech. J. 48(5): 1249-1292.
    • (1969) Bell Syst. Tech. J. , vol.48 , Issue.5 , pp. 1249-1292
    • Rabiner, L.R.1    Schafer, R.W.2
  • 56
    • 84856272996
    • Acoustic-phonetic continuous speech corpus
    • TIMIT, National Institute of Standards and Technology Speech Disc 1-1. 1. Fisher W, Doddington G, Goudie Marshall K M, Proc. DARPA Workshop on Speech Recognition, California
    • TIMIT 1990 Acoustic-phonetic continuous speech corpus. National Institute of Standards and Technology Speech Disc 1-1. 1. Fisher W, Doddington G, Goudie Marshall K M 1986 The DARPA speech recognition research database: Specifications and status. Proc. DARPA Workshop on Speech Recognition, California, 93-99.
    • (1990) The DARPA Speech Recognition Research Database: Specifications and Status , pp. 93-99
  • 58
    • 0017969757
    • Formant extraction from linear prediction phase spectra
    • Yegnanarayana B 1979 Formant extraction from linear-prediction phase spectra. J. Acoust. Soc. Am. 63: 1638-1640.
    • (1979) J. Acoust. Soc. Am. , vol.63 , pp. 1638-1640
    • Yegnanarayana, B.1
  • 59
    • 0026923568
    • Significance of group delay functions in spectrum estimation
    • Yegnanarayana B, Murthy H A 1992 Significance of group delay functions in spectrum estimation. IEEE Trans. Signal Process. 40(9): 2281-2289.
    • (1992) IEEE Trans. Signal Process. , vol.40 , Issue.9 , pp. 2281-2289
    • Yegnanarayana, B.1    Murthy, H.A.2
  • 60
  • 62
    • 0029733178
    • Comparison of four approaches to automatic language identification of telephone speech
    • Zissman M A 1996 Comparison of four approaches to automatic language identification of telephone speech. IEEE Trans. Speech Audio Process 4(1): 31-44.
    • (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.1 , pp. 31-44
    • Zissman, M.A.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.