메뉴 건너뛰기




Volumn 47, Issue 4, 2005, Pages 436-456

Rhythmic unit extraction and modelling for automatic language identification

Author keywords

Asian languages; European languages; Language identification; Rhythm modelling; Rhythm typology

Indexed keywords

ACOUSTICS; ALGORITHMS; STRESS ANALYSIS;

EID: 27644531433     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2005.04.012     Document Type: Article
Times cited : (45)

References (86)
  • 2
    • 84893194243 scopus 로고    scopus 로고
    • Segmentation of speech for speaker and language recognition
    • Geneva
    • Adami, A.G., Hermansky, H., 2003. Segmentation of speech for speaker and language recognition. In: Proc. Eurospeech, Geneva, pp. 841-844.
    • (2003) Proc. Eurospeech , pp. 841-844
    • Adami, A.G.1    Hermansky, H.2
  • 4
    • 27644524595 scopus 로고    scopus 로고
    • Approches Segmentales multilingues pour 1'identification automatique de la langue: Phones et syllabes
    • Fes, Morocco
    • Antoine, F., Zhu, D., Boula de Mareüil, P., Adda-Decker, M., 2004. Approches Segmentales multilingues pour 1'identification automatique de la langue: phones et syllabes. In: Proc. Journées d'Etude de la Parole, Fes, Morocco.
    • (2004) Proc. Journées d'Etude de la Parole
    • Antoine, F.1    Zhu, D.2    Boula De Mareüil, P.3    Adda-Decker, M.4
  • 5
    • 85050713071 scopus 로고    scopus 로고
    • Stratégies perceptuelles et identification automatique des langues
    • M. Barkat-Defradas, I. Vasilescu, and F. Pellegrino Stratégies perceptuelles et identification automatique des langues Revue PArole 25/26 2003 1 37
    • (2003) Revue PArole , vol.25-26 , pp. 1-37
    • Barkat-Defradas, M.1    Vasilescu, I.2    Pellegrino, F.3
  • 6
    • 0026437941 scopus 로고
    • Productive and perceptual constraints on speech error correction
    • T. Berg Productive and perceptual constraints on speech error correction Psychol. Res. 54 1992 114 126
    • (1992) Psychol. Res. , vol.54 , pp. 114-126
    • Berg, T.1
  • 8
    • 0034969303 scopus 로고    scopus 로고
    • Comparison between language and music
    • Zatorre, R., Peretz, I. (Eds.), "The biological foundations of music"
    • Besson, M., Schön, D., 2001. Comparison between language and music. In: Zatorre, R., Peretz, I. (Eds.), "The biological foundations of music". Annals of The New York Academy of Sciences, Vol. 930.
    • (2001) Annals of the New York Academy of Sciences , vol.930
    • Besson, M.1    Schön, D.2
  • 9
    • 23044509995 scopus 로고    scopus 로고
    • Distinguishing samples of spoken Korean from rhythmic and regional competitors
    • Z.S. Bond, and V. Stockmal Distinguishing samples of spoken Korean from rhythmic and regional competitors Lang. Sci. 24 2002 175 185
    • (2002) Lang. Sci. , vol.24 , pp. 175-185
    • Bond, Z.S.1    Stockmal, V.2
  • 11
    • 85128413775 scopus 로고    scopus 로고
    • A multilingual prosodic database
    • Sydney, Australia
    • Campione, E., Véronis, J., 1998. A multilingual prosodic database. In: Proc. ICSLP'98, Sydney, Australia.
    • (1998) Proc. ICSLP'98
    • Campione, E.1    Véronis, J.2
  • 15
    • 0039222571 scopus 로고    scopus 로고
    • Language identification from prosody without explicit features
    • Cummins, F., Gers, F., Schmidhuber, J., 1999. Language identification from prosody without explicit features. In: Proc. EUROSPEECH'99.
    • (1999) Proc. EUROSPEECH'99
    • Cummins, F.1    Gers, F.2    Schmidhuber, J.3
  • 18
    • 84926271877 scopus 로고
    • Stress-timing and syllable-timing reanalyzed
    • R.M. Dauer Stress-timing and syllable-timing reanalyzed J. Phonet. 11 1983
    • (1983) J. Phonet. , vol.11
    • Dauer, R.M.1
  • 19
    • 27644528947 scopus 로고
    • Syllabic features and phonic impression in english, german, french and spanish
    • P. Delattre, and C. Olsen Syllabic features and phonic impression in english, german, french and spanish Lingua 22 1969 160 175
    • (1969) Lingua , vol.22 , pp. 160-175
    • Delattre, P.1    Olsen, C.2
  • 20
    • 0034120139 scopus 로고    scopus 로고
    • Neural network processing of natural language: I. Sensitivity to serial, temporal and abstract structure in the infant
    • P.F. Dominey, and F. Ramus Neural network processing of natural language: I. Sensitivity to serial, temporal and abstract structure in the infant Lang. Cognitive Process. 15 1 2000
    • (2000) Lang. Cognitive Process. , vol.15 , Issue.1
    • Dominey, P.F.1    Ramus, F.2
  • 21
    • 0028287770 scopus 로고
    • Effect of reducing slow temporal modulation on speech reception
    • R. Drullman, J.M. Festen, and R. Plomp Effect of reducing slow temporal modulation on speech reception JASA 95 5 1994
    • (1994) JASA , vol.95 , Issue.5
    • Drullman, R.1    Festen, J.M.2    Plomp, R.3
  • 23
    • 85009083819 scopus 로고    scopus 로고
    • Rhythm in read British english: Interdialect variability
    • Jeju, Korea, October
    • Ferragne, E., Pellegrino, F. Rhythm in read British english: Interdialect variability. In: Proc. INTERSPEECH/ICSLP 2004, Jeju, Korea, October 2004.
    • (2004) Proc. INTERSPEECH/ICSLP 2004
    • Ferragne, E.1    Pellegrino, F.2
  • 25
    • 0016466220 scopus 로고
    • Syllable as a unit of speech recognition
    • 02/1975
    • Fujimura, O., 1975. Syllable as a unit of speech recognition. IEEE Trans. on ASSP ASSP-23 (1), 82-87, 02/1975.
    • (1975) IEEE Trans. on ASSP , vol.ASSP-23 , Issue.1 , pp. 82-87
    • Fujimura, O.1
  • 29
    • 0345889916 scopus 로고    scopus 로고
    • Durational variability in speech and the rhythm class hypothesis
    • Mouton
    • Grabe, E., Low, E.L., 2002. Durational variability in speech and the rhythm class hypothesis, Papers in Laboratory Phonology 7, Mouton.
    • (2002) Papers in Laboratory Phonology , vol.7
    • Grabe, E.1    Low, E.L.2
  • 33
    • 20244382614 scopus 로고    scopus 로고
    • The relation of stress accent to pronunciation variation in spontaneous American English discourse
    • Red Bank, NJ, USA, 2002
    • Greenberg, S., Carvey, H.M., Hitchcock, L., 2002. The relation of stress accent to pronunciation variation in spontaneous American English discourse. In: Proc. 2001 ISCA Workshop Prosody and Speech Processing, Red Bank, NJ, USA, 2002, pp. 53-56.
    • (2002) Proc. 2001 ISCA Workshop Prosody and Speech Processing , pp. 53-56
    • Greenberg, S.1    Carvey, H.M.2    Hitchcock, L.3
  • 36
    • 0020117798 scopus 로고
    • Forward masking as a function of frequency, masker level and signal delay
    • W. Jestead, S.P. Bacon, and J.R. Lehman Forward masking as a function of frequency, masker level and signal delay JASA 74 4 1982
    • (1982) JASA , vol.74 , Issue.4
    • Jestead, W.1    Bacon, S.P.2    Lehman, J.R.3
  • 37
    • 27644555648 scopus 로고    scopus 로고
    • Output requirements for a high-quality speech synthesis system: The case of disambiguation
    • 12-14 August 96
    • Keller, E., Zellner, B., 1997. Output requirements for a high-quality speech synthesis system: The case of disambiguation. In: Proc. MIDDIM-96, 12-14 August 96, pp. 300-308.
    • (1997) Proc. MIDDIM-96 , pp. 300-308
    • Keller, E.1    Zellner, B.2
  • 39
    • 27644556539 scopus 로고    scopus 로고
    • Periodicity of Japanese accent in continuous speech
    • Aix en Provence, France, April 2002
    • Kitazawa, S., 2002. Periodicity of Japanese accent in continuous speech. In: Speech Prosody, Aix en Provence, France, April 2002.
    • (2002) Speech Prosody
    • Kitazawa, S.1
  • 40
    • 27644450996 scopus 로고    scopus 로고
    • Perceptual discrimination of prosodic types
    • Nara, Japan, 2004
    • Komatsu, M., Arai, T., Sugawara, T., 2004. Perceptual discrimination of prosodic types. In: Proc. Speech Prosody, Nara, Japan, 2004, pp. 725-728.
    • (2004) Proc. Speech Prosody , pp. 725-728
    • Komatsu, M.1    Arai, T.2    Sugawara, T.3
  • 41
    • 84957798418 scopus 로고    scopus 로고
    • Speech recognition and syllable segments
    • Proc. Workshop on Text, Speech and Dialogue-TSD'99 Springer-Verlag
    • I. Kopecek Speech recognition and syllable segments Proc. Workshop on Text, Speech and Dialogue-TSD'99 Lectures Notes in Artificial Intelligence 1692 1999 Springer-Verlag
    • (1999) Lectures Notes in Artificial Intelligence 1692
    • Kopecek, I.1
  • 43
    • 0028417520 scopus 로고
    • Do speakers have access to a mental syllabary
    • W. Levelt, and L. Wheeldon Do speakers have access to a mental syllabary Cognition 1994 50
    • (1994) Cognition , pp. 50
    • Levelt, W.1    Wheeldon, L.2
  • 44
    • 0004698069 scopus 로고
    • Automatic language identification using syllabic spectral features
    • Adelaide, Australia
    • Li, K.P., 1994. Automatic language identification using syllabic spectral features. In: Proc. IEEE ICASSP'94, Adelaide, Australia.
    • (1994) Proc. IEEE ICASSP'94
    • Li, K.P.1
  • 45
    • 0022143866 scopus 로고
    • The motor theory of speech perception revised
    • A.M. Liberman, and I.G. Mattingly The motor theory of speech perception revised Cognition 1985 21
    • (1985) Cognition , pp. 21
    • Liberman, A.M.1    Mattingly, I.G.2
  • 47
    • 0031760948 scopus 로고    scopus 로고
    • The frame/content theory of evolution of speech production
    • P. MacNeilage The frame/content theory of evolution of speech production Brain Behavior. Sci. 21 1998 499 546
    • (1998) Brain Behavior. Sci. , vol.21 , pp. 499-546
    • MacNeilage, P.1
  • 48
    • 0010020177 scopus 로고    scopus 로고
    • Evolution of speech: The relation between ontogeny and phytogeny
    • J.R. Hurford C. Knight M.G. Studdert-Kennedy Cambridge University Press Cambridge
    • P.P. MacNeilage, and B.L. Davis Evolution of speech: The relation between ontogeny and phytogeny J.R. Hurford C. Knight M.G. Studdert-Kennedy The Evolutionary Emergence of Language 2000 Cambridge University Press Cambridge 146 160
    • (2000) The Evolutionary Emergence of Language , pp. 146-160
    • MacNeilage, P.P.1    Davis, B.L.2
  • 49
    • 0033658124 scopus 로고    scopus 로고
    • The motor core of speech: A comparison of serial organization patterns in infants and languages
    • P.P. MacNeilage, B.L. Davis, A. Kinney, and C.L. Maryear The motor core of speech: A comparison of serial organization patterns in infants and languages Child Develop. 71 2000 153 163
    • (2000) Child Develop. , vol.71 , pp. 153-163
    • MacNeilage, P.P.1    Davis, B.L.2    Kinney, A.3    Maryear, C.L.4
  • 50
    • 85009208002 scopus 로고    scopus 로고
    • NIST 2003 language recognition evaluation
    • Geneva
    • Martin, A.F., Przybocki., M.A., 2003. NIST 2003 language recognition evaluation. In: Proc. Eurospeech, Geneva, pp. 1341-1344.
    • (2003) Proc. Eurospeech , pp. 1341-1344
    • Martin, A.F.1    Przybocki, M.A.2
  • 51
    • 0015307394 scopus 로고
    • Preperceptual images, processing time and perceptual units in auditory perception
    • D.W. Massaro Preperceptual images, processing time and perceptual units in auditory perception Psychol. Rev. 79 2 1972
    • (1972) Psychol. Rev. , vol.79 , Issue.2
    • Massaro, D.W.1
  • 54
    • 0342931849 scopus 로고
    • Fast speakers in large vocabulary continuous speech recognition: Analysis & antidotes
    • Madrid, Spain
    • Mirghafori, N., Fosler, E., Morgan, N., 1995. Fast speakers in large vocabulary continuous speech recognition: Analysis & antidotes. In: Proc. Eurospeech'95, Madrid, Spain.
    • (1995) Proc. Eurospeech'95
    • Mirghafori, N.1    Fosler, E.2    Morgan, N.3
  • 55
    • 0000164460 scopus 로고
    • Perceptual benchmarks for automatic language identification
    • Adelaide, Australia
    • Muthusamy, Y.K., Jain, N., Cole, R.A., 1994. Perceptual benchmarks for automatic language identification. In: Proc. IEEE ICASSP'94, Adelaide, Australia.
    • (1994) Proc. IEEE ICASSP'94
    • Muthusamy, Y.K.1    Jain, N.2    Cole, R.A.3
  • 57
    • 0037680360 scopus 로고    scopus 로고
    • Perception and acquisition of linguistic rhythm by infants
    • T. Nazzi, and F. Ramus Perception and acquisition of linguistic rhythm by infants Speech Comm. 41 1-2 2003 233 243
    • (2003) Speech Comm. , vol.41 , Issue.1-2 , pp. 233-243
    • Nazzi, T.1    Ramus, F.2
  • 59
    • 0005665544 scopus 로고
    • On listeners' ability to identify languages by their prosody
    • Leon & Rossi (Eds.) Hurtubise HMH
    • Ohala, J.J., Gilbert, B., 1979. On listeners' ability to identify languages by their prosody. In: Leon & Rossi (Eds.), Problémes de prosodie, vol. 2, Hurtubise HMH.
    • (1979) Problémes de Prosodie , vol.2
    • Ohala, J.J.1    Gilbert, B.2
  • 61
    • 0034227923 scopus 로고    scopus 로고
    • Automatic language identification: An alternative approach to phonetic modelling
    • F. Pellegrino, and R. André-Obrecht Automatic language identification: An alternative approach to phonetic modelling Signal Process. 80 7 2000 1231 1244
    • (2000) Signal Process. , vol.80 , Issue.7 , pp. 1231-1244
    • Pellegrino, F.1    André-Obrecht, R.2
  • 62
    • 27644538450 scopus 로고    scopus 로고
    • Autom atic estimation of speaking rate in multilingual spontaneous speech
    • Nara, Japan, March 2004
    • Pellegrino, F., Farinas, J., Rouas, J-.L., 2004. Autom atic estimation of speaking rate in multilingual spontaneous speech. In: Proc. Speech Prosody 2004, Nara, Japan, March 2004.
    • (2004) Proc. Speech Prosody 2004
    • Pellegrino, F.1    Farinas, J.2    Rouas, J.-L.3
  • 63
    • 84892173311 scopus 로고    scopus 로고
    • Estimating the speaking rate by vowel detection
    • Seattle, WA, USA
    • Pfau, T., Ruske, G., 1998. Estimating the speaking rate by vowel detection. In: Proc. IEEE ICASSP'98, Seattle, WA, USA.
    • (1998) Proc. IEEE ICASSP'98
    • Pfau, T.1    Ruske, G.2
  • 65
    • 84937382170 scopus 로고    scopus 로고
    • Language discrimination by newborns: Teasing apart phonotactic, rhythmic, and intonational cues
    • F. Ramus Language discrimination by newborns: Teasing apart phonotactic, rhythmic, and intonational cues Ann. Rev. Lang. Acquis. 2 2002 85 115
    • (2002) Ann. Rev. Lang. Acquis. , vol.2 , pp. 85-115
    • Ramus, F.1
  • 66
    • 0347151091 scopus 로고    scopus 로고
    • Acoustic correlates of linguistic rhythm: Perspectives
    • Aix-en-Provence, France
    • Ramus, F., 2002b. Acoustic correlates of linguistic rhythm: Perspectives. In: Proc. Speech Prosody 2002, Aix-en-Provence, France.
    • (2002) Proc. Speech Prosody 2002
    • Ramus, F.1
  • 67
    • 0032943763 scopus 로고    scopus 로고
    • Language identification with suprasegmental cues: A study based on speech resynthesis
    • F. Ramus, and J. Mehler Language identification with suprasegmental cues: A study based on speech resynthesis J. Acoust. Soc. Amer. 105 1 1999
    • (1999) J. Acoust. Soc. Amer. , vol.105 , Issue.1
    • Ramus, F.1    Mehler, J.2
  • 68
    • 0032725252 scopus 로고    scopus 로고
    • Correlates of linguistic rhythm in the speech signal
    • F. Ramus, M. Nespor, and J. Mehler Correlates of linguistic rhythm in the speech signal Cognition 73 3 1999
    • (1999) Cognition , vol.73 , Issue.3
    • Ramus, F.1    Nespor, M.2    Mehler, J.3
  • 69
    • 0029355999 scopus 로고
    • Speaker identification and verification using Gaussian mixture speaker models
    • D.A. Reynolds Speaker identification and verification using Gaussian mixture speaker models Speech Comm. 17 1-2 1995 91 108
    • (1995) Speech Comm. , vol.17 , Issue.1-2 , pp. 91-108
    • Reynolds, D.A.1
  • 70
    • 0141814772 scopus 로고    scopus 로고
    • Modeling prosody for language identification on read and spontaneous speech
    • Hong Kong, China
    • Rouas, J-.L., Farinas, J., Pellegrino, F., Regine André-Obrecht, 2003. Modeling prosody for language identification on read and spontaneous speech. In: Proc. ICASSP'2003, Hong Kong, China, pp. 40-43.
    • (2003) Proc. ICASSP'2003 , pp. 40-43
    • Rouas, J.-L.1    Farinas, J.2    Pellegrino, F.3    André-Obrecht, R.4
  • 71
    • 27644463967 scopus 로고    scopus 로고
    • Evaluation automatique du débit de la parole sur des données multilingues spontanées
    • Fés, Maroc, April 2004
    • Rouas, J-.L., Farinas, J., Pellegrino, F., Regine André-Obrecht, 2004. Evaluation automatique du débit de la parole sur des données multilingues spontanées. In: actes des XXVémes JEP, Fés, Maroc, April 2004.
    • (2004) Actes des XXVémes JEP
    • Rouas, J.-L.1    Farinas, J.2    Pellegrino, F.3    André-Obrecht, R.4
  • 72
    • 14944342211 scopus 로고    scopus 로고
    • Syllable detection and segmentation using temporal flow neural networks
    • San Francisco, CA, USA
    • Shastri, L., Chang, S., Greenberg, S., 1999. Syllable detection and segmentation using temporal flow neural networks. In: Proc. ICPhS'99, San Francisco, CA, USA.
    • (1999) Proc. ICPhS'99
    • Shastri, L.1    Chang, S.2    Greenberg, S.3
  • 74
    • 0030374924 scopus 로고    scopus 로고
    • Perceptual features of unknown foreign languages as revealed by multi-dimensional scaling
    • Philadelphia
    • Stockmal, V., Muljani, D., Bond, Z.S., 1996. Perceptual features of unknown foreign languages as revealed by multi-dimensional scaling. In: Proc. ICSLP, Philadelphia, pp. 1748-1751.
    • (1996) Proc. ICSLP , pp. 1748-1751
    • Stockmal, V.1    Muljani, D.2    Bond, Z.S.3
  • 76
    • 85135152214 scopus 로고    scopus 로고
    • Using intonation to constrain language models in speech recognition
    • Rhodes, Greece
    • Taylor, P.A., King, S., Isard, S.D., Wright, H., Kowtko, J., 1997. Using intonation to constrain language models in speech recognition. In: Proc. Eurospeech 97, Rhodes, Greece.
    • (1997) Proc. Eurospeech 97
    • Taylor, P.A.1    King, S.2    Isard, S.D.3    Wright, H.4    Kowtko, J.5
  • 77
    • 27644461426 scopus 로고    scopus 로고
    • Prosodic features in automatic language identification reflect language typology
    • San Francisco, CA, USA
    • Thymé-Gobbel, A., Hutchins, S.E., 1999. Prosodic features in automatic language identification reflect language typology. In: Proc. ICPhS'99, San Francisco, CA, USA.
    • (1999) Proc. ICPhS'99
    • Thymé-Gobbel, A.1    Hutchins, S.E.2
  • 78
    • 0344728663 scopus 로고
    • A computational model of prosody perception
    • Yokohama, Japan
    • Todd, N.P., Brown, G.J., 1994. A computational model of prosody perception. In: Proc. ICSLP'94, Yokohama, Japan.
    • (1994) Proc. ICSLP'94
    • Todd, N.P.1    Brown, G.J.2
  • 79
    • 27644513304 scopus 로고    scopus 로고
    • Des lexiques aux syllabes des langues du monde-Typologies et structures
    • Aussois, France
    • Vallée, N., Boë, L.J., Maddieson, I., Rousset, L., 2000. Des lexiques aux syllabes des langues du monde-Typologies et structures. In: Proc. JEP 2000, Aussois, France.
    • (2000) Proc. JEP 2000
    • Vallée, N.1    Boë, L.J.2    Maddieson, I.3    Rousset, L.4
  • 80
    • 85009083837 scopus 로고    scopus 로고
    • Perceptual features for the identification of romance languages
    • Beijing
    • Vasilescu, I., Pellegrino, F., Hombert, J., 2000. Perceptual features for the identification of romance languages. In: Proc. ICSLP'2000, Beijing.
    • (2000) Proc. ICSLP'2000
    • Vasilescu, I.1    Pellegrino, F.2    Hombert, J.3
  • 81
    • 27644549062 scopus 로고    scopus 로고
    • A fast and reliable rate of speech detector
    • Philadelphia, PA, USA
    • Verhasselt, J.P., Martens, J-.P., 1996. A fast and reliable rate of speech detector. In: Proc. ISCLP'96, Philadelphia, PA, USA.
    • (1996) Proc. ISCLP'96
    • Verhasselt, J.P.1    Martens, J.-P.2
  • 84
    • 21444459666 scopus 로고    scopus 로고
    • Revisiting the status of speech rhythm
    • Bernard Bel, Isabelle Marlien (Eds.) 11-13 April 2002
    • Zellner Keller, B., 2002. Revisiting the status of speech rhythm. In: Bernard Bel, Isabelle Marlien (Eds.), Proc. Speech Prosody 2002 Conf., 11-13 April 2002, pp. 727-730.
    • (2002) Proc. Speech Prosody 2002 Conf. , pp. 727-730
    • Zellner Keller, B.1
  • 85
    • 0012530434 scopus 로고    scopus 로고
    • Representing speech rhythm
    • E. Keller G. Bailly A. Monaghan J. Terken M. Huckvale John Wiley Chichester
    • B. Zellner Keller, and E. Keller Representing speech rhythm E. Keller G. Bailly A. Monaghan J. Terken M. Huckvale Improvements in Speech Synthesis 2001 John Wiley Chichester
    • (2001) Improvements in Speech Synthesis
    • Zellner Keller, B.1    Keller, E.2
  • 86
    • 0035427178 scopus 로고    scopus 로고
    • Automatic language identification
    • M.A. Zissman, and K.M. Berkling Automatic language identification Speech Comm. 35 1-2 2001 115 124
    • (2001) Speech Comm. , vol.35 , Issue.1-2 , pp. 115-124
    • Zissman, M.A.1    Berkling, K.M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.