메뉴 건너뛰기




Volumn 101, Issue 5, 2013, Pages 1136-1159

Spoken language recognition: From fundamentals to practice

Author keywords

Acoustic features; calibration; classifier; fusion; language recognition evaluation (LRE); phonotactic features; spoken language recognition; tokenization; vector space modeling

Indexed keywords

CALIBRATION; CLASSIFIERS; COMPUTATION THEORY; FUSION REACTIONS; SIGNAL PROCESSING; SPEECH RECOGNITION; VECTOR SPACES;

EID: 84876676725     PISSN: 00189219     EISSN: None     Source Type: Journal    
DOI: 10.1109/JPROC.2012.2237151     Document Type: Article
Times cited : (291)

References (151)
  • 4
    • 0004694842 scopus 로고
    • Analysis of phoneme-based features for language identification
    • Adelaide, Australia
    • K. M. Berkling and E. Barnard, "Analysis of phoneme-based features for language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, 1994, pp. 289-292.
    • (1994) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 289-292
    • Berkling, K.M.1    Barnard, E.2
  • 5
    • 85135152779 scopus 로고
    • Language identification of six languages based on a common set of broad phonemes
    • Yokohama, Japan
    • K. M. Berkling and E. Barnard, "Language identification of six languages based on a common set of broad phonemes," in Proc. Int. Conf. Spoken Lang. Process., Yokohama, Japan, 1994, pp. 1891-1894.
    • (1994) Proc. Int. Conf. Spoken Lang. Process , pp. 1891-1894
    • Berkling, K.M.1    Barnard, E.2
  • 7
    • 24744468841 scopus 로고    scopus 로고
    • Automatic language identification with perceptually guided training and recurrent neural networks
    • Sydney, Australia
    • J. Braun and H. Levkowitz, "Automatic language identification with perceptually guided training and recurrent neural networks," in Proc. Int. Conf. Spoken Lang. Process., Sydney, Australia, 1998, pp. 289-292.
    • (1998) Proc. Int. Conf. Spoken Lang. Process , pp. 289-292
    • Braun, J.1    Levkowitz, H.2
  • 8
    • 42749108057 scopus 로고    scopus 로고
    • On calibration of language recognition scores
    • San Juan, Puerto Rico DOI: 10.1109/ODYSSEY.2006.248106
    • N. Brummer and D. Leeuwen, "On calibration of language recognition scores," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY.2006.248106.
    • (2006) Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
    • Brummer, N.1    Leeuwen, D.2
  • 9
    • 29044433376 scopus 로고    scopus 로고
    • Application-independent evaluation of speaker detection
    • N. Brummer and J. Preez, "Application-independent evaluation of speaker detection," Comput. Speech Lang., vol. 20, no. 2, pp. 230-275, 2006.
    • (2006) Comput. Speech Lang. , vol.20 , Issue.2 , pp. 230-275
    • Brummer, N.1    Preez, J.2
  • 17
    • 37649028010 scopus 로고    scopus 로고
    • Advanced language recognition using cepstra and phonotactics: MITLL system performance on the NIST 2005 language recognition evaluation
    • San Juan, Puerto Rico DOI: 10.1109/ODYSSEY.2006.248097
    • W. Campbell, T. Gleason, J. Navratil, D. Reynolds, W. Shen, E. Singer, and P. Torres-Carrasquillo, "Advanced language recognition using cepstra and phonotactics: MITLL system performance on the NIST 2005 language recognition evaluation," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY.2006.248097.
    • (2006) Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
    • Campbell, W.1    Gleason, T.2    Navratil, J.3    Reynolds, D.4    Shen, W.5    Singer, E.6    Torres-Carrasquillo, P.7
  • 18
    • 33645887246 scopus 로고    scopus 로고
    • Support vector machines using GMM supervectors for speaker verification
    • May
    • W. M. Campbell, D. E. Sturim, and D. A. Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-310, May 2006.
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.5 , pp. 308-310
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3
  • 19
    • 51549119947 scopus 로고    scopus 로고
    • A covariance Kernel for SVM language recognition
    • Las Vegas, NV, USA
    • W. M. Campbell, "A covariance Kernel for SVM language recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Las Vegas, NV, USA, 2008, pp. 4141-4144.
    • (2008) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 4141-4144
    • Campbell, W.M.1
  • 20
    • 84867216951 scopus 로고    scopus 로고
    • A comparison of subspace feature-domain methods for language recognition
    • Brisbane, Australia
    • W. M. Campbell, D. E. Sturim, P. Torres-Carrasquillo, and D. A. Reynolds, "A comparison of subspace feature-domain methods for language recognition," in Proc. Interspeech Conf., Brisbane, Australia, 2008, pp. 309-312.
    • (2008) Proc. Interspeech Conf , pp. 309-312
    • Campbell, W.M.1    Sturim, D.E.2    Torres-Carrasquillo, P.3    Reynolds, D.A.4
  • 23
    • 85032751967 scopus 로고    scopus 로고
    • Retrieval and browsing of spoken content
    • May
    • C. Chelba, T. Hazen, and M. Saraclar, "Retrieval and browsing of spoken content," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 39-49, May 2008.
    • (2008) IEEE Signal Process. Mag. , vol.25 , Issue.3 , pp. 39-49
    • Chelba, C.1    Hazen, T.2    Saraclar, M.3
  • 24
    • 0000567234 scopus 로고    scopus 로고
    • Vector-based natural language call routing
    • J. Chu-Carrol and B. Carpenter, "Vector-based natural language call routing," Comput. Linguist., vol. 25, no. 3, pp. 361-388, 1999.
    • (1999) Comput. Linguist. , vol.25 , Issue.3 , pp. 361-388
    • Chu-Carrol, J.1    Carpenter, B.2
  • 27
    • 0004717613 scopus 로고
    • Development of an automatic identification system of spoken languages: Phase i
    • Paris, France
    • D. Cimarusti and R. Ives, "Development of an automatic identification system of spoken languages: Phase I," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Paris, France, 1982, pp. 1661-1663.
    • (1982) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 1661-1663
    • Cimarusti, D.1    Ives, R.2
  • 31
    • 84859066901 scopus 로고    scopus 로고
    • Analysis of large-scale SVM training algorithms for language and speaker recognition
    • Jul.
    • S. Cumani and P. Laface, "Analysis of large-scale SVM training algorithms for language and speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 5, pp. 1585-1596, Jul. 2012.
    • (2012) IEEE Trans. Audio Speech Lang. Process. , vol.20 , Issue.5 , pp. 1585-1596
    • Cumani, S.1    Laface, P.2
  • 32
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust. Speech Signal Process., vol. 28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 33
    • 64249101047 scopus 로고    scopus 로고
    • Modeling prosodic features with joint factor analysis for speaker verification
    • Sep.
    • N. Dehak, P. Dumouchel, and P. Kenny, "Modeling prosodic features with joint factor analysis for speaker verification," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 7, pp. 2095-2103, Sep. 2007.
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.7 , pp. 2095-2103
    • Dehak, N.1    Dumouchel, P.2    Kenny, P.3
  • 35
    • 84865750857 scopus 로고    scopus 로고
    • Language recognition via i-vectors and dimensionality reduction
    • Florence, Italy
    • N. Dehak, P. Torres-Carrasquillo, D. Reynolds, and R. Dehak, "Language recognition via i-vectors and dimensionality reduction," in Proc. Interspeech Conf., Florence, Italy, 2011, pp. 857-860.
    • (2011) Proc. Interspeech Conf , pp. 857-860
    • Dehak, N.1    Torres-Carrasquillo, P.2    Reynolds, D.3    Dehak, R.4
  • 36
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the em algorithm
    • A. Dumpster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Stat. Soc., vol. 39, pp. 1-38, 1977.
    • (1977) J. R. Stat. Soc. , vol.39 , pp. 1-38
    • Dumpster, A.1    Laird, N.2    Rubin, D.3
  • 37
    • 0003984557 scopus 로고
    • Statistical identification of language
    • New Mexico State Univ., Las Cruces, NM, USA, Tech. Rep. MCCS-94-273
    • T. Dunning, "Statistical identification of language," Comput. Res. Lab (CRL), New Mexico State Univ., Las Cruces, NM, USA, Tech. Rep. MCCS-94-273, 1994.
    • (1994) Comput. Res. Lab (CRL)
    • Dunning, T.1
  • 39
    • 0028419019 scopus 로고
    • Maximum a posterior estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J. L. Gauvain and C.-H. Lee, "Maximum a posterior estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.-H.2
  • 41
    • 49949150022 scopus 로고
    • Language identification in the limit
    • E. M. Gold, "Language identification in the limit," Inf. Control, vol. 10, no. 5, pp. 447-474, 1967.
    • (1967) Inf. Control , vol.10 , Issue.5 , pp. 447-474
    • Gold, E.M.1
  • 44
    • 85135379346 scopus 로고
    • Automatic language identification using a segment-based approach
    • Berlin, Germany
    • T. J. Hazen and V. W. Zue, "Automatic language identification using a segment-based approach," in Proc. Eurospeech Conf., Berlin, Germany, 1993, pp. 1303-1306.
    • (1993) Proc. Eurospeech Conf , pp. 1303-1306
    • Hazen, T.J.1    Zue, V.W.2
  • 45
    • 84863902506 scopus 로고
    • Recent improvements in an approach to segment-based automatic language identification
    • Adelaide, Australia
    • T. J. Hazen and V. W. Zue, "Recent improvements in an approach to segment-based automatic language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, 1994, pp. 1883-1886.
    • (1994) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 1883-1886
    • Hazen, T.J.1    Zue, V.W.2
  • 46
    • 0030893502 scopus 로고    scopus 로고
    • Segment-based automatic language identification
    • T. J. Hazen and V. W. Zue, "Segment-based automatic language identification," J. Acoust. Soc. Amer., vol. 101, no. 4, pp. 2323-2331, 1997.
    • (1997) J. Acoust. Soc. Amer. , vol.101 , Issue.4 , pp. 2323-2331
    • Hazen, T.J.1    Zue, V.W.2
  • 48
    • 0030364814 scopus 로고    scopus 로고
    • Spoken language identification using large vocabulary speech recognition
    • Philadelphia, PA, USA
    • J. Hieronymus and S. Kadambe, "Spoken language identification using large vocabulary speech recognition," in Proc. Int. Conf. Spoken Lang. Process., Philadelphia, PA, USA, 1996, pp. 1780-1783.
    • (1996) Proc. Int. Conf. Spoken Lang. Process , pp. 1780-1783
    • Hieronymus, J.1    Kadambe, S.2
  • 51
    • 0001152481 scopus 로고
    • Toward automatic identification of the language of an utterance. I. Preliminary methodological considerations
    • A. S. House and E. P. Neuburg, "Toward automatic identification of the language of an utterance. I. Preliminary methodological considerations, " J. Acoust. Soc. Amer., vol. 62, no. 3, pp. 708-713, 1977.
    • (1977) J. Acoust. Soc. Amer. , vol.62 , Issue.3 , pp. 708-713
    • House, A.S.1    Neuburg, E.P.2
  • 53
    • 0000262562 scopus 로고
    • Hierarchical mixtures of experts and the em algorithms
    • M. I. Jordan and R. A. Jacobs, "Hierarchical mixtures of experts and the EM algorithms," Neural Comput., vol. 6, pp. 181-214, 1994.
    • (1994) Neural Comput. , vol.6 , pp. 181-214
    • Jordan, M.I.1    Jacobs, R.A.2
  • 54
    • 0031139839 scopus 로고    scopus 로고
    • Minimum classification error rate methods for speech recognition
    • May
    • B. H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.3 , pp. 257-265
    • Juang, B.H.1    Chou, W.2    Lee, C.-H.3
  • 56
    • 33645857482 scopus 로고    scopus 로고
    • Experiments in speaker verification using factor analysis likelihood ratios
    • Toledo, Spain
    • P. Kenny and P. Dumouchel, "Experiments in speaker verification using factor analysis likelihood ratios," in Proc. Odyssey: Speaker Lang. Recognit. Workshop, Toledo, Spain, 2004, pp. 219-226.
    • (2004) Proc. Odyssey: Speaker Lang. Recognit. Workshop , pp. 219-226
    • Kenny, P.1    Dumouchel, P.2
  • 58
    • 36249002496 scopus 로고    scopus 로고
    • Language characteristics
    • T. Schultz and K. Kirchhoff, Eds. Amsterdam, The Netherlands: Elsevier
    • K. Kirchhoff, "Language characteristics," in Multilingual Speech Processing, T. Schultz and K. Kirchhoff, Eds. Amsterdam, The Netherlands: Elsevier, 2006.
    • (2006) Multilingual Speech Processing
    • Kirchhoff, K.1
  • 59
    • 22444454265 scopus 로고
    • Combining classifiers: A theoretical framework
    • J. Kittler, "Combining classifiers: A theoretical framework," Pattern Anal. Appl., no. 1, pp. 18-27, 1988.
    • (1988) Pattern Anal. Appl. , Issue.1 , pp. 18-27
    • Kittler, J.1
  • 60
    • 84865778217 scopus 로고    scopus 로고
    • IVector fusion of prosodic and cepstral features for speaker verification
    • Florence, Italy
    • M. Kockmann, L. Ferrer, L. Burget, and ̌ J. Cernocḱy, "iVector fusion of prosodic and cepstral features for speaker verification," in Proc. Interspeech Conf., Florence, Italy, 2011, pp. 265-268.
    • (2011) Proc. Interspeech Conf , pp. 265-268
    • Kockmann, M.1    Ferrer, L.2    Burget, L.3    Cernocḱy, J.4
  • 61
    • 0036472946 scopus 로고    scopus 로고
    • A theoretical study on six classifier fusion strategies
    • Feb.
    • L. I. Kuncheva, "A theoretical study on six classifier fusion strategies," IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 2, pp. 281-286, Feb. 2002.
    • (2002) IEEE Trans. Pattern Anal. Mach. Intell. , vol.24 , Issue.2 , pp. 281-286
    • Kuncheva, L.I.1
  • 62
    • 85049773640 scopus 로고
    • Language identification using phone-based acoustic likelihoods
    • Adelaide, Australia
    • L. F. Lamel and J. L. Gauvain, "Language identification using phone-based acoustic likelihoods," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, 1994, vol. 1, pp. 293-296.
    • (1994) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.1 , pp. 293-296
    • Lamel, L.F.1    Gauvain, J.L.2
  • 63
    • 64549162996 scopus 로고
    • The OGI 22 language telephone speech corpus
    • Madrid, Spain
    • T. Lander, R. Cole, B. Oshika, and M. Noel, "The OGI 22 language telephone speech corpus," in Proc. Eurospeech Conf., Madrid, Spain, 1995, pp. 817-820.
    • (1995) Proc. Eurospeech Conf , pp. 817-820
    • Lander, T.1    Cole, R.2    Oshika, B.3    Noel, M.4
  • 64
    • 84876693427 scopus 로고    scopus 로고
    • Principles of spoken language recognition
    • J. Benesty, M. M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer-Verlag
    • C.-H. Lee, "Principles of spoken language recognition," in Springer Handbook of Speech Processing and Speech Communication, J. Benesty, M. M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer-Verlag, 2008.
    • (2008) Springer Handbook of Speech Processing and Speech Communication
    • Lee, C.-H.1
  • 65
    • 51449104855 scopus 로고    scopus 로고
    • Spoken language recognition using support vector machines with generative front-end
    • Las Vegas, NV, USA
    • K. A. Lee, C. You, and H. Li, "Spoken language recognition using support vector machines with generative front-end," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Las Vegas, NV, USA, 2008, pp. 4153-4156.
    • (2008) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 4153-4156
    • Lee, K.A.1    You, C.2    Li, H.3
  • 66
    • 79953277529 scopus 로고    scopus 로고
    • Using discrete probabilities with Bhattacharyya measure for SVM-based speaker verification
    • May
    • K. A. Lee, C. H. You, H. Li, T. Kinnunen, and K. C. Sim, "Using discrete probabilities with Bhattacharyya measure for SVM-based speaker verification," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 4, pp. 861-870, May 2011.
    • (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , Issue.4 , pp. 861-870
    • Lee, K.A.1    You, C.H.2    Li, H.3    Kinnunen, T.4    Sim, K.C.5
  • 67
    • 84865768678 scopus 로고    scopus 로고
    • Spoken language recognition in the latent topic simplex
    • Florence, Italy
    • K. A. Lee, C. H. You, V. Hautam̈aki, A. Larcher, and H. Li, "Spoken language recognition in the latent topic simplex," in Proc. Interspeech Conf., Florence, Italy, 2011, pp. 2933-2936.
    • (2011) Proc. Interspeech Conf , pp. 2933-2936
    • Lee, K.A.1    You, C.H.2    Hautam̈aki, V.3    Larcher, A.4    Li, H.5
  • 68
    • 0029747183 scopus 로고    scopus 로고
    • Speaker normalization using efficient frequency warping procedures
    • Atlanta, GA, USA
    • L. Lee and R. C. Rose, "Speaker normalization using efficient frequency warping procedures," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Atlanta, GA, USA, 1996, vol. 1, pp. 353-356.
    • (1996) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.1 , pp. 353-356
    • Lee, L.1    Rose, R.C.2
  • 72
    • 33947677598 scopus 로고    scopus 로고
    • A phonotactic language model for spoken language identification
    • Ann Arbor, MI, USA
    • H. Li and B. Ma, "A phonotactic language model for spoken language identification," in Proc. Assoc. Comput. Linguist., Ann Arbor, MI, USA, 2005, pp. 515-522.
    • (2005) Proc. Assoc. Comput. Linguist , pp. 515-522
    • Li, H.1    Ma, B.2
  • 73
    • 34547502608 scopus 로고    scopus 로고
    • A vector space modeling approach to spoken language identification
    • Jan.
    • H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 1, pp. 271-284, Jan. 2007.
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.1 , pp. 271-284
    • Li, H.1    Ma, B.2    Lee, C.-H.3
  • 75
    • 84994249527 scopus 로고    scopus 로고
    • Vector-based spoken language classification
    • J. Benesty, M. M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer-Verlag
    • H. Li, B. Ma, and C.-H. Lee, "Vector-based spoken language classification," in Springer Handbook of Speech Processing and Speech Communication, J. Benesty, M. M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer-Verlag, 2008.
    • (2008) Springer Handbook of Speech Processing and Speech Communication
    • Li, H.1    Ma, B.2    Lee, C.-H.3
  • 76
    • 85032751399 scopus 로고    scopus 로고
    • TechWare: Speaker and spoken language recognition resources
    • Nov.
    • H. Li and B. Ma, "TechWare: Speaker and spoken language recognition resources," IEEE Signal Process. Mag., vol. 27, no. 6, pp. 139-142, Nov. 2010.
    • (2010) IEEE Signal Process. Mag. , vol.27 , Issue.6 , pp. 139-142
    • Li, H.1    Ma, B.2
  • 78
    • 84876664448 scopus 로고    scopus 로고
    • Machine learning paradigms for speech recognition: An overview
    • accepted for publication
    • X. Li, L. Deng, and J. Bilmes, "Machine learning paradigms for speech recognition: An overview," IEEE Trans. Audio Speech Lang. Process., accepted for publication.
    • IEEE Trans. Audio Speech Lang. Process
    • Li, X.1    Deng, L.2    Bilmes, J.3
  • 79
    • 33646810153 scopus 로고    scopus 로고
    • Using local and global phonotactic features in Chinese dialect identification
    • Philadelphia, PA, USA
    • B. P. Lim, H. Li, and B. Ma, "Using local and global phonotactic features in Chinese dialect identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Philadelphia, PA, USA, 2005, pp. 577-580.
    • (2005) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 577-580
    • Lim, B.P.1    Li, H.2    Ma, B.3
  • 80
    • 85009211861 scopus 로고    scopus 로고
    • Improved speaker verification through probabilistic subspace adaptation
    • Geneva, Switzerland
    • S. Lucey and T. Chen, "Improved speaker verification through probabilistic subspace adaptation," in Proc. Eurospeech Conf., Geneva, Switzerland, 2003, pp. 2021-2024.
    • (2003) Proc. Eurospeech Conf , pp. 2021-2024
    • Lucey, S.1    Chen, T.2
  • 81
    • 85009273139 scopus 로고    scopus 로고
    • Multilingual speech recognition with language identification
    • Denver, CO, USA
    • B. Ma, C. Guan, H. Li, and C.-H. Lee, "Multilingual speech recognition with language identification," in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, USA, 2002, pp. 505-508.
    • (2002) Proc. Int. Conf. Spoken Lang. Process , pp. 505-508
    • Ma, B.1    Guan, C.2    Li, H.3    Lee, C.-H.4
  • 82
    • 60849102345 scopus 로고    scopus 로고
    • Spoken language recognition with ensemble classifiers
    • Sep.
    • B. Ma, H. Li, and R. Tong, "Spoken language recognition with ensemble classifiers," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 7, pp. 2053-2062, Sep. 2007.
    • (2007) IEEE Trans. Audio Speech Lang. Process. , vol.15 , Issue.7 , pp. 2053-2062
    • Ma, B.1    Li, H.2    Tong, R.3
  • 86
    • 37649031157 scopus 로고    scopus 로고
    • The current state of language recognition: NIST 2005 evaluation results
    • San Juan, Puerto Rico DOI: 10.1109/ODYSSEY. 2006.248104
    • A. F. Martin and A. N. Le, "The current state of language recognition: NIST 2005 evaluation results," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY. 2006.248104.
    • (2006) Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
    • Martin, A.F.1    Le, A.N.2
  • 90
    • 33745190265 scopus 로고    scopus 로고
    • Phonotactic language identification using high quality phoneme recognition
    • Lisbon, Portugal
    • P. Matejka, P. Schwarz, J. Cernocky, and P. Chytil, "Phonotactic language identification using high quality phoneme recognition," in Proc. Interspeech Conf., Lisbon, Portugal, 2005, pp. 2237-2240.
    • (2005) Proc. Interspeech Conf , pp. 2237-2240
    • Matejka, P.1    Schwarz, P.2    Cernocky, J.3    Chytil, P.4
  • 91
    • 84867202539 scopus 로고    scopus 로고
    • Beyond frame independent: Parametric modeling of time duration in speaker and language recognition
    • Brisbane, Australia
    • A. McCree, F. Richardson, E. Singer, and D. Reynolds, "Beyond frame independent: Parametric modeling of time duration in speaker and language recognition," in Proc. Interspeech Conf., Brisbane, Australia, 2008, pp. 767-770.
    • (2008) Proc. Interspeech Conf , pp. 767-770
    • McCree, A.1    Richardson, F.2    Singer, E.3    Reynolds, D.4
  • 92
    • 0029725760 scopus 로고    scopus 로고
    • Automatic language identification using large vocabulary continuous speech recognition
    • Atlanta, GA, USA
    • S. Mendoza, L. Gillick, Y. Ito, S. Lowe, and M. Newman, "Automatic language identification using large vocabulary continuous speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Atlanta, GA, USA, 1996, vol. 2, pp. 785-788.
    • (1996) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.2 , pp. 785-788
    • Mendoza, S.1    Gillick, L.2    Ito, Y.3    Lowe, S.4    Newman, M.5
  • 95
    • 0004656027 scopus 로고
    • A comparison of approaches to automatic language identification using telephone speech
    • Berlin, Germany
    • Y. K. Muthusamy, K. M. Berkling, T. Arai, R. A. Cole, and E. Barnard, "A comparison of approaches to automatic language identification using telephone speech," in Proc. Eurospeech Conf., Berlin, Germany, 1993, pp. 1307-1310.
    • (1993) Proc. Eurospeech Conf , pp. 1307-1310
    • Muthusamy, Y.K.1    Berkling, K.M.2    Arai, T.3    Cole, R.A.4    Barnard, E.5
  • 97
    • 0028516964 scopus 로고
    • Reviewing automatic language identification
    • Oct.
    • Y. K. Muthusamy, E. Barnard, and R. A. Cole, "Reviewing automatic language identification," IEEE Signal Process. Mag., vol. 11, no. 4, pp. 33-41, Oct. 1994.
    • (1994) IEEE Signal Process. Mag. , vol.11 , Issue.4 , pp. 33-41
    • Muthusamy, Y.K.1    Barnard, E.2    Cole, R.A.3
  • 98
    • 85079281912 scopus 로고
    • Speaker-independent, text-independent language identification by HMM
    • Banff, AB, Canada
    • S. Nakagawa, Y. Ueda, and T. Seino, "Speaker-independent, text-independent language identification by HMM," in Proc. Int. Conf. Spoken Lang. Process., Banff, AB, Canada, 1992, pp. 1011-1014.
    • (1992) Proc. Int. Conf. Spoken Lang. Process , pp. 1011-1014
    • Nakagawa, S.1    Ueda, Y.2    Seino, T.3
  • 99
    • 0035441593 scopus 로고    scopus 로고
    • Spoken language recognitionVA step toward multilinguality in speech processing
    • Sep.
    • J. Navratil, "Spoken language recognitionVA step toward multilinguality in speech processing," IEEE Trans. Speech Audio Process., vol. 9, no. 6, pp. 678-685, Sep. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.6 , pp. 678-685
    • Navratil, J.1
  • 101
    • 79959820103 scopus 로고    scopus 로고
    • Towards long-range prosodic attribute modeling for language recognition
    • Chiba, Japan
    • R. W. M. Ng, C.-C. Leung, V. Hautam̈aki, T. Lee, B. Ma, and H. Li, "Towards long-range prosodic attribute modeling for language recognition," in Proc. Interspeech Conf., Chiba, Japan, 2010, pp. 1792-1795.
    • (2010) Proc. Interspeech Conf , pp. 1792-1795
    • Ng, R.W.M.1    Leung, C.-C.2    Hautam̈aki, V.3    Lee, T.4    Ma, B.5    Li, H.6
  • 102
    • 84876680475 scopus 로고    scopus 로고
    • NIST Language Recognition Evaluations. [Online]. Available
    • NIST Language Recognition Evaluations. [Online]. Available: http://nist.gov/itl/iad/mig/lre.cfm
  • 103
    • 80052055182 scopus 로고    scopus 로고
    • Improved modeling of cross-decoder phone co-occurrences in SVM-Based phonotactic language recognition
    • Nov.
    • M. Penagarikano, A. Varona, L. J. Rodriguez-Fuentes, and G. Bordel, "Improved modeling of cross-decoder phone co-occurrences in SVM-Based phonotactic language recognition," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 8, pp. 2348-2363, Nov. 2011.
    • (2011) IEEE Trans. Audio Speech Lang. Process. , vol.19 , Issue.8 , pp. 2348-2363
    • Penagarikano, M.1    Varona, A.2    Rodriguez-Fuentes, L.J.3    Bordel, G.4
  • 104
    • 0033902487 scopus 로고    scopus 로고
    • Applying logistic regression to fusion of the NIST'99 1-speaker submissions
    • S. Pigeon, P. Druyts, and P. Verlinde, "Applying logistic regression to fusion of the NIST'99 1-speaker submissions," Digital Signal Process., vol. 10, pp. 237-248, 2000.
    • (2000) Digital Signal Process. , vol.10 , pp. 237-248
    • Pigeon, S.1    Druyts, P.2    Verlinde, P.3
  • 107
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected publication in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected publication in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 108
    • 0032943763 scopus 로고    scopus 로고
    • Language identification with suprasegmental cues: A study based on speech re-synthesis
    • F. Ramus and J. Mehler, "Language identification with suprasegmental cues: A study based on speech re-synthesis," J. Acoust. Soc. Amer., vol. 105, no. 1, pp. 512-521, 1999.
    • (1999) J. Acoust. Soc. Amer. , vol.105 , Issue.1 , pp. 512-521
    • Ramus, F.1    Mehler, J.2
  • 109
    • 0032725252 scopus 로고    scopus 로고
    • Correlates of linguistic rhythm in the speech signal
    • R. Ramus, M. Nespor, and J. Mehler, "Correlates of linguistic rhythm in the speech signal," Cognition, vol. 73, no. 3, pp. 265-292, 1999.
    • (1999) Cognition , vol.73 , Issue.3 , pp. 265-292
    • Ramus, R.1    Nespor, M.2    Mehler, J.3
  • 110
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan.
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 111
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, 2000.
    • (2000) Digital Signal Process. , vol.10 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 114
    • 45549117987 scopus 로고
    • Term-weighting approaches in automatic text retrieval
    • G. Salton and C. Buckley, "Term-weighting approaches in automatic text retrieval," Inf. Process. Manage., vol. 24, no. 5, pp. 513-523, 1988.
    • (1988) Inf. Process. Manage. , vol.24 , Issue.5 , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 116
    • 0035426931 scopus 로고    scopus 로고
    • Language independent and language adaptive
    • T. Schultz and A. Waibel, "Language independent and language adaptive," Speech Commun., vol. 35, no. 1-2, pp. 31-51, 2001.
    • (2001) Speech Commun. , vol.35 , Issue.1-2 , pp. 31-51
    • Schultz, T.1    Waibel, A.2
  • 117
    • 85009274666 scopus 로고    scopus 로고
    • Globalphone: A multilingual text and speech database developed at Karlsruhe University
    • Denver, CO, USA
    • T. Schultz, "Globalphone: A multilingual text and speech database developed at Karlsruhe University," in Proc. Interspeech Conf., Denver, CO, USA, 2002, pp. 345-348.
    • (2002) Proc. Interspeech Conf , pp. 345-348
    • Schultz, T.1
  • 119
    • 51449123703 scopus 로고    scopus 로고
    • Improved GMM-Based language recognition using constrained MLLR transforms
    • Las Vegas, NV, USA
    • W. Shen and D. A. Reynolds, "Improved GMM-Based language recognition using constrained MLLR transforms," in Proc. Int. Conf. Acoust. Speech Signal Process., Las Vegas, NV, USA, 2008, pp. 4149-4152.
    • (2008) Proc. Int. Conf. Acoust. Speech Signal Process , pp. 4149-4152
    • Shen, W.1    Reynolds, D.A.2
  • 120
    • 66149124829 scopus 로고    scopus 로고
    • On acoustic diversification front-end for spoken language identification
    • Jul.
    • K. C. Sim and H. Li, "On acoustic diversification front-end for spoken language identification," IEEE Trans. Audio Speech Lang. Process., vol. 16, no. 5, pp. 1029-1037, Jul. 2008.
    • (2008) IEEE Trans. Audio Speech Lang. Process. , vol.16 , Issue.5 , pp. 1029-1037
    • Sim, K.C.1    Li, H.2
  • 122
    • 70450159475 scopus 로고    scopus 로고
    • Exploring universal attribute characterization of spoken languages for spoken language recognition
    • Brighton, U.K.
    • S. M. Siniscalchi, J. Reed, T. Svendsen, and C.-H. Lee, "Exploring universal attribute characterization of spoken languages for spoken language recognition," in Proc. Interspeech Conf., Brighton, U.K., 2009, pp. 168-171.
    • (2009) Proc. Interspeech Conf , pp. 168-171
    • Siniscalchi, S.M.1    Reed, J.2    Svendsen, T.3    Lee, C.-H.4
  • 123
    • 79959820578 scopus 로고    scopus 로고
    • Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition
    • Chiba, Japan
    • S. M. Siniscalchi, J. Reed, T. Svendsen, and C.-H. Lee, "Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition," in Proc. Interspeech Conf., Chiba, Japan, 2010, pp. 2718-2721.
    • (2010) Proc. Interspeech Conf , pp. 2718-2721
    • Siniscalchi, S.M.1    Reed, J.2    Svendsen, T.3    Lee, C.-H.4
  • 125
    • 0026404756 scopus 로고
    • Automatic language recognition using acoustic features
    • Toronto, ON, Canada
    • M. Sugiyama, "Automatic language recognition using acoustic features," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Toronto, ON, Canada, 1991, vol. 2, pp. 813-816.
    • (1991) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.2 , pp. 813-816
    • Sugiyama, M.1
  • 126
    • 0033556788 scopus 로고    scopus 로고
    • Mixtures of probabilistic principal component analysis
    • M. E. Tipping and C. M. Bishop, "Mixtures of probabilistic principal component analysis," Neural Comput., vol. 11, no. 2, pp. 443-482, 1999.
    • (1999) Neural Comput. , vol.11 , Issue.2 , pp. 443-482
    • Tipping, M.E.1    Bishop, C.M.2
  • 127
    • 33947644912 scopus 로고    scopus 로고
    • Integrating acoustic, prosodic and phonotactic features for spoken language identification
    • Toulouse, France
    • R. Tong, B. Ma, D. Zhu, H. Li, and E.-S. Chng, "Integrating acoustic, prosodic and phonotactic features for spoken language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Toulouse, France, 2006, pp. 205-208.
    • (2006) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 205-208
    • Tong, R.1    Ma, B.2    Zhu, D.3    Li, H.4    Chng, E.-S.5
  • 128
    • 68549110291 scopus 로고    scopus 로고
    • A target-oriented phonotactic front-end for spoken language recognition
    • Sep.
    • R. Tong, B. Ma, H. Li, and E. Chng, "A target-oriented phonotactic front-end for spoken language recognition," IEEE Trans. Audio Speech Lang. Process., vol. 17, no. 7, pp. 1335-1347, Sep. 2009.
    • (2009) IEEE Trans. Audio Speech Lang. Process. , vol.17 , Issue.7 , pp. 1335-1347
    • Tong, R.1    Ma, B.2    Li, H.3    Chng, E.4
  • 129
    • 84865770491 scopus 로고    scopus 로고
    • Target-aware lattice rescoring for dialect recognition
    • Florence, Italy
    • R. Tong, B. Ma, H. Li, and E. Chng, "Target-aware lattice rescoring for dialect recognition," in Proc. Interspeech Conf., Florence, Italy, 2011, pp. 733-736.
    • (2011) Proc. Interspeech Conf , pp. 733-736
    • Tong, R.1    Ma, B.2    Li, H.3    Chng, E.4
  • 134
    • 42749098416 scopus 로고    scopus 로고
    • Channel-dependent GMM and multi-class logistic regression models for language recognition
    • San Juan, Puerto Rico DOI: 10.1109/ODYSSEY. 2006.248094
    • D. A. van Leeuwen and N. Brummer, "Channel-dependent GMM and multi-class logistic regression models for language recognition," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY. 2006.248094.
    • (2006) Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
    • Van Leeuwen, D.A.1    Brummer, N.2
  • 135
    • 36248952139 scopus 로고    scopus 로고
    • An introduction to application independent evaluation of speaker recognition systems
    • R. Müller, Ed. Berlin, Germany: Springer-Verlag
    • D. A. van Leeuwen and N. Brümmer, "An introduction to application independent evaluation of speaker recognition systems," in Speaker Classification, vol. 4343, R. Müller, Ed. Berlin, Germany: Springer-Verlag, 2007.
    • (2007) Speaker Classification , vol.4343
    • Van Leeuwen, D.A.1    Brümmer, N.2
  • 136
    • 70450121368 scopus 로고    scopus 로고
    • An open-set detection evaluation methodology applied to language and emotion recognition
    • Antwerp, Belgium
    • D. A. van Leeuwen and K. P. Truong, "An open-set detection evaluation methodology applied to language and emotion recognition," in Proc. Interspeech Conf., Antwerp, Belgium, 2007, pp. 338-341.
    • (2007) Proc. Interspeech Conf , pp. 338-341
    • Van Leeuwen, D.A.1    Truong, K.P.2
  • 139
    • 42749106196 scopus 로고    scopus 로고
    • Channel factors compensation in model and feature domain for speaker recognition
    • San Juan, Puerto Rico DOI: 10.1109/ODYSSEY.2006.248117
    • C. Vair, D. Colibro, F. Castaldo, E. Dalmasso, and P. Laface, "Channel factors compensation in model and feature domain for speaker recognition," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY.2006.248117.
    • (2006) Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
    • Vair, C.1    Colibro, D.2    Castaldo, F.3    Dalmasso, E.4    Laface, P.5
  • 140
    • 34548248573 scopus 로고    scopus 로고
    • Explicit modelling of session variability for speaker verification
    • R. Vogt and S. Sridharan, "Explicit modelling of session variability for speaker verification," Comput. Speech Lang., vol. 22, pp. 17-38, 2008.
    • (2008) Comput. Speech Lang. , vol.22 , pp. 17-38
    • Vogt, R.1    Sridharan, S.2
  • 141
    • 0012327341 scopus 로고    scopus 로고
    • Multilinguality in speech and spoken language systems
    • Aug.
    • A. Waibel, P. Geutner, L. M. Tomokiyo, T. Schultz, and M. Woszczyna, "Multilinguality in speech and spoken language systems," Proc. IEEE, vol. 88, no. 8, pp. 1181-1190, Aug. 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1181-1190
    • Waibel, A.1    Geutner, P.2    Tomokiyo, L.M.3    Schultz, T.4    Woszczyna, M.5
  • 142
    • 0036461035 scopus 로고    scopus 로고
    • Large scale discriminative training of hidden Markov models for speech recognition
    • P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models for speech recognition," Comput. Speech Lang., vol. 16, no. 1, pp. 25-47, 2002.
    • (2002) Comput. Speech Lang. , vol.16 , Issue.1 , pp. 25-47
    • Woodland, P.C.1    Povey, D.2
  • 143
    • 0028996642 scopus 로고
    • An approach to automatic language identification based on language-dependent phone recognition
    • Detroit, MI, USA
    • Y. Yan and E. Barnard, "An approach to automatic language identification based on language-dependent phone recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Detroit, MI, USA, 1995, vol. 5, pp. 3511-3514.
    • (1995) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.5 , pp. 3511-3514
    • Yan, Y.1    Barnard, E.2
  • 144
    • 0029755106 scopus 로고    scopus 로고
    • Development of an approach to language identification based on phone recognition
    • Y. Yan, E. Barnard, and R. Cole, "Development of an approach to language identification based on phone recognition," Comput. Speech Lang., vol. 10, pp. 37-54, 1996.
    • (1996) Comput. Speech Lang. , vol.10 , pp. 37-54
    • Yan, Y.1    Barnard, E.2    Cole, R.3
  • 145
    • 77955790894 scopus 로고    scopus 로고
    • GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition
    • Aug.
    • C. H. You, K. A. Lee, and H. Li, "GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 18, no. 6, pp. 1300-1312, Aug. 2010.
    • (2010) IEEE Trans. Audio Speech Lang. Process. , vol.18 , Issue.6 , pp. 1300-1312
    • You, C.H.1    Lee, K.A.2    Li, H.3
  • 146
    • 84863799477 scopus 로고    scopus 로고
    • A GMM-supervector approach to language recognition with adaptive relevance factor
    • Aalborg, Denmark
    • C. H. You, H. Li, and K. A. Lee, "A GMM-supervector approach to language recognition with adaptive relevance factor," in Proc. EUSIPCO, Aalborg, Denmark, 2010, pp. 1993-1997.
    • (2010) Proc. EUSIPCO , pp. 1993-1997
    • You, C.H.1    Li, H.2    Lee, K.A.3
  • 147
    • 54149098943 scopus 로고    scopus 로고
    • Cortical competition during language discrimination
    • J. Zhao, H. Shu, L. Zhang, X. Wang, Q. Gong, and P. Li, "Cortical competition during language discrimination," NeuroImage, vol. 43, pp. 624-633, 2008.
    • (2008) NeuroImage , vol.43 , pp. 624-633
    • Zhao, J.1    Shu, H.2    Zhang, L.3    Wang, X.4    Gong, Q.5    Li, P.6
  • 148
    • 78049394638 scopus 로고    scopus 로고
    • Soft margin estimation of Gaussian mixture model parameters for spoken language recognition
    • Dallas, TX, USA
    • D. Zhu, B. Ma, and H. Li, "Soft margin estimation of Gaussian mixture model parameters for spoken language recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Dallas, TX, USA, 2010, pp. 4990-4993.
    • (2010) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , pp. 4990-4993
    • Zhu, D.1    Ma, B.2    Li, H.3
  • 149
    • 0027316611 scopus 로고
    • Automatic language identification using Gaussian mixture and hidden Markov models
    • Minneapolis, MN, USA
    • M. A. Zissman, "Automatic language identification using Gaussian mixture and hidden Markov models," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Minneapolis, MN, USA, 1993, vol. 2, pp. 399-402.
    • (1993) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.2 , pp. 399-402
    • Zissman, M.A.1
  • 150
    • 0029733178 scopus 로고    scopus 로고
    • Comparison of four approaches to automatic language identification of telephone speech
    • Jan.
    • M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 31-44, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 31-44
    • Zissman, M.A.1
  • 151
    • 85135186373 scopus 로고    scopus 로고
    • Predicting, diagnosing and improving automatic language identification performance
    • Rhodes, Greece
    • M. A. Zissman, "Predicting, diagnosing and improving automatic language identification performance," in Proc. Eurospeech Conf., Rhodes, Greece, 1997, pp. 51-54.
    • (1997) Proc. Eurospeech Conf , pp. 51-54
    • Zissman, M.A.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.