-
1
-
-
79958844455
-
Language identification: A tutorial
-
May
-
E. Ambikairajah, H. Li, L. Wang, B. Yin, and V. Sethu, "Language identification: A tutorial," IEEE Circuits Syst. Mag., vol. 11, no. 2, pp. 82-108, May 2011.
-
(2011)
IEEE Circuits Syst. Mag.
, vol.11
, Issue.2
, pp. 82-108
-
-
Ambikairajah, E.1
Li, H.2
Wang, L.3
Yin, B.4
Sethu, V.5
-
3
-
-
84858394637
-
Language identification: The long and the short of the matter
-
Los Angeles, CA, USA
-
T. Baldwin and M. Lui, "Language identification: The long and the short of the matter," in Proc. Annu. Conf. North Amer. Chapter ACL Human Lang. Technol., Los Angeles, CA, USA, pp. 229-237.
-
Proc. Annu. Conf. North Amer. Chapter ACL Human Lang. Technol
, pp. 229-237
-
-
Baldwin, T.1
Lui, M.2
-
4
-
-
0004694842
-
Analysis of phoneme-based features for language identification
-
Adelaide, Australia
-
K. M. Berkling and E. Barnard, "Analysis of phoneme-based features for language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, 1994, pp. 289-292.
-
(1994)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 289-292
-
-
Berkling, K.M.1
Barnard, E.2
-
5
-
-
85135152779
-
Language identification of six languages based on a common set of broad phonemes
-
Yokohama, Japan
-
K. M. Berkling and E. Barnard, "Language identification of six languages based on a common set of broad phonemes," in Proc. Int. Conf. Spoken Lang. Process., Yokohama, Japan, 1994, pp. 1891-1894.
-
(1994)
Proc. Int. Conf. Spoken Lang. Process
, pp. 1891-1894
-
-
Berkling, K.M.1
Barnard, E.2
-
6
-
-
85073120304
-
Discriminative phonotactics for dialect recognition using context-dependent phone classifiers
-
Brno, Czech Republic
-
F. Biadsy, H. Soltauy, L. Manguy, J. Navratily, and J. Hirschberg, "Discriminative phonotactics for dialect recognition using context-dependent phone classifiers," in Proc. IEEE Odyssey: Speaker and Language Recognition Workshop, Brno, Czech Republic, 2010, pp. 263-270.
-
(2010)
Proc. IEEE Odyssey: Speaker and Language Recognition Workshop
, pp. 263-270
-
-
Biadsy, F.1
Soltauy, H.2
Manguy, L.3
Navratily, J.4
Hirschberg, J.5
-
7
-
-
24744468841
-
Automatic language identification with perceptually guided training and recurrent neural networks
-
Sydney, Australia
-
J. Braun and H. Levkowitz, "Automatic language identification with perceptually guided training and recurrent neural networks," in Proc. Int. Conf. Spoken Lang. Process., Sydney, Australia, 1998, pp. 289-292.
-
(1998)
Proc. Int. Conf. Spoken Lang. Process
, pp. 289-292
-
-
Braun, J.1
Levkowitz, H.2
-
8
-
-
42749108057
-
On calibration of language recognition scores
-
San Juan, Puerto Rico DOI: 10.1109/ODYSSEY.2006.248106
-
N. Brummer and D. Leeuwen, "On calibration of language recognition scores," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY.2006.248106.
-
(2006)
Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
-
-
Brummer, N.1
Leeuwen, D.2
-
9
-
-
29044433376
-
Application-independent evaluation of speaker detection
-
N. Brummer and J. Preez, "Application-independent evaluation of speaker detection," Comput. Speech Lang., vol. 20, no. 2, pp. 230-275, 2006.
-
(2006)
Comput. Speech Lang.
, vol.20
, Issue.2
, pp. 230-275
-
-
Brummer, N.1
Preez, J.2
-
11
-
-
84876673769
-
BUT-AGNITIO system description for NIST language recognition evaluation 2009
-
Baltimore, MD, USA
-
N. Brummer, L. Burget, O. Glembek, V. Hubeika, Z. Jancik, M. Karafiat, P. Matejka, T. Mikolov, O. Plchot, and A. Strasheim, "BUT-AGNITIO system description for NIST language recognition evaluation 2009," in Proc. NIST Lang. Recognit. Eval. Workshop, Baltimore, MD, USA, 2009.
-
(2009)
Proc. NIST Lang. Recognit. Eval. Workshop
-
-
Brummer, N.1
Burget, L.2
Glembek, O.3
Hubeika, V.4
Jancik, Z.5
Karafiat, M.6
Matejka, P.7
Mikolov, T.8
Plchot, O.9
Strasheim, A.10
-
12
-
-
80052047297
-
-
Ph.D. dissertation, Dept. Electr. Electron. Eng., Stellenbosch Univ., Stellenbosch, South Africa
-
N. Brümmer, "Measuring, refining and calibrating speaker and language information extracted from speech," Ph.D. dissertation, Dept. Electr. Electron. Eng., Stellenbosch Univ., Stellenbosch, South Africa, 2010.
-
(2010)
Measuring, Refining and Calibrating Speaker and Language Information Extracted from Speech
-
-
Brümmer, N.1
-
13
-
-
85073192959
-
Description and analysis of the Brno276 system for LRE2011
-
N. Brümmer, S. Cumani, O. Glembek, M. Karafíat, P. Maťejka, J. Peš́an, O. Plchot, M. Soufifar, E. de Villiers, and ̌ J. Cernocḱy, "Description and analysis of the Brno276 system for LRE2011," in Proc. Odyssey: Speaker Lang. Recognit. Workshop, Singapore, 2012, pp. 216-223.
-
(2012)
Proc. Odyssey: Speaker Lang. Recognit. Workshop, Singapore
, pp. 216-223
-
-
Brümmer, N.1
Cumani, S.2
Glembek, O.3
Karafíat, M.4
Maťejka, P.5
Peš́an, J.6
Plchot, O.7
Soufifar, M.8
De Villiers, E.9
Cernocḱy, J.10
-
14
-
-
33947660079
-
Discriminative training techniques for acoustic language identification
-
Toulouse, France
-
L. Burget, P. Matejka, and J. Cernocky, "Discriminative training techniques for acoustic language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Toulouse, France, 2006, pp. 209-212.
-
(2006)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 209-212
-
-
Burget, L.1
Matejka, P.2
Cernocky, J.3
-
15
-
-
4544277099
-
High-level speaker verification with support vector machines
-
Montreal, QC, Canada
-
W. M. Campbell, J. P. Campbell, D. A. Reynolds, D. A. Jones, and T. R. Leek, "High-level speaker verification with support vector machines," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Montreal, QC, Canada, 2004, pp. 73-76.
-
(2004)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 73-76
-
-
Campbell, W.M.1
Campbell, J.P.2
Reynolds, D.A.3
Jones, D.A.4
Leek, T.R.5
-
16
-
-
29044444825
-
Support vector machines for speaker and language recognition
-
W. M. Campbell, J. P. Campbell, D. A. Reynolds, E. Singer, and P. A. Torres-Carrasquillo, "Support vector machines for speaker and language recognition," Comput. Speech Lang., vol. 20, no. 2-3, pp. 210-229, 2006.
-
(2006)
Comput. Speech Lang.
, vol.20
, Issue.2-3
, pp. 210-229
-
-
Campbell, W.M.1
Campbell, J.P.2
Reynolds, D.A.3
Singer, E.4
Torres-Carrasquillo, P.A.5
-
17
-
-
37649028010
-
Advanced language recognition using cepstra and phonotactics: MITLL system performance on the NIST 2005 language recognition evaluation
-
San Juan, Puerto Rico DOI: 10.1109/ODYSSEY.2006.248097
-
W. Campbell, T. Gleason, J. Navratil, D. Reynolds, W. Shen, E. Singer, and P. Torres-Carrasquillo, "Advanced language recognition using cepstra and phonotactics: MITLL system performance on the NIST 2005 language recognition evaluation," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY.2006.248097.
-
(2006)
Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
-
-
Campbell, W.1
Gleason, T.2
Navratil, J.3
Reynolds, D.4
Shen, W.5
Singer, E.6
Torres-Carrasquillo, P.7
-
18
-
-
33645887246
-
Support vector machines using GMM supervectors for speaker verification
-
May
-
W. M. Campbell, D. E. Sturim, and D. A. Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-310, May 2006.
-
(2006)
IEEE Signal Process. Lett.
, vol.13
, Issue.5
, pp. 308-310
-
-
Campbell, W.M.1
Sturim, D.E.2
Reynolds, D.A.3
-
19
-
-
51549119947
-
A covariance Kernel for SVM language recognition
-
Las Vegas, NV, USA
-
W. M. Campbell, "A covariance Kernel for SVM language recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Las Vegas, NV, USA, 2008, pp. 4141-4144.
-
(2008)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 4141-4144
-
-
Campbell, W.M.1
-
20
-
-
84867216951
-
A comparison of subspace feature-domain methods for language recognition
-
Brisbane, Australia
-
W. M. Campbell, D. E. Sturim, P. Torres-Carrasquillo, and D. A. Reynolds, "A comparison of subspace feature-domain methods for language recognition," in Proc. Interspeech Conf., Brisbane, Australia, 2008, pp. 309-312.
-
(2008)
Proc. Interspeech Conf
, pp. 309-312
-
-
Campbell, W.M.1
Sturim, D.E.2
Torres-Carrasquillo, P.3
Reynolds, D.A.4
-
21
-
-
43249126081
-
Compensation of nuisance factors for speaker and language recognition
-
Sep.
-
F. Castaldo, D. Colibro, E. Dalmasso, P. Laface, and C. Vair, "Compensation of nuisance factors for speaker and language recognition," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 7, pp. 1969-1978, Sep. 2007.
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, Issue.7
, pp. 1969-1978
-
-
Castaldo, F.1
Colibro, D.2
Dalmasso, E.3
Laface, P.4
Vair, C.5
-
22
-
-
0002636321
-
N-gram-based text categorization
-
Las Vegas, NV, USA
-
W. B. Cavnar and J. M. Trenkle, "N-gram-based text categorization," in Proc. 3rd Annu. Symp. Document Anal. Inf. Retrieval, Las Vegas, NV, USA, 1994, pp. 161-175.
-
(1994)
Proc. 3rd Annu. Symp. Document Anal. Inf. Retrieval
, pp. 161-175
-
-
Cavnar, W.B.1
Trenkle, J.M.2
-
23
-
-
85032751967
-
Retrieval and browsing of spoken content
-
May
-
C. Chelba, T. Hazen, and M. Saraclar, "Retrieval and browsing of spoken content," IEEE Signal Process. Mag., vol. 25, no. 3, pp. 39-49, May 2008.
-
(2008)
IEEE Signal Process. Mag.
, vol.25
, Issue.3
, pp. 39-49
-
-
Chelba, C.1
Hazen, T.2
Saraclar, M.3
-
24
-
-
0000567234
-
Vector-based natural language call routing
-
J. Chu-Carrol and B. Carpenter, "Vector-based natural language call routing," Comput. Linguist., vol. 25, no. 3, pp. 361-388, 1999.
-
(1999)
Comput. Linguist.
, vol.25
, Issue.3
, pp. 361-388
-
-
Chu-Carrol, J.1
Carpenter, B.2
-
25
-
-
84921813726
-
The mixer corpus of multilingual, multichannel speaker recognition data
-
Lisbon, Portugal
-
C. Cieri, J. P. Campbell, H. Nakasone, D. Miller, and K. Walker, "The mixer corpus of multilingual, multichannel speaker recognition data," in Proc. Int. Conf. Lang. Resources Eval., Lisbon, Portugal, 2004, pp. 24-30.
-
(2004)
Proc. Int. Conf. Lang. Resources Eval
, pp. 24-30
-
-
Cieri, C.1
Campbell, J.P.2
Nakasone, H.3
Miller, D.4
Walker, K.5
-
26
-
-
70450210553
-
The broadcast narrow band speech corpus: A new resource type for large scale language recognition
-
Brighton, U.K.
-
C. Cieri, L. Brandschain, A. Neely, D. Graff, K. Walker, C. Caruso, A. Martin, and C. Greenberg, "The broadcast narrow band speech corpus: A new resource type for large scale language recognition," in Proc. Interspeech Conf., Brighton, U.K., 2009, pp. 2867-2870.
-
(2009)
Proc. Interspeech Conf
, pp. 2867-2870
-
-
Cieri, C.1
Brandschain, L.2
Neely, A.3
Graff, D.4
Walker, K.5
Caruso, C.6
Martin, A.7
Greenberg, C.8
-
27
-
-
0004717613
-
Development of an automatic identification system of spoken languages: Phase i
-
Paris, France
-
D. Cimarusti and R. Ives, "Development of an automatic identification system of spoken languages: Phase I," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Paris, France, 1982, pp. 1661-1663.
-
(1982)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 1661-1663
-
-
Cimarusti, D.1
Ives, R.2
-
31
-
-
84859066901
-
Analysis of large-scale SVM training algorithms for language and speaker recognition
-
Jul.
-
S. Cumani and P. Laface, "Analysis of large-scale SVM training algorithms for language and speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 20, no. 5, pp. 1585-1596, Jul. 2012.
-
(2012)
IEEE Trans. Audio Speech Lang. Process.
, vol.20
, Issue.5
, pp. 1585-1596
-
-
Cumani, S.1
Laface, P.2
-
32
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Aug.
-
S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust. Speech Signal Process., vol. 28, no. 4, pp. 357-366, Aug. 1980.
-
(1980)
IEEE Trans. Acoust. Speech Signal Process.
, vol.28
, Issue.4
, pp. 357-366
-
-
Davis, S.1
Mermelstein, P.2
-
33
-
-
64249101047
-
Modeling prosodic features with joint factor analysis for speaker verification
-
Sep.
-
N. Dehak, P. Dumouchel, and P. Kenny, "Modeling prosodic features with joint factor analysis for speaker verification," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 7, pp. 2095-2103, Sep. 2007.
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, Issue.7
, pp. 2095-2103
-
-
Dehak, N.1
Dumouchel, P.2
Kenny, P.3
-
34
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
May
-
N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Acoust. Speech Signal Process., vol. 19, no. 4, pp. 788-798, May 2011.
-
(2011)
IEEE Trans. Acoust. Speech Signal Process.
, vol.19
, Issue.4
, pp. 788-798
-
-
Dehak, N.1
Kenny, P.J.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
35
-
-
84865750857
-
Language recognition via i-vectors and dimensionality reduction
-
Florence, Italy
-
N. Dehak, P. Torres-Carrasquillo, D. Reynolds, and R. Dehak, "Language recognition via i-vectors and dimensionality reduction," in Proc. Interspeech Conf., Florence, Italy, 2011, pp. 857-860.
-
(2011)
Proc. Interspeech Conf
, pp. 857-860
-
-
Dehak, N.1
Torres-Carrasquillo, P.2
Reynolds, D.3
Dehak, R.4
-
36
-
-
0002629270
-
Maximum likelihood from incomplete data via the em algorithm
-
A. Dumpster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Stat. Soc., vol. 39, pp. 1-38, 1977.
-
(1977)
J. R. Stat. Soc.
, vol.39
, pp. 1-38
-
-
Dumpster, A.1
Laird, N.2
Rubin, D.3
-
37
-
-
0003984557
-
Statistical identification of language
-
New Mexico State Univ., Las Cruces, NM, USA, Tech. Rep. MCCS-94-273
-
T. Dunning, "Statistical identification of language," Comput. Res. Lab (CRL), New Mexico State Univ., Las Cruces, NM, USA, Tech. Rep. MCCS-94-273, 1994.
-
(1994)
Comput. Res. Lab (CRL)
-
-
Dunning, T.1
-
39
-
-
0028419019
-
Maximum a posterior estimation for multivariate Gaussian mixture observations of Markov chains
-
Apr.
-
J. L. Gauvain and C.-H. Lee, "Maximum a posterior estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.L.1
Lee, C.-H.2
-
40
-
-
85009084315
-
Language recognition using phone lattices
-
Jeju Island, Korea
-
J. L. Gauvain, A. Messaoudi, and H. Schwenk, "Language recognition using phone lattices," in Proc. Int. Conf. Spoken Lang. Process., Jeju Island, Korea, 2004, pp. 1283-1286.
-
(2004)
Proc. Int. Conf. Spoken Lang. Process
, pp. 1283-1286
-
-
Gauvain, J.L.1
Messaoudi, A.2
Schwenk, H.3
-
41
-
-
49949150022
-
Language identification in the limit
-
E. M. Gold, "Language identification in the limit," Inf. Control, vol. 10, no. 5, pp. 447-474, 1967.
-
(1967)
Inf. Control
, vol.10
, Issue.5
, pp. 447-474
-
-
Gold, E.M.1
-
42
-
-
0024912654
-
Improved automatic language identification in noisy speech
-
Glasgow, Scotland
-
F. J. Goodman, A. F. Martin, and R. E. Wohlford, "Improved automatic language identification in noisy speech," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Glasgow, Scotland, 1989, vol. 1, pp. 528-531.
-
(1989)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.1
, pp. 528-531
-
-
Goodman, F.J.1
Martin, A.F.2
Wohlford, R.E.3
-
44
-
-
85135379346
-
Automatic language identification using a segment-based approach
-
Berlin, Germany
-
T. J. Hazen and V. W. Zue, "Automatic language identification using a segment-based approach," in Proc. Eurospeech Conf., Berlin, Germany, 1993, pp. 1303-1306.
-
(1993)
Proc. Eurospeech Conf
, pp. 1303-1306
-
-
Hazen, T.J.1
Zue, V.W.2
-
45
-
-
84863902506
-
Recent improvements in an approach to segment-based automatic language identification
-
Adelaide, Australia
-
T. J. Hazen and V. W. Zue, "Recent improvements in an approach to segment-based automatic language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, 1994, pp. 1883-1886.
-
(1994)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 1883-1886
-
-
Hazen, T.J.1
Zue, V.W.2
-
46
-
-
0030893502
-
Segment-based automatic language identification
-
T. J. Hazen and V. W. Zue, "Segment-based automatic language identification," J. Acoust. Soc. Amer., vol. 101, no. 4, pp. 2323-2331, 1997.
-
(1997)
J. Acoust. Soc. Amer.
, vol.101
, Issue.4
, pp. 2323-2331
-
-
Hazen, T.J.1
Zue, V.W.2
-
47
-
-
0028517164
-
RASTA processing of speech
-
Oct.
-
H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, Oct. 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.4
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
48
-
-
0030364814
-
Spoken language identification using large vocabulary speech recognition
-
Philadelphia, PA, USA
-
J. Hieronymus and S. Kadambe, "Spoken language identification using large vocabulary speech recognition," in Proc. Int. Conf. Spoken Lang. Process., Philadelphia, PA, USA, 1996, pp. 1780-1783.
-
(1996)
Proc. Int. Conf. Spoken Lang. Process
, pp. 1780-1783
-
-
Hieronymus, J.1
Kadambe, S.2
-
49
-
-
0033350721
-
Products of experts
-
Edinburgh, U.K.
-
G. Hinton, "Products of experts," in Proc. 9th Int. Conf. Artif. Neural Netw., Edinburgh, U.K., 1999, vol. 1, pp. 1-6.
-
(1999)
Proc. 9th Int. Conf. Artif. Neural Netw
, vol.1
, pp. 1-6
-
-
Hinton, G.1
-
50
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
Nov.
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. Sainath, and B. Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, Nov. 2012.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.10
Kingsbury, B.11
-
51
-
-
0001152481
-
Toward automatic identification of the language of an utterance. I. Preliminary methodological considerations
-
A. S. House and E. P. Neuburg, "Toward automatic identification of the language of an utterance. I. Preliminary methodological considerations, " J. Acoust. Soc. Amer., vol. 62, no. 3, pp. 708-713, 1977.
-
(1977)
J. Acoust. Soc. Amer.
, vol.62
, Issue.3
, pp. 708-713
-
-
House, A.S.1
Neuburg, E.P.2
-
52
-
-
0004056285
-
-
Englewood Cliffs NJ USA: Prentice-Hall
-
X. Huang, A. Acero, and H. W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Englewood Cliffs, NJ, USA: Prentice-Hall, 2001.
-
(2001)
Spoken Language Processing: A Guide to Theory Algorithm and System Development
-
-
Huang, X.1
Acero, A.2
Hon, H.W.3
-
53
-
-
0000262562
-
Hierarchical mixtures of experts and the em algorithms
-
M. I. Jordan and R. A. Jacobs, "Hierarchical mixtures of experts and the EM algorithms," Neural Comput., vol. 6, pp. 181-214, 1994.
-
(1994)
Neural Comput.
, vol.6
, pp. 181-214
-
-
Jordan, M.I.1
Jacobs, R.A.2
-
54
-
-
0031139839
-
Minimum classification error rate methods for speech recognition
-
May
-
B. H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997.
-
(1997)
IEEE Trans. Speech Audio Process.
, vol.5
, Issue.3
, pp. 257-265
-
-
Juang, B.H.1
Chou, W.2
Lee, C.-H.3
-
56
-
-
33645857482
-
Experiments in speaker verification using factor analysis likelihood ratios
-
Toledo, Spain
-
P. Kenny and P. Dumouchel, "Experiments in speaker verification using factor analysis likelihood ratios," in Proc. Odyssey: Speaker Lang. Recognit. Workshop, Toledo, Spain, 2004, pp. 219-226.
-
(2004)
Proc. Odyssey: Speaker Lang. Recognit. Workshop
, pp. 219-226
-
-
Kenny, P.1
Dumouchel, P.2
-
57
-
-
50249170027
-
Joint factor analysis versus eigenchannels in speaker recognition
-
May
-
P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Joint factor analysis versus eigenchannels in speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 4, pp. 1435-1447, May 2007.
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, Issue.4
, pp. 1435-1447
-
-
Kenny, P.1
Boulianne, G.2
Ouellet, P.3
Dumouchel, P.4
-
58
-
-
36249002496
-
Language characteristics
-
T. Schultz and K. Kirchhoff, Eds. Amsterdam, The Netherlands: Elsevier
-
K. Kirchhoff, "Language characteristics," in Multilingual Speech Processing, T. Schultz and K. Kirchhoff, Eds. Amsterdam, The Netherlands: Elsevier, 2006.
-
(2006)
Multilingual Speech Processing
-
-
Kirchhoff, K.1
-
59
-
-
22444454265
-
Combining classifiers: A theoretical framework
-
J. Kittler, "Combining classifiers: A theoretical framework," Pattern Anal. Appl., no. 1, pp. 18-27, 1988.
-
(1988)
Pattern Anal. Appl.
, Issue.1
, pp. 18-27
-
-
Kittler, J.1
-
60
-
-
84865778217
-
IVector fusion of prosodic and cepstral features for speaker verification
-
Florence, Italy
-
M. Kockmann, L. Ferrer, L. Burget, and ̌ J. Cernocḱy, "iVector fusion of prosodic and cepstral features for speaker verification," in Proc. Interspeech Conf., Florence, Italy, 2011, pp. 265-268.
-
(2011)
Proc. Interspeech Conf
, pp. 265-268
-
-
Kockmann, M.1
Ferrer, L.2
Burget, L.3
Cernocḱy, J.4
-
61
-
-
0036472946
-
A theoretical study on six classifier fusion strategies
-
Feb.
-
L. I. Kuncheva, "A theoretical study on six classifier fusion strategies," IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 2, pp. 281-286, Feb. 2002.
-
(2002)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.24
, Issue.2
, pp. 281-286
-
-
Kuncheva, L.I.1
-
62
-
-
85049773640
-
Language identification using phone-based acoustic likelihoods
-
Adelaide, Australia
-
L. F. Lamel and J. L. Gauvain, "Language identification using phone-based acoustic likelihoods," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, 1994, vol. 1, pp. 293-296.
-
(1994)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.1
, pp. 293-296
-
-
Lamel, L.F.1
Gauvain, J.L.2
-
63
-
-
64549162996
-
The OGI 22 language telephone speech corpus
-
Madrid, Spain
-
T. Lander, R. Cole, B. Oshika, and M. Noel, "The OGI 22 language telephone speech corpus," in Proc. Eurospeech Conf., Madrid, Spain, 1995, pp. 817-820.
-
(1995)
Proc. Eurospeech Conf
, pp. 817-820
-
-
Lander, T.1
Cole, R.2
Oshika, B.3
Noel, M.4
-
64
-
-
84876693427
-
Principles of spoken language recognition
-
J. Benesty, M. M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer-Verlag
-
C.-H. Lee, "Principles of spoken language recognition," in Springer Handbook of Speech Processing and Speech Communication, J. Benesty, M. M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer-Verlag, 2008.
-
(2008)
Springer Handbook of Speech Processing and Speech Communication
-
-
Lee, C.-H.1
-
65
-
-
51449104855
-
Spoken language recognition using support vector machines with generative front-end
-
Las Vegas, NV, USA
-
K. A. Lee, C. You, and H. Li, "Spoken language recognition using support vector machines with generative front-end," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Las Vegas, NV, USA, 2008, pp. 4153-4156.
-
(2008)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 4153-4156
-
-
Lee, K.A.1
You, C.2
Li, H.3
-
66
-
-
79953277529
-
Using discrete probabilities with Bhattacharyya measure for SVM-based speaker verification
-
May
-
K. A. Lee, C. H. You, H. Li, T. Kinnunen, and K. C. Sim, "Using discrete probabilities with Bhattacharyya measure for SVM-based speaker verification," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 4, pp. 861-870, May 2011.
-
(2011)
IEEE Trans. Audio Speech Lang. Process.
, vol.19
, Issue.4
, pp. 861-870
-
-
Lee, K.A.1
You, C.H.2
Li, H.3
Kinnunen, T.4
Sim, K.C.5
-
67
-
-
84865768678
-
Spoken language recognition in the latent topic simplex
-
Florence, Italy
-
K. A. Lee, C. H. You, V. Hautam̈aki, A. Larcher, and H. Li, "Spoken language recognition in the latent topic simplex," in Proc. Interspeech Conf., Florence, Italy, 2011, pp. 2933-2936.
-
(2011)
Proc. Interspeech Conf
, pp. 2933-2936
-
-
Lee, K.A.1
You, C.H.2
Hautam̈aki, V.3
Larcher, A.4
Li, H.5
-
68
-
-
0029747183
-
Speaker normalization using efficient frequency warping procedures
-
Atlanta, GA, USA
-
L. Lee and R. C. Rose, "Speaker normalization using efficient frequency warping procedures," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Atlanta, GA, USA, 1996, vol. 1, pp. 353-356.
-
(1996)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.1
, pp. 353-356
-
-
Lee, L.1
Rose, R.C.2
-
72
-
-
33947677598
-
A phonotactic language model for spoken language identification
-
Ann Arbor, MI, USA
-
H. Li and B. Ma, "A phonotactic language model for spoken language identification," in Proc. Assoc. Comput. Linguist., Ann Arbor, MI, USA, 2005, pp. 515-522.
-
(2005)
Proc. Assoc. Comput. Linguist
, pp. 515-522
-
-
Li, H.1
Ma, B.2
-
73
-
-
34547502608
-
A vector space modeling approach to spoken language identification
-
Jan.
-
H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 1, pp. 271-284, Jan. 2007.
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, Issue.1
, pp. 271-284
-
-
Li, H.1
Ma, B.2
Lee, C.-H.3
-
74
-
-
84876690978
-
Institute for Infocomm Research system description for the language recognition evaluation 2007 submission
-
Orlando, FL, USA
-
H. Li, B. Ma, K. C. Sim, R. Tong, K. A. Lee, H. Sun, D. Zhu, C. You, M. Dong, and X. Wang, "Institute for Infocomm Research system description for the language recognition evaluation 2007 submission," in Proc. NIST Lang. Recognit. Eval. Workshop, Orlando, FL, USA, 2007.
-
(2007)
Proc. NIST Lang. Recognit. Eval. Workshop
-
-
Li, H.1
Ma, B.2
Sim, K.C.3
Tong, R.4
Lee, K.A.5
Sun, H.6
Zhu, D.7
You, C.8
Dong, M.9
Wang, X.10
-
75
-
-
84994249527
-
Vector-based spoken language classification
-
J. Benesty, M. M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer-Verlag
-
H. Li, B. Ma, and C.-H. Lee, "Vector-based spoken language classification," in Springer Handbook of Speech Processing and Speech Communication, J. Benesty, M. M. Sondhi, and A. Huang, Eds. New York, NY, USA: Springer-Verlag, 2008.
-
(2008)
Springer Handbook of Speech Processing and Speech Communication
-
-
Li, H.1
Ma, B.2
Lee, C.-H.3
-
76
-
-
85032751399
-
TechWare: Speaker and spoken language recognition resources
-
Nov.
-
H. Li and B. Ma, "TechWare: Speaker and spoken language recognition resources," IEEE Signal Process. Mag., vol. 27, no. 6, pp. 139-142, Nov. 2010.
-
(2010)
IEEE Signal Process. Mag.
, vol.27
, Issue.6
, pp. 139-142
-
-
Li, H.1
Ma, B.2
-
77
-
-
0019145840
-
Statistical models for automatic language identification
-
Denver, CO, USA
-
K. P. Li and T. J. Edwards, "Statistical models for automatic language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Denver, CO, USA, 1980, pp. 884-887.
-
(1980)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 884-887
-
-
Li, K.P.1
Edwards, T.J.2
-
78
-
-
84876664448
-
Machine learning paradigms for speech recognition: An overview
-
accepted for publication
-
X. Li, L. Deng, and J. Bilmes, "Machine learning paradigms for speech recognition: An overview," IEEE Trans. Audio Speech Lang. Process., accepted for publication.
-
IEEE Trans. Audio Speech Lang. Process
-
-
Li, X.1
Deng, L.2
Bilmes, J.3
-
79
-
-
33646810153
-
Using local and global phonotactic features in Chinese dialect identification
-
Philadelphia, PA, USA
-
B. P. Lim, H. Li, and B. Ma, "Using local and global phonotactic features in Chinese dialect identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Philadelphia, PA, USA, 2005, pp. 577-580.
-
(2005)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 577-580
-
-
Lim, B.P.1
Li, H.2
Ma, B.3
-
80
-
-
85009211861
-
Improved speaker verification through probabilistic subspace adaptation
-
Geneva, Switzerland
-
S. Lucey and T. Chen, "Improved speaker verification through probabilistic subspace adaptation," in Proc. Eurospeech Conf., Geneva, Switzerland, 2003, pp. 2021-2024.
-
(2003)
Proc. Eurospeech Conf
, pp. 2021-2024
-
-
Lucey, S.1
Chen, T.2
-
81
-
-
85009273139
-
Multilingual speech recognition with language identification
-
Denver, CO, USA
-
B. Ma, C. Guan, H. Li, and C.-H. Lee, "Multilingual speech recognition with language identification," in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, USA, 2002, pp. 505-508.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process
, pp. 505-508
-
-
Ma, B.1
Guan, C.2
Li, H.3
Lee, C.-H.4
-
82
-
-
60849102345
-
Spoken language recognition with ensemble classifiers
-
Sep.
-
B. Ma, H. Li, and R. Tong, "Spoken language recognition with ensemble classifiers," IEEE Trans. Audio Speech Lang. Process., vol. 15, no. 7, pp. 2053-2062, Sep. 2007.
-
(2007)
IEEE Trans. Audio Speech Lang. Process.
, vol.15
, Issue.7
, pp. 2053-2062
-
-
Ma, B.1
Li, H.2
Tong, R.3
-
84
-
-
85046873967
-
The DET curve in assessment of detection task performance
-
A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," in Proc. Eurospeech Conf., Rhodes, Greece, 1997, vol. 4, pp. 1895-1898.
-
(1997)
Proc. Eurospeech Conf., Rhodes, Greece
, vol.4
, pp. 1895-1898
-
-
Martin, A.1
Doddington, G.2
Kamm, T.3
Ordowski, M.4
Przybocki, M.5
-
85
-
-
85009208002
-
NIST 2003 language recognition evaluation
-
A. F. Martin and M. A. Przybocki, "NIST 2003 language recognition evaluation," in Proc. Eurospeech Conf., Geneva, Switzerland, 2003, pp. 1341-1344.
-
(2003)
Proc. Eurospeech Conf., Geneva, Switzerland
, pp. 1341-1344
-
-
Martin, A.F.1
Przybocki, M.A.2
-
86
-
-
37649031157
-
The current state of language recognition: NIST 2005 evaluation results
-
San Juan, Puerto Rico DOI: 10.1109/ODYSSEY. 2006.248104
-
A. F. Martin and A. N. Le, "The current state of language recognition: NIST 2005 evaluation results," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY. 2006.248104.
-
(2006)
Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
-
-
Martin, A.F.1
Le, A.N.2
-
87
-
-
84969216997
-
NIST speech processing evaluations: LVCSR, speaker recognition, language recognition
-
Washington, DC, USA
-
A. F. Martin and J. S. Garofolo, "NIST speech processing evaluations: LVCSR, speaker recognition, language recognition," in Proc. IEEE Workshop Signal Process. Appl. Public Security Forensics, Washington, DC, USA, 2007, pp. 1-7.
-
(2007)
Proc. IEEE Workshop Signal Process. Appl. Public Security Forensics
, pp. 1-7
-
-
Martin, A.F.1
Garofolo, J.S.2
-
89
-
-
85073106909
-
The 2009 NIST language recognition evaluation
-
Brno, Czech Republic
-
A. F. Martin and C. Greenberg, "The 2009 NIST language recognition evaluation," in Proc. Odyssey: Speaker Lang. Recognit. Workshop, Brno, Czech Republic, 2010, pp. 165-171.
-
(2010)
Proc. Odyssey: Speaker Lang. Recognit. Workshop
, pp. 165-171
-
-
Martin, A.F.1
Greenberg, C.2
-
90
-
-
33745190265
-
Phonotactic language identification using high quality phoneme recognition
-
Lisbon, Portugal
-
P. Matejka, P. Schwarz, J. Cernocky, and P. Chytil, "Phonotactic language identification using high quality phoneme recognition," in Proc. Interspeech Conf., Lisbon, Portugal, 2005, pp. 2237-2240.
-
(2005)
Proc. Interspeech Conf
, pp. 2237-2240
-
-
Matejka, P.1
Schwarz, P.2
Cernocky, J.3
Chytil, P.4
-
91
-
-
84867202539
-
Beyond frame independent: Parametric modeling of time duration in speaker and language recognition
-
Brisbane, Australia
-
A. McCree, F. Richardson, E. Singer, and D. Reynolds, "Beyond frame independent: Parametric modeling of time duration in speaker and language recognition," in Proc. Interspeech Conf., Brisbane, Australia, 2008, pp. 767-770.
-
(2008)
Proc. Interspeech Conf
, pp. 767-770
-
-
McCree, A.1
Richardson, F.2
Singer, E.3
Reynolds, D.4
-
92
-
-
0029725760
-
Automatic language identification using large vocabulary continuous speech recognition
-
Atlanta, GA, USA
-
S. Mendoza, L. Gillick, Y. Ito, S. Lowe, and M. Newman, "Automatic language identification using large vocabulary continuous speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Atlanta, GA, USA, 1996, vol. 2, pp. 785-788.
-
(1996)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.2
, pp. 785-788
-
-
Mendoza, S.1
Gillick, L.2
Ito, Y.3
Lowe, S.4
Newman, M.5
-
93
-
-
0012352869
-
-
Carnegie Mellon Univ., Pittsburgh, PA, USA, Tech. Rep. 738
-
T. P. Minka, "Algorithms for maximum-likelihood logistic regression," Carnegie Mellon Univ., Pittsburgh, PA, USA, Tech. Rep. 738, 2001.
-
(2001)
Algorithms for Maximum-likelihood Logistic Regression
-
-
Minka, T.P.1
-
94
-
-
85081009009
-
The OGI multi-language telephone speech corpus
-
Y. K. Muthusamy, R. Cole, and B. Oshika, "The OGI multi-language telephone speech corpus," in Proc. Int. Conf. Spoken Lang. Process., BanffABCanada, 1992, pp. 895-898.
-
(1992)
Proc. Int. Conf. Spoken Lang. Process., BanffABCanada
, pp. 895-898
-
-
Muthusamy, Y.K.1
Cole, R.2
Oshika, B.3
-
95
-
-
0004656027
-
A comparison of approaches to automatic language identification using telephone speech
-
Berlin, Germany
-
Y. K. Muthusamy, K. M. Berkling, T. Arai, R. A. Cole, and E. Barnard, "A comparison of approaches to automatic language identification using telephone speech," in Proc. Eurospeech Conf., Berlin, Germany, 1993, pp. 1307-1310.
-
(1993)
Proc. Eurospeech Conf
, pp. 1307-1310
-
-
Muthusamy, Y.K.1
Berkling, K.M.2
Arai, T.3
Cole, R.A.4
Barnard, E.5
-
96
-
-
0000164460
-
Perceptual benchmarks for automatic language identification
-
Adelaide, Australia
-
Y. K. Muthusamy, N. Jain, and R. A. Cole, "Perceptual benchmarks for automatic language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, 1994, vol. 1, pp. 333-336.
-
(1994)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.1
, pp. 333-336
-
-
Muthusamy, Y.K.1
Jain, N.2
Cole, R.A.3
-
97
-
-
0028516964
-
Reviewing automatic language identification
-
Oct.
-
Y. K. Muthusamy, E. Barnard, and R. A. Cole, "Reviewing automatic language identification," IEEE Signal Process. Mag., vol. 11, no. 4, pp. 33-41, Oct. 1994.
-
(1994)
IEEE Signal Process. Mag.
, vol.11
, Issue.4
, pp. 33-41
-
-
Muthusamy, Y.K.1
Barnard, E.2
Cole, R.A.3
-
98
-
-
85079281912
-
Speaker-independent, text-independent language identification by HMM
-
Banff, AB, Canada
-
S. Nakagawa, Y. Ueda, and T. Seino, "Speaker-independent, text-independent language identification by HMM," in Proc. Int. Conf. Spoken Lang. Process., Banff, AB, Canada, 1992, pp. 1011-1014.
-
(1992)
Proc. Int. Conf. Spoken Lang. Process
, pp. 1011-1014
-
-
Nakagawa, S.1
Ueda, Y.2
Seino, T.3
-
99
-
-
0035441593
-
Spoken language recognitionVA step toward multilinguality in speech processing
-
Sep.
-
J. Navratil, "Spoken language recognitionVA step toward multilinguality in speech processing," IEEE Trans. Speech Audio Process., vol. 9, no. 6, pp. 678-685, Sep. 2001.
-
(2001)
IEEE Trans. Speech Audio Process.
, vol.9
, Issue.6
, pp. 678-685
-
-
Navratil, J.1
-
100
-
-
78049400391
-
Prosodic attribute model for spoken language identification
-
Dallas, TX, USA
-
R. W. M. Ng, C.-C. Leung, T. Lee, B. Ma, and H. Li, "Prosodic attribute model for spoken language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Dallas, TX, USA, 2010, pp. 5022-5025.
-
(2010)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 5022-5025
-
-
Ng, R.W.M.1
Leung, C.-C.2
Lee, T.3
Ma, B.4
Li, H.5
-
101
-
-
79959820103
-
Towards long-range prosodic attribute modeling for language recognition
-
Chiba, Japan
-
R. W. M. Ng, C.-C. Leung, V. Hautam̈aki, T. Lee, B. Ma, and H. Li, "Towards long-range prosodic attribute modeling for language recognition," in Proc. Interspeech Conf., Chiba, Japan, 2010, pp. 1792-1795.
-
(2010)
Proc. Interspeech Conf
, pp. 1792-1795
-
-
Ng, R.W.M.1
Leung, C.-C.2
Hautam̈aki, V.3
Lee, T.4
Ma, B.5
Li, H.6
-
102
-
-
84876680475
-
-
NIST Language Recognition Evaluations. [Online]. Available
-
NIST Language Recognition Evaluations. [Online]. Available: http://nist.gov/itl/iad/mig/lre.cfm
-
-
-
-
103
-
-
80052055182
-
Improved modeling of cross-decoder phone co-occurrences in SVM-Based phonotactic language recognition
-
Nov.
-
M. Penagarikano, A. Varona, L. J. Rodriguez-Fuentes, and G. Bordel, "Improved modeling of cross-decoder phone co-occurrences in SVM-Based phonotactic language recognition," IEEE Trans. Audio Speech Lang. Process., vol. 19, no. 8, pp. 2348-2363, Nov. 2011.
-
(2011)
IEEE Trans. Audio Speech Lang. Process.
, vol.19
, Issue.8
, pp. 2348-2363
-
-
Penagarikano, M.1
Varona, A.2
Rodriguez-Fuentes, L.J.3
Bordel, G.4
-
104
-
-
0033902487
-
Applying logistic regression to fusion of the NIST'99 1-speaker submissions
-
S. Pigeon, P. Druyts, and P. Verlinde, "Applying logistic regression to fusion of the NIST'99 1-speaker submissions," Digital Signal Process., vol. 10, pp. 237-248, 2000.
-
(2000)
Digital Signal Process.
, vol.10
, pp. 237-248
-
-
Pigeon, S.1
Druyts, P.2
Verlinde, P.3
-
105
-
-
84876686443
-
Discriminative training of GMM for language identification
-
Tokyo, Japan
-
D. Qu and B. Wang, "Discriminative training of GMM for language identification," in Proc. ISCA IEEE Workshop Spontaneous Speech Process. Recognit., Tokyo, Japan, 2003, pp. 67-70.
-
(2003)
Proc. ISCA IEEE Workshop Spontaneous Speech Process. Recognit
, pp. 67-70
-
-
Qu, D.1
Wang, B.2
-
107
-
-
0024610919
-
A tutorial on hidden Markov models and selected publication in speech recognition
-
Feb.
-
L. R. Rabiner, "A tutorial on hidden Markov models and selected publication in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
-
(1989)
Proc. IEEE
, vol.77
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
108
-
-
0032943763
-
Language identification with suprasegmental cues: A study based on speech re-synthesis
-
F. Ramus and J. Mehler, "Language identification with suprasegmental cues: A study based on speech re-synthesis," J. Acoust. Soc. Amer., vol. 105, no. 1, pp. 512-521, 1999.
-
(1999)
J. Acoust. Soc. Amer.
, vol.105
, Issue.1
, pp. 512-521
-
-
Ramus, F.1
Mehler, J.2
-
109
-
-
0032725252
-
Correlates of linguistic rhythm in the speech signal
-
R. Ramus, M. Nespor, and J. Mehler, "Correlates of linguistic rhythm in the speech signal," Cognition, vol. 73, no. 3, pp. 265-292, 1999.
-
(1999)
Cognition
, vol.73
, Issue.3
, pp. 265-292
-
-
Ramus, R.1
Nespor, M.2
Mehler, J.3
-
110
-
-
0029209272
-
Robust text-independent speaker identification using Gaussian mixture speaker models
-
Jan.
-
D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
-
(1995)
IEEE Trans. Speech Audio Process.
, vol.3
, Issue.1
, pp. 72-83
-
-
Reynolds, D.A.1
Rose, R.C.2
-
111
-
-
0033884858
-
Speaker verification using adapted Gaussian mixture models
-
D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol. 10, pp. 19-41, 2000.
-
(2000)
Digital Signal Process.
, vol.10
, pp. 19-41
-
-
Reynolds, D.A.1
Quatieri, T.F.2
Dunn, R.B.3
-
112
-
-
51449124361
-
Language recognition with discriminative keyword selection
-
Las Vegas, NV, USA
-
F. S. Richardson and W. M. Campbell, "Language recognition with discriminative keyword selection," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Las Vegas, NV, USA, 2008, pp. 4145-4148.
-
(2008)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 4145-4148
-
-
Richardson, F.S.1
Campbell, W.M.2
-
114
-
-
45549117987
-
Term-weighting approaches in automatic text retrieval
-
G. Salton and C. Buckley, "Term-weighting approaches in automatic text retrieval," Inf. Process. Manage., vol. 24, no. 5, pp. 513-523, 1988.
-
(1988)
Inf. Process. Manage.
, vol.24
, Issue.5
, pp. 513-523
-
-
Salton, G.1
Buckley, C.2
-
115
-
-
0029725380
-
LVCSR-based language identification
-
Atlanta, GA, USA
-
T. Schultz, I. Rogina, and A. Waibel, "LVCSR-based language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Atlanta, GA, USA, 1996, vol. 2, pp. 781-784.
-
(1996)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.2
, pp. 781-784
-
-
Schultz, T.1
Rogina, I.2
Waibel, A.3
-
116
-
-
0035426931
-
Language independent and language adaptive
-
T. Schultz and A. Waibel, "Language independent and language adaptive," Speech Commun., vol. 35, no. 1-2, pp. 31-51, 2001.
-
(2001)
Speech Commun.
, vol.35
, Issue.1-2
, pp. 31-51
-
-
Schultz, T.1
Waibel, A.2
-
117
-
-
85009274666
-
Globalphone: A multilingual text and speech database developed at Karlsruhe University
-
Denver, CO, USA
-
T. Schultz, "Globalphone: A multilingual text and speech database developed at Karlsruhe University," in Proc. Interspeech Conf., Denver, CO, USA, 2002, pp. 345-348.
-
(2002)
Proc. Interspeech Conf
, pp. 345-348
-
-
Schultz, T.1
-
118
-
-
42749098771
-
Experiments with lattice-based PPRLM language identification
-
San Juan, Puerto Rico DOI: 10.1109/ODYSSEY. 2006.248100
-
W. Shen, W. Campbell, T. Gleason, D. Reynolds, and E. Singer, "Experiments with lattice-based PPRLM language identification," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY. 2006.248100.
-
(2006)
Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
-
-
Shen, W.1
Campbell, W.2
Gleason, T.3
Reynolds, D.4
Singer, E.5
-
119
-
-
51449123703
-
Improved GMM-Based language recognition using constrained MLLR transforms
-
Las Vegas, NV, USA
-
W. Shen and D. A. Reynolds, "Improved GMM-Based language recognition using constrained MLLR transforms," in Proc. Int. Conf. Acoust. Speech Signal Process., Las Vegas, NV, USA, 2008, pp. 4149-4152.
-
(2008)
Proc. Int. Conf. Acoust. Speech Signal Process
, pp. 4149-4152
-
-
Shen, W.1
Reynolds, D.A.2
-
120
-
-
66149124829
-
On acoustic diversification front-end for spoken language identification
-
Jul.
-
K. C. Sim and H. Li, "On acoustic diversification front-end for spoken language identification," IEEE Trans. Audio Speech Lang. Process., vol. 16, no. 5, pp. 1029-1037, Jul. 2008.
-
(2008)
IEEE Trans. Audio Speech Lang. Process.
, vol.16
, Issue.5
, pp. 1029-1037
-
-
Sim, K.C.1
Li, H.2
-
121
-
-
85073202259
-
The MITLL NIST LRE2011 language recognition system
-
Singapore
-
E. Singer, P. Torres-Carrasquillo, D. Reynolds, A. McCree, F. Richardson, N. Dehak, and D. Sturim, "The MITLL NIST LRE2011 language recognition system," in Proc. Odyssey: Speaker Lang. Recognit. Workshop, Singapore, 2012, pp. 209-215.
-
(2012)
Proc. Odyssey: Speaker Lang. Recognit. Workshop
, pp. 209-215
-
-
Singer, E.1
Torres-Carrasquillo, P.2
Reynolds, D.3
McCree, A.4
Richardson, F.5
Dehak, N.6
Sturim, D.7
-
122
-
-
70450159475
-
Exploring universal attribute characterization of spoken languages for spoken language recognition
-
Brighton, U.K.
-
S. M. Siniscalchi, J. Reed, T. Svendsen, and C.-H. Lee, "Exploring universal attribute characterization of spoken languages for spoken language recognition," in Proc. Interspeech Conf., Brighton, U.K., 2009, pp. 168-171.
-
(2009)
Proc. Interspeech Conf
, pp. 168-171
-
-
Siniscalchi, S.M.1
Reed, J.2
Svendsen, T.3
Lee, C.-H.4
-
123
-
-
79959820578
-
Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition
-
Chiba, Japan
-
S. M. Siniscalchi, J. Reed, T. Svendsen, and C.-H. Lee, "Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition," in Proc. Interspeech Conf., Chiba, Japan, 2010, pp. 2718-2721.
-
(2010)
Proc. Interspeech Conf
, pp. 2718-2721
-
-
Siniscalchi, S.M.1
Reed, J.2
Svendsen, T.3
Lee, C.-H.4
-
124
-
-
33645895387
-
Advances in channel compensation for SVM speaker recognition
-
Philadelphia, PA, USA
-
A. Solomonoff, W. M. Campbell, and I. Boardman, "Advances in channel compensation for SVM speaker recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Philadelphia, PA, USA, 2005, pp. 629-632.
-
(2005)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 629-632
-
-
Solomonoff, A.1
Campbell, W.M.2
Boardman, I.3
-
125
-
-
0026404756
-
Automatic language recognition using acoustic features
-
Toronto, ON, Canada
-
M. Sugiyama, "Automatic language recognition using acoustic features," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Toronto, ON, Canada, 1991, vol. 2, pp. 813-816.
-
(1991)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.2
, pp. 813-816
-
-
Sugiyama, M.1
-
126
-
-
0033556788
-
Mixtures of probabilistic principal component analysis
-
M. E. Tipping and C. M. Bishop, "Mixtures of probabilistic principal component analysis," Neural Comput., vol. 11, no. 2, pp. 443-482, 1999.
-
(1999)
Neural Comput.
, vol.11
, Issue.2
, pp. 443-482
-
-
Tipping, M.E.1
Bishop, C.M.2
-
127
-
-
33947644912
-
Integrating acoustic, prosodic and phonotactic features for spoken language identification
-
Toulouse, France
-
R. Tong, B. Ma, D. Zhu, H. Li, and E.-S. Chng, "Integrating acoustic, prosodic and phonotactic features for spoken language identification," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Toulouse, France, 2006, pp. 205-208.
-
(2006)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 205-208
-
-
Tong, R.1
Ma, B.2
Zhu, D.3
Li, H.4
Chng, E.-S.5
-
128
-
-
68549110291
-
A target-oriented phonotactic front-end for spoken language recognition
-
Sep.
-
R. Tong, B. Ma, H. Li, and E. Chng, "A target-oriented phonotactic front-end for spoken language recognition," IEEE Trans. Audio Speech Lang. Process., vol. 17, no. 7, pp. 1335-1347, Sep. 2009.
-
(2009)
IEEE Trans. Audio Speech Lang. Process.
, vol.17
, Issue.7
, pp. 1335-1347
-
-
Tong, R.1
Ma, B.2
Li, H.3
Chng, E.4
-
129
-
-
84865770491
-
Target-aware lattice rescoring for dialect recognition
-
Florence, Italy
-
R. Tong, B. Ma, H. Li, and E. Chng, "Target-aware lattice rescoring for dialect recognition," in Proc. Interspeech Conf., Florence, Italy, 2011, pp. 733-736.
-
(2011)
Proc. Interspeech Conf
, pp. 733-736
-
-
Tong, R.1
Ma, B.2
Li, H.3
Chng, E.4
-
130
-
-
17444453660
-
Language identification using Gaussian mixture model tokenization
-
Orlando, FL, USA
-
P. A. Torres-Carrasquillo, D. A. Reynolds, and R. J. Deller, Jr., "Language identification using Gaussian mixture model tokenization," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Orlando, FL, USA, 2002, pp. 757-760.
-
(2002)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 757-760
-
-
Torres-Carrasquillo, P.A.1
Reynolds, D.A.2
Deller, Jr.R.J.3
-
131
-
-
85009275225
-
Approaches to language identification using Gaussian mixture models and shifted delta cepstral features
-
Denver, CO, USA
-
P. Torres-Carrasquillo, E. Singer, M. Kohler, R. Greene, D. Reynolds, and J. Deller, Jr., "Approaches to language identification using Gaussian mixture models and shifted delta cepstral features," in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, USA, 2002, pp. 89-92.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process
, pp. 89-92
-
-
Torres-Carrasquillo, P.1
Singer, E.2
Kohler, M.3
Greene, R.4
Reynolds, D.5
Deller, Jr.J.6
-
132
-
-
84867223521
-
The MITLL NIST LRE 2007 language recognition system
-
Brisbane, Australia
-
P. Torres-Carrasquillo, E. Singer, W. Campbell, T. Gleason, A. McCree, D. Reynolds, F. Richardson, W. Shen, and D. Sturim, "The MITLL NIST LRE 2007 language recognition system," in Proc. Interspeech Conf., Brisbane, Australia, 2008, pp. 719-722.
-
(2008)
Proc. Interspeech Conf
, pp. 719-722
-
-
Torres-Carrasquillo, P.1
Singer, E.2
Campbell, W.3
Gleason, T.4
McCree, A.5
Reynolds, D.6
Richardson, F.7
Shen, W.8
Sturim, D.9
-
133
-
-
0037856368
-
Automatic language identification using sub-words models
-
Adelaide, Australia
-
R. C. Tucker, M. J. Carey, and E. S. Paris, "Automatic language identification using sub-words models," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Adelaide, Australia, 1994, pp. 301-304.
-
(1994)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 301-304
-
-
Tucker, R.C.1
Carey, M.J.2
Paris, E.S.3
-
134
-
-
42749098416
-
Channel-dependent GMM and multi-class logistic regression models for language recognition
-
San Juan, Puerto Rico DOI: 10.1109/ODYSSEY. 2006.248094
-
D. A. van Leeuwen and N. Brummer, "Channel-dependent GMM and multi-class logistic regression models for language recognition," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY. 2006.248094.
-
(2006)
Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
-
-
Van Leeuwen, D.A.1
Brummer, N.2
-
135
-
-
36248952139
-
An introduction to application independent evaluation of speaker recognition systems
-
R. Müller, Ed. Berlin, Germany: Springer-Verlag
-
D. A. van Leeuwen and N. Brümmer, "An introduction to application independent evaluation of speaker recognition systems," in Speaker Classification, vol. 4343, R. Müller, Ed. Berlin, Germany: Springer-Verlag, 2007.
-
(2007)
Speaker Classification
, vol.4343
-
-
Van Leeuwen, D.A.1
Brümmer, N.2
-
136
-
-
70450121368
-
An open-set detection evaluation methodology applied to language and emotion recognition
-
Antwerp, Belgium
-
D. A. van Leeuwen and K. P. Truong, "An open-set detection evaluation methodology applied to language and emotion recognition," in Proc. Interspeech Conf., Antwerp, Belgium, 2007, pp. 338-341.
-
(2007)
Proc. Interspeech Conf
, pp. 338-341
-
-
Van Leeuwen, D.A.1
Truong, K.P.2
-
137
-
-
85084014488
-
A human benchmark for the NIST language recognition evaluation 2005
-
Stellenbosch, South Africa paper 012
-
D. A. van Leeuwen, M. Boer, and R. Orr, "A human benchmark for the NIST language recognition evaluation 2005," presented at the Odyssey: Speaker Lang. Recognit. Workshop, Stellenbosch, South Africa, 2008, paper 012.
-
(2008)
Presented at the Odyssey: Speaker Lang. Recognit. Workshop
-
-
Van Leeuwen, D.A.1
Boer, M.2
Orr, R.3
-
139
-
-
42749106196
-
Channel factors compensation in model and feature domain for speaker recognition
-
San Juan, Puerto Rico DOI: 10.1109/ODYSSEY.2006.248117
-
C. Vair, D. Colibro, F. Castaldo, E. Dalmasso, and P. Laface, "Channel factors compensation in model and feature domain for speaker recognition," in Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop, San Juan, Puerto Rico, 2006, DOI: 10.1109/ODYSSEY.2006.248117.
-
(2006)
Proc. IEEE Odyssey: Speaker Lang. Recognit. Workshop
-
-
Vair, C.1
Colibro, D.2
Castaldo, F.3
Dalmasso, E.4
Laface, P.5
-
140
-
-
34548248573
-
Explicit modelling of session variability for speaker verification
-
R. Vogt and S. Sridharan, "Explicit modelling of session variability for speaker verification," Comput. Speech Lang., vol. 22, pp. 17-38, 2008.
-
(2008)
Comput. Speech Lang.
, vol.22
, pp. 17-38
-
-
Vogt, R.1
Sridharan, S.2
-
141
-
-
0012327341
-
Multilinguality in speech and spoken language systems
-
Aug.
-
A. Waibel, P. Geutner, L. M. Tomokiyo, T. Schultz, and M. Woszczyna, "Multilinguality in speech and spoken language systems," Proc. IEEE, vol. 88, no. 8, pp. 1181-1190, Aug. 2000.
-
(2000)
Proc. IEEE
, vol.88
, Issue.8
, pp. 1181-1190
-
-
Waibel, A.1
Geutner, P.2
Tomokiyo, L.M.3
Schultz, T.4
Woszczyna, M.5
-
142
-
-
0036461035
-
Large scale discriminative training of hidden Markov models for speech recognition
-
P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models for speech recognition," Comput. Speech Lang., vol. 16, no. 1, pp. 25-47, 2002.
-
(2002)
Comput. Speech Lang.
, vol.16
, Issue.1
, pp. 25-47
-
-
Woodland, P.C.1
Povey, D.2
-
143
-
-
0028996642
-
An approach to automatic language identification based on language-dependent phone recognition
-
Detroit, MI, USA
-
Y. Yan and E. Barnard, "An approach to automatic language identification based on language-dependent phone recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Detroit, MI, USA, 1995, vol. 5, pp. 3511-3514.
-
(1995)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.5
, pp. 3511-3514
-
-
Yan, Y.1
Barnard, E.2
-
144
-
-
0029755106
-
Development of an approach to language identification based on phone recognition
-
Y. Yan, E. Barnard, and R. Cole, "Development of an approach to language identification based on phone recognition," Comput. Speech Lang., vol. 10, pp. 37-54, 1996.
-
(1996)
Comput. Speech Lang.
, vol.10
, pp. 37-54
-
-
Yan, Y.1
Barnard, E.2
Cole, R.3
-
145
-
-
77955790894
-
GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition
-
Aug.
-
C. H. You, K. A. Lee, and H. Li, "GMM-SVM kernel with a Bhattacharyya-based distance for speaker recognition," IEEE Trans. Audio Speech Lang. Process., vol. 18, no. 6, pp. 1300-1312, Aug. 2010.
-
(2010)
IEEE Trans. Audio Speech Lang. Process.
, vol.18
, Issue.6
, pp. 1300-1312
-
-
You, C.H.1
Lee, K.A.2
Li, H.3
-
146
-
-
84863799477
-
A GMM-supervector approach to language recognition with adaptive relevance factor
-
Aalborg, Denmark
-
C. H. You, H. Li, and K. A. Lee, "A GMM-supervector approach to language recognition with adaptive relevance factor," in Proc. EUSIPCO, Aalborg, Denmark, 2010, pp. 1993-1997.
-
(2010)
Proc. EUSIPCO
, pp. 1993-1997
-
-
You, C.H.1
Li, H.2
Lee, K.A.3
-
147
-
-
54149098943
-
Cortical competition during language discrimination
-
J. Zhao, H. Shu, L. Zhang, X. Wang, Q. Gong, and P. Li, "Cortical competition during language discrimination," NeuroImage, vol. 43, pp. 624-633, 2008.
-
(2008)
NeuroImage
, vol.43
, pp. 624-633
-
-
Zhao, J.1
Shu, H.2
Zhang, L.3
Wang, X.4
Gong, Q.5
Li, P.6
-
148
-
-
78049394638
-
Soft margin estimation of Gaussian mixture model parameters for spoken language recognition
-
Dallas, TX, USA
-
D. Zhu, B. Ma, and H. Li, "Soft margin estimation of Gaussian mixture model parameters for spoken language recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Dallas, TX, USA, 2010, pp. 4990-4993.
-
(2010)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, pp. 4990-4993
-
-
Zhu, D.1
Ma, B.2
Li, H.3
-
149
-
-
0027316611
-
Automatic language identification using Gaussian mixture and hidden Markov models
-
Minneapolis, MN, USA
-
M. A. Zissman, "Automatic language identification using Gaussian mixture and hidden Markov models," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Minneapolis, MN, USA, 1993, vol. 2, pp. 399-402.
-
(1993)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.2
, pp. 399-402
-
-
Zissman, M.A.1
-
150
-
-
0029733178
-
Comparison of four approaches to automatic language identification of telephone speech
-
Jan.
-
M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 31-44, Jan. 1996.
-
(1996)
IEEE Trans. Speech Audio Process.
, vol.4
, Issue.1
, pp. 31-44
-
-
Zissman, M.A.1
-
151
-
-
85135186373
-
Predicting, diagnosing and improving automatic language identification performance
-
Rhodes, Greece
-
M. A. Zissman, "Predicting, diagnosing and improving automatic language identification performance," in Proc. Eurospeech Conf., Rhodes, Greece, 1997, pp. 51-54.
-
(1997)
Proc. Eurospeech Conf
, pp. 51-54
-
-
Zissman, M.A.1
|