SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 3027-3031

UBM fused total variability modeling for language identification

(3) Van Segbroeck, Maarten a Travadi, Ruchir a Narayanan, Shrikanth S a

a UNIVERSITY OF SOUTHERN CALIFORNIA (United States)

Author keywords

I vector representation; Language identification; Noise robustness; RATS; Short duration

Indexed keywords

RATING; RATS; SPEECH COMMUNICATION;

AUTOMATIC TRANSCRIPTION; FEATURE REPRESENTATION; I VECTORS; LANGUAGE IDENTIFICATION; NOISE ROBUSTNESS; SHORT-DURATION; TOTAL VARIABILITIES; UNIVERSAL BACKGROUND MODEL;

SPEECH RECOGNITION;

EID: 84910070752 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (7)

References (32)

1
- 0028996643
- Language identification using phoneme recognition and phonotactic language modeling
- IEEE
- M. A. Zissman, "Language identification using phoneme recognition and phonotactic language modeling, " in Proc. ICASSP, vol. 5. IEEE, 1995, pp. 3503-3506.
- (1995) Proc. ICASSP , vol.5 , pp. 3503-3506
- Zissman, M.A.¹

2
- 0028996642
- An approach to automatic language identification based on language-dependent phone recognition
- IEEE
- Y. Yan and E. Barnard, "An approach to automatic language identification based on language-dependent phone recognition, " in Proc. ICASSP, vol. 5. IEEE, 1995, pp. 3511-3514.
- (1995) Proc. ICASSP , vol.5 , pp. 3511-3514
- Yan, Y.¹ Barnard, E.²

3
- 84910024739
- Acoustic, phonetic, and discriminative approaches to automatic language identification
- E. Singer, P. A. Torres-Carrasquillo, T. P. Gleason, W. M. Campbell, and D. A. Reynolds, "Acoustic, phonetic, and discriminative approaches to automatic language identification, " in Proc. Interspeech, 2003.
- (2003) Proc. Interspeech
- Singer, E.¹ Torres-Carrasquillo, P.A.² Gleason, T.P.³ Campbell, W.M.⁴ Reynolds, D.A.⁵

4
- 33745190265
- Phonotactic language identification using high quality phoneme recognition
- P. Matejka, P. Schwarz, J. Cernocky, and P. Chytil, "Phonotactic language identification using high quality phoneme recognition, " in Proc. Interspeech, 2005, pp. 2237-2240.
- (2005) Proc. Interspeech , pp. 2237-2240
- Matejka, P.¹ Schwarz, P.² Cernocky, J.³ Chytil, P.⁴

5
- 17444453660
- Language identification using gaussian mixture model tokenization
- IEEE
- P. A. Torres-Carrasquillo, D. A. Reynolds, and J. Deller Jr, "Language identification using gaussian mixture model tokenization, " in Proc. ICASSP, vol. 1. IEEE, 2002, pp. 1-757.
- (2002) Proc. ICASSP , vol.1 , pp. 1-757
- Torres-Carrasquillo, P.A.¹ Reynolds, D.A.² Deller, J.³

6
- 84910087367
- Methods to improve gaussian mixture model based language identification system
- E. Wong and S. Sridharan, "Methods to improve gaussian mixture model based language identification system, " in Proc. Interspeech, 2002.
- (2002) Proc. Interspeech
- Wong, E.¹ Sridharan, S.²

7
- 33947696754
- SVM based speaker verification using a GMM supervector kernel and NAP variability compensation
- W. M. Campbell, D. E. Sturim, D. A. Reynolds, and A. Solomonoff, "SVM based speaker verification using a GMM supervector kernel and NAP variability compensation, " in Proc. ICASSP, vol. 1, 2006.
- (2006) Proc. ICASSP , vol.1
- Campbell, W.M.¹ Sturim, D.E.² Reynolds, D.A.³ Solomonoff, A.⁴

8
- 18744386134
- Eigenvoice modeling with sparse training data
- P. Kenny, G. Boulianne, and P. Dumouchel, "Eigenvoice modeling with sparse training data, " Speech and Audio Processing, IEEE Transactions on, vol. 13, no. 3, pp. 345-354, 2005.
- (2005) Speech and Audio Processing, IEEE Transactions on , vol.13 , Issue.3 , pp. 345-354
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

9
- 50249170027
- Joint factor analysis versus eigenchannels in speaker recognition
- P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, "Joint factor analysis versus eigenchannels in speaker recognition, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, no. 4, pp. 1435-1447, 2007.
- (2007) Audio, Speech, and Language Processing, IEEE Transactions on , vol.15 , Issue.4 , pp. 1435-1447
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

10
- 70450195685
- Factor analysis and svm for language recognition
- F. Verdet, D. Matrouf, J.-F. Bonastre, and J. Hennebert, "Factor analysis and svm for language recognition." in Proc. Interspeech, 2009, pp. 164-167.
- (2009) Proc. Interspeech , pp. 164-167
- Verdet, F.¹ Matrouf, D.² Bonastre, J.-F.³ Hennebert, J.⁴

11
- 79951609039
- Front-end factor analysis for speaker verification
- N. Dehak, P. J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 19, no. 4, pp. 788-798, 2011.
- (2011) Audio, Speech, and Language Processing, IEEE Transactions on , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.J.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

12
- 84865769863
- Language recognition in ivectors space
- Firenze, Italy
- D. Martinez, O. Plchot, L. Burget, O. Glembek, and P. Matejka, "Language recognition in ivectors space, " Proceedings of Interspeech, Firenze, Italy, pp. 861-864, 2011.
- (2011) Proceedings of Interspeech , pp. 861-864
- Martinez, D.¹ Plchot, O.² Burget, L.³ Glembek, O.⁴ Matejka, P.⁵

13
- 84906242625
- TRAP language identification system for RATS phase II evaluation
- K. J. Han, S. Ganapathy, M. Li, M. K. Omar, and S. Narayanan, "TRAP language identification system for RATS phase II evaluation, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Han, K.J.¹ Ganapathy, S.² Li, M.³ Omar, M.K.⁴ Narayanan, S.⁵

14
- 84865750857
- Language recognition via i-vectors and dimensionality reduction
- N. Dehak, P. A. Torres-Carrasquillo, D. A. Reynolds, and R. Dehak, "Language recognition via i-vectors and dimensionality reduction, " in Proc. Interspeech, 2011, pp. 857-860.
- (2011) Proc. Interspeech , pp. 857-860
- Dehak, N.¹ Torres-Carrasquillo, P.A.² Reynolds, D.A.³ Dehak, R.⁴

15
- 84900522099
- Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification
- M. Li and S. Narayanan, "Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification, " Computer Speech & Language, 2014.
- (2014) Computer Speech & Language
- Li, M.¹ Narayanan, S.²

16
- 84890466219
- Speaker verification using simplified and supervised i-vector modeling
- M. Li, A. Tsiartas, M. Segbroeck, and S. Narayanan, "Speaker verification using simplified and supervised i-vector modeling, " in Proc. ICASSP, 2013.
- (2013) Proc. ICASSP
- Li, M.¹ Tsiartas, A.² Segbroeck, M.³ Narayanan, S.⁴

17
- 85073251381
- The RATS radio traffic collection system
- K. Walker and S. Strassel, "The RATS Radio Traffic Collection System, " in Odyssey 2012-The Speaker and Language Recognition Workshop, 2012.
- (2012) Odyssey 2012-The Speaker and Language Recognition Workshop
- Walker, K.¹ Strassel, S.²

18
- 84865718184
- I-vector based speaker recognition on short utterances
- International Speech Communication Association (ISCA)
- A. Kanagasundaram, R. Vogt, D. B. Dean, S. Sridharan, and M. W. Mason, "I-vector based speaker recognition on short utterances, " in Proceedings of the 12th Annual Conference of the International Speech Communication Association. International Speech Communication Association (ISCA), 2011, pp. 2341- 2344.
- (2011) Proceedings of the 12th Annual Conference of the International Speech Communication Association , pp. 2341-2344
- Kanagasundaram, A.¹ Vogt, R.² Dean, D.B.³ Sridharan, S.⁴ Mason, M.W.⁵

19
- 84867598363
- I-vectors in the context of phonetically-constrained short utterances for speaker verification
- A. Larcher, P. Bousquet, K. A. Lee, D. Matrouf, H. Li, and J.- F. Bonastre, "I-vectors in the context of phonetically-constrained short utterances for speaker verification, " in Proc. ICASSP IEEE, 2012, pp. 4773-4776.
- (2012) Proc. ICASSP IEEE , pp. 4773-4776
- Larcher, A.¹ Bousquet, P.² Lee, K.A.³ Matrouf, D.⁴ Li, H.⁵ Bonastre J.-., F.⁶

20
- 84906217020
- Improving language identification robustness to highly channel-degraded speech through multiple system fusion
- A. Lawson, M. McLaren, Y. Lei, V. Mitra, N. Scheffer, L. Ferrer, and M. Graciarena, "Improving language identification robustness to highly channel-degraded speech through multiple system fusion, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Lawson, A.¹ McLaren, M.² Lei, Y.³ Mitra, V.⁴ Scheffer, N.⁵ Ferrer, L.⁶ Graciarena, M.⁷

21
- 84906251395
- Improvements in language identification on the RATs noisy speech corpus
- J. Ma, B. Zhang, S. Matsoukas, S. H. Mallidi, F. Li, and H. Hermansky, "Improvements in language identification on the RATS noisy speech corpus, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Ma, J.¹ Zhang, B.² Matsoukas, S.³ Mallidi, S.H.⁴ Li, F.⁵ Hermansky, H.⁶

22
- 84910028543
- Modifiedprior i-vector estimation for language identification of short duration utterances
- submitted
- R. Travadi, M. Van Segbroeck, and S. S. Narayanan, "Modifiedprior i-vector estimation for language identification of short duration utterances, " in Proc. Interspeech, 2014, submitted.
- (2014) Proc. Interspeech
- Travadi, R.¹ Van Segbroeck, M.² Narayanan, S.S.³

23
- 44949114401
- Within-class covariance normalization for SVM-based speaker recognition
- A. O. Hatch, S. S. Kajarekar, and A. Stolcke, "Within-class covariance normalization for SVM-based speaker recognition, " in Proc. Interspeech, 2006.
- (2006) Proc. Interspeech
- Hatch, A.O.¹ Kajarekar, S.S.² Stolcke, A.³

24
- 33947637189
- Joint factor analysis of speaker and session variability: Theory and algorithms
- P. Kenny, "Joint factor analysis of speaker and session variability: Theory and algorithms, " CRIM, Montreal, (Report) CRIM-06/08- 13, 2005.
- (2005) CRIM, Montreal, (Report)
- Kenny, P.¹

25
- 84874234665
- Frame-based phonotactic language identification
- IEEE
- K. J. Han and J. Pelecanos, "Frame-based phonotactic language identification, " in Spoken Language Technology Workshop (SLT). IEEE, 2012, pp. 303-306.
- (2012) Spoken Language Technology Workshop (SLT) , pp. 303-306
- Han, K.J.¹ Pelecanos, J.²

26
- 0038532436
- Qualcomm-ICSI-OGI features for ASR
- A. G. Adami, L. Burget, S. Dupont, H. Garudadri, F. Grezl, H. Hermansky, P. Jain, S. S. Kajarekar, N. Morgan, and S. Sivadas, "Qualcomm-ICSI-OGI features for ASR." in Proc. Interspeech, 2002.
- (2002) Proc. Interspeech
- Adami, A.G.¹ Burget, L.² Dupont, S.³ Garudadri, H.⁴ Grezl, F.⁵ Hermansky, H.⁶ Jain, P.⁷ Kajarekar, S.S.⁸ Morgan, N.⁹ Sivadas, S.¹⁰

27
- 84906246377
- A robust frontend for VAD: Exploiting contextual, discriminative and spectral cues of human voice
- M. Van Segbroeck, A. Tsiartas, and S. Narayanan, "A robust frontend for VAD: Exploiting contextual, discriminative and spectral cues of human voice, " in Proc. Interspeech, 2013.
- (2013) Proc. Interspeech
- Van Segbroeck, M.¹ Tsiartas, A.² Narayanan, S.³

28
- 0019053271
- Comparison of parametric representations for monosyllabic word recognitions in continuously spoken sentences
- Aug
- S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognitions in continuously spoken sentences, " IEEE Transactions on Acoustics, Speech and Signal Processing, vol. 28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Transactions on Acoustics, Speech and Signal Processing , vol.28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

29
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Apr
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech, " Journal of the Acoustical Society of America, vol. 87, no. 4, pp. 1738-1752, Apr. 1990.
- (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

30
- 34547499683
- Incorporating auditory feature uncertainties in robust speaker identification
- Y. Shao, S. Srinivasan, and D. Wang, "Incorporating auditory feature uncertainties in robust speaker identification, " in Proc. ICASSP, 2002, pp. 277-280.
- (2002) Proc. ICASSP , pp. 277-280
- Shao, Y.¹ Srinivasan, S.² Wang, D.³

31
- 84890447859
- Spectro-temporal gabor features as a front end for ASR
- M. Kleinschmidt, "Spectro-temporal gabor features as a front end for ASR, " in Proc. Forum Acusticum Sevilla, 2002.
- (2002) Proc. Forum Acusticum Sevilla
- Kleinschmidt, M.¹

32
- 79959850251
- The NIST 2010 speaker recognition evaluation
- A. F. Martin and C. S. Greenberg, "The NIST 2010 speaker recognition evaluation, " in Proc. Interspeech, 2010, pp. 2726-2729.
- (2010) Proc. Interspeech , pp. 2726-2729
- Martin, A.F.¹ Greenberg, C.S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.