SCOPUS 정보 검색 플랫폼

Journal of the Acoustical Society of America

Volumn 131, Issue 2, 2012, Pages 1515-1528

Acoustic hole filling for sparse enrollment data using a cohort universal corpus for speaker recognition

(2) Suh, Jun Won a Hansen, John H L a

a UNIVERSITY OF TEXAS AT DALLAS (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC MODEL; AUDIO STREAM; EIGENVOICES; GAUSSIAN MIXTURE MODEL; HOLE FILLING; HUMAN LISTENERS; MACHINE PERFORMANCE; ORIGINAL SYSTEMS; SIMILARITY MEASUREMENTS; SPEAKER MODEL; SPEAKER MODELING; SPEAKER RECOGNITION; TEST DURATION; TEST MATERIALS;

ACOUSTICS; PHYSICS;

SPEECH RECOGNITION;

ALGORITHM; ARTICLE; HUMAN; PHONETICS; PHYSIOLOGY; RECOGNITION; SOUND DETECTION; SPEECH; SPEECH PERCEPTION;

ALGORITHMS; HUMANS; PHONETICS; RECOGNITION (PSYCHOLOGY); SOUND SPECTROGRAPHY; SPEECH ACOUSTICS; SPEECH PERCEPTION;

EID: 84857425027 PISSN: 00014966 EISSN: None Source Type: Journal
DOI: 10.1121/1.3672707 Document Type: Article

Times cited : (5)

References (34)

1
- 50249182472
- Discriminative in-set/out-of-set speaker recognition
- 10.1109/TASL.2006.881689
- Angkititrakul, P., and Hansen, J. H. L. (2007). Discriminative in-set/out-of-set speaker recognition., IEEE Trans. Audio, Speech, Lang. Process. 15, 498-508. 10.1109/TASL.2006.881689
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , pp. 498-508
- Angkititrakul, P.¹ Hansen, J.H.L.²

2
- 27144489164
- A tutorial on support vector machines for pattern recognition
- 10.1023/A:1009715923555
- Burges, C. (1998). A tutorial on support vector machines for pattern recognition., Data Min. Knowl. Discov. 2, 121-167. 10.1023/A:1009715923555
- (1998) Data Min. Knowl. Discov , vol.2 , pp. 121-167
- Burges, C.¹

3
- 84959118000
- The fisher corpus: A resource for the next generations of speech-to-text
- Lisbon, Portugal
- Cieri, C., Miller, D., and Walker, K. (2004). The fisher corpus: A resource for the next generations of speech-to-text., in Fourth International Conference on Language Resources and Evaluation May 2004, Lisbon, Portugal, Vol. 1, pp. 1-3.
- (2004) Fourth International Conference on Language Resources and Evaluation May 2004 , vol.1 , pp. 1-3
- Cieri, C.¹ Miller, D.² Walker, K.³

4
- 84947940111
- (Springer, Berlin, Germany)
- Furui, S. (1997). Recent Advances in Speaker Recognition (Springer, Berlin, Germany), pp. 235-252.
- (1997) Recent Advances in Speaker Recognition , pp. 235-252
- Furui, S.¹

5
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains
- 10.1109/89.279278
- Gauvain, J., and Lee, C. (1994). Maximum a posteriori estimation for multivariate Gaussian mixture observations of markov chains., IEEE Trans. Speech Audio Process. 2, 291-298. 10.1109/89.279278
- (1994) IEEE Trans. Speech Audio Process , vol.2 , pp. 291-298
- Gauvain, J.¹ Lee, C.²

6
- 0028516097
- Text-independent speaker identification
- 10.1109/79.317924
- Gish, H., and Schmidt, M. (1994). Text-independent speaker identification., IEEE Signal Process. Mag. 11, 18-32. 10.1109/79.317924
- (1994) IEEE Signal Process. Mag , vol.11 , pp. 18-32
- Gish, H.¹ Schmidt, M.²

7
- 85073095401
- Human assisted speaker recognition in NIST SRE10
- Brno, Czech Republic
- Greenberg, G., Martin, A., Brandschain, L., Campbell, J., Cieri, C., Doddington, G., and Godfrey, J. (2010). Human assisted speaker recognition in NIST SRE10., in Odyssey 2010, Brno, Czech Republic, pp. 180-185.
- (2010) Odyssey 2010 , pp. 180-185
- Greenberg, G.¹ Martin, A.² Brandschain, L.³ Campbell, J.⁴ Cieri, C.⁵ Doddington, G.⁶ Godfrey, J.⁷

8
- 85008020310
- Speechfind: Advances in spoken document retrieval for a national gallery of the spoken word
- 10.1109/TSA.2005.852088
- Hansen, J. H. L., Huang, R., Zhou, B., Seadle, M., Deller, J., Gurijala, A., Kurimo, M., and Angkititrakul, P. (2005). Speechfind: advances in spoken document retrieval for a national gallery of the spoken word., IEEE Trans. Speech Audio Process. 13, 712-730. 10.1109/TSA.2005.852088
- (2005) IEEE Trans. Speech Audio Process , vol.13 , pp. 712-730
- Hansen, J.H.L.¹ Huang, R.² Zhou, B.³ Seadle, M.⁴ Deller, J.⁵ Gurijala, A.⁶ Kurimo, M.⁷ Angkititrakul, P.⁸

9
- 85009152939
- CU-move: Robust speech processing for in-vehicle speech systems
- Beijing, China
- Hansen, J. H. L., Plucienkowski, J., Gallant, S., Pellom, B., and Ward, W. (2000). CU-move: Robust speech processing for in-vehicle speech systems., in ICSLP 2000, Beijing, China, pp. 524-527.
- (2000) ICSLP 2000 , pp. 524-527
- Hansen, J.H.L.¹ Plucienkowski, J.² Gallant, S.³ Pellom, B.⁴ Ward, W.⁵

10
- 0000375621
- A robust version of the probability ratio test
- 10.1214/aoms/1177699803
- Huber, P. (1965). A robust version of the probability ratio test., Ann Math. Stat. 36, 1753-1758. 10.1214/aoms/1177699803
- (1965) Ann Math. Stat , vol.36 , pp. 1753-1758
- Huber, P.¹

11
- 18744386134
- Eigenvoice modeling with sparse training data
- 10.1109/TSA.2004.840940
- Kenny, P., Boulianne, G., and Dumouchel, P. (2005). Eigenvoice modeling with sparse training data., IEEE Trans. Audio, Speech, Lang. Proc. 13, 345-354. 10.1109/TSA.2004.840940
- (2005) IEEE Trans. Audio, Speech, Lang. Proc , vol.13 , pp. 345-354
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

12
- 43249091937
- Speaker and session variability in GMM-based speaker verification
- 10.1109/TASL.2007.894527
- Kenny, P., Boulianne, G., Ouellet, P., and Dumouchel, P. (2007). Speaker and session variability in GMM-based speaker verification., IEEE Trans. Audio, Speech, Lang. Process. 15, 1448-1460. 10.1109/TASL.2007.894527
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , pp. 1448-1460
- Kenny, P.¹ Boulianne, G.² Ouellet, P.³ Dumouchel, P.⁴

13
- 0002044263
- (World Scientific Publishing, MA)
- Kressel, U., and Schurmann, J. (1997). Pattern Classification Techniques Based on Function Approximation, (World Scientific Publishing, MA), pp. 49-78.
- (1997) Pattern Classification Techniques Based on Function Approximation , pp. 49-78
- Kressel, U.¹ Schurmann, J.²

14
- 0034320005
- Rapid speaker adaptation in Eigenvoice space
- 10.1109/89.876308
- Kuhn, R., Junqua, J., Nguyen, P., and Niedzielski, N. (2000). Rapid speaker adaptation in Eigenvoice space., IEEE Trans. Speech Audio Process. 8, 695-707. 10.1109/89.876308
- (2000) IEEE Trans. Speech Audio Process , vol.8 , pp. 695-707
- Kuhn, R.¹ Junqua, J.² Nguyen, P.³ Niedzielski, N.⁴

15
- 0001927585
- On information and sufficiency
- 10.1214/aoms/1177729694
- Kullback, S., and Leibler, R. (1951). On information and sufficiency., Ann. Math. Stat. 22, 79-86. 10.1214/aoms/1177729694
- (1951) Ann. Math. Stat , vol.22 , pp. 79-86
- Kullback, S.¹ Leibler, R.²

16
- 85135371588
- High performance speaker-independent phone recognition using CDHMM
- Berlin, Germany
- Lamel, L., and Gauvain, J. (1993). High performance speaker-independent phone recognition using CDHMM., in EUROSPEECH 1993, Berlin, Germany, pp. 121-124.
- (1993) EUROSPEECH 1993 , pp. 121-124
- Lamel, L.¹ Gauvain, J.²

17
- 0024768209
- Speaker-independent phone recognition using hidden Markov models
- 10.1109/29.46546
- Lee, K., and Hon, H. (1989). Speaker-independent phone recognition using hidden Markov models., IEEE Trans. Acoust., Speech, Signal Process. 37, 1641-1648. 10.1109/29.46546
- (1989) IEEE Trans. Acoust., Speech, Signal Process , vol.37 , pp. 1641-1648
- Lee, K.¹ Hon, H.²

18
- 84857387950
- (CRC Press, New York)
- Li, Q., Juang, B., Lee, C., Zhou, Q., and Soong, F. (2003). Speaker Authentication (CRC Press, New York), pp. 229-259.
- (2003) Speaker Authentication , pp. 229-259
- Li, Q.¹ Juang, B.² Lee, C.³ Zhou, Q.⁴ Soong, F.⁵

19
- 44949143337
- Speaker cluster based GMM tokenization for speaker recognition
- Pittsburgh, PA
- Ma, B., Zhu, D., Tong, R., and Li, H. (2006). Speaker cluster based GMM tokenization for speaker recognition., in INTERSPEECH 2006, Pittsburgh, PA, Vol. 1, pp. 505-508.
- (2006) INTERSPEECH 2006 , vol.1 , pp. 505-508
- Ma, B.¹ Zhu, D.² Tong, R.³ Li, H.⁴

20
- 33947670488
- A comparison of various adaptation methods for speaker verification with limited enrollment data
- Toulouse, France
- Mak, M., Hsiao, R., and Mak, B. (2006). A comparison of various adaptation methods for speaker verification with limited enrollment data., in ICASSP 2006, Toulouse, France, Vol. 1, pp. 929-932.
- (2006) ICASSP 2006 , vol.1 , pp. 929-932
- Mak, M.¹ Hsiao, R.² Mak, B.³

21
- 84857407484
- Structural linear model-space transformations for speaker adaptation
- Geneva, Switzerland
- Matrouf, D., Bellot, O., Nocera, P., Linares, G., and Bonastre, J. (2003). Structural linear model-space transformations for speaker adaptation., in EUROSPEECH 2003, Geneva, Switzerland, pp. 1625-1628.
- (2003) EUROSPEECH 2003 , pp. 1625-1628
- Matrouf, D.¹ Bellot, O.² Nocera, P.³ Linares, G.⁴ Bonastre, J.⁵

22
- 0017992187
- Frequency of occurrence of phonemes in conversational English
- Mines, M., Hanson, B., and Shoup, J. (1978). Frequency of occurrence of phonemes in conversational English. Lang. Speech 21, 221-241.
- (1978) Lang. Speech , vol.21 , pp. 221-241
- Mines, M.¹ Hanson, B.² Shoup, J.³

23
- 84867213267
- Language and genre detection in audio content analysis
- Brisbane, Australia
- Mitra, V., Garcia-Romero, D., and Espy-Wilson, C. (2008). Language and genre detection in audio content analysis., INTERSPEECH-2008, Brisbane, Australia, pp. 2506-2509.
- (2008) INTERSPEECH-2008 , pp. 2506-2509
- Mitra, V.¹ Garcia-Romero, D.² Espy-Wilson, C.³

24
- 32644450332
- Feature warping for robust speaker verification
- Pelecanos, J., and Sridharan, S. (2001). Feature warping for robust speaker verification., Proc. Speaker Odyssey 13, 1-5.
- (2001) Proc. Speaker Odyssey , vol.13 , pp. 1-5
- Pelecanos, J.¹ Sridharan, S.²

25
- 63049100830
- In-set/out-of-set speaker recognition under sparse enrollment
- 10.1109/TASL.2007.902058
- Prakash, V., and Hansen, J. H. L. (2007). In-set/out-of-set speaker recognition under sparse enrollment., IEEE Trans. Audio, Speech, Lang. Process. 15, 2044-2052. 10.1109/TASL.2007.902058
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , pp. 2044-2052
- Prakash, V.¹ Hansen, J.H.L.²

26
- 0033884858
- Speaker verification using adapted gaussian mixture models
- 10.1006/dspr.1999.0361
- Reynolds, D., Quatieri, T., and Dunn, R. (2000). Speaker verification using adapted gaussian mixture models., Digit. Signal Process. 10, 19-41. 10.1006/dspr.1999.0361
- (2000) Digit. Signal Process , vol.10 , pp. 19-41
- Reynolds, D.¹ Quatieri, T.² Dunn, R.³

27
- 44949231596
- A multiclass framework for speaker verification within an acoustic event sequence system
- Pittsburgh, PA
- Scheffer, N., and Bonastre, J. (2006). A multiclass framework for speaker verification within an acoustic event sequence system., in INTERSPEECH 2006, Pittsburgh, PA, pp. 501-504.
- (2006) INTERSPEECH 2006 , pp. 501-504
- Scheffer, N.¹ Bonastre, J.²

28
- 0033889739
- Speaker verification by human listeners: Experiments comparing human and machine performance using the NIST 1998 Speaker Evaluation Data 1
- 10.1006/dspr.1999.0356
- Schmidt-Nielsen, A., and Crystal, T. (2000). Speaker verification by human listeners: Experiments comparing human and machine performance using the NIST 1998 Speaker Evaluation Data 1., Digit. Signal Process. 10, 249-266. 10.1006/dspr.1999.0356
- (2000) Digit. Signal Process , vol.10 , pp. 249-266
- Schmidt-Nielsen, A.¹ Crystal, T.²

29
- 84857407659
- USSS-MITLL 2010 Human Assisted Speaker Recognition Evaluation System
- Brno, Czech Republic
- Schwartz, R., Campbell, J., Shen, W., D., S., Campbell, W., Richardson, F., Dunn, R., and Granville, R. (2010). USSS-MITLL 2010 Human Assisted Speaker Recognition Evaluation System., in NIST SRE Workshop 2010, Brno, Czech Republic, pp. 1-7.
- (2010) NIST SRE Workshop 2010 , pp. 1-7
- Schwartz, R.¹ Campbell, J.² Shen, W.D.S.³ Campbell, W.⁴ Richardson, F.⁵ Dunn, R.⁶ Granville, R.⁷

30
- 85009207995
- Score normalisation applied to open-set, text-independent speaker identification
- Geneva, Switzerland
- Sivakumaran, P., Fortuna, J., and Ariyaeeinia, A. (2003). Score normalisation applied to open-set, text-independent speaker identification., in EUROSPEECH, Geneva, Switzerland, pp. 2669-2672.
- (2003) EUROSPEECH , pp. 2669-2672
- Sivakumaran, P.¹ Fortuna, J.² Ariyaeeinia, A.³

31
- 0022229052
- A vector quantization approach to speaker recognition
- FL
- Soong, F., Rosenberg, A., Rabiner, L., and Juang, L. (1985). A vector quantization approach to speaker recognition., in ICASSP 1985, FL, pp. 387-390.
- (1985) ICASSP 1985 , pp. 387-390
- Soong, F.¹ Rosenberg, A.² Rabiner, L.³ Juang, L.⁴

32
- 85009275225
- Approaches to language identification using Gaussian mixture models and shifted delta cepstral features
- Tampa, FL
- Torres-Carrasquillo, P., Singer, E., Kohler, M., Greene, R., Reynolds, D., and Deller, Jr., J. (2002). Approaches to language identification using Gaussian mixture models and shifted delta cepstral features., in ICSLP 2002, Tampa, FL, pp. 89-92.
- (2002) ICSLP 2002 , pp. 89-92
- Torres-Carrasquillo, P.¹ Singer, E.² Kohler, M.³ Greene, R.⁴ Reynolds, D.⁵ Deller Jr., J.⁶

33
- 64549092742
- A cohort-based speaker model synthesis for mismatched channels in speaker verification
- 10.1109/TASL.2007.899297
- Wu, W., Zheng, T., Xu, M., and Soong, F. (2007). A cohort-based speaker model synthesis for mismatched channels in speaker verification., IEEE Trans. Audio, Speech, Lang. Process. 15, 1893-1903. 10.1109/TASL.2007.899297
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , pp. 1893-1903
- Wu, W.¹ Zheng, T.² Xu, M.³ Soong, F.⁴

34
- 0041360472
- Efficient text-independent speaker verification with structural gaussian mixture models and neural network
- 10.1109/TSA.2003.815822
- Xiang, B., and Berger, T. (2003). Efficient text-independent speaker verification with structural gaussian mixture models and neural network., IEEE Trans. Audio, Speech, Lang. Process. 11, 447-456. 10.1109/TSA.2003.815822
- (2003) IEEE Trans. Audio, Speech, Lang. Process , vol.11 , pp. 447-456
- Xiang, B.¹ Berger, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.