SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 5, 2006, Pages 1557-1565

An overview of automatic speaker diarization systems

(2) Tranter, Sue E a,b Reynolds, Douglas A a,c

a IEEE (United Kingdom)

b UNIVERSITY OF CAMBRIDGE (United Kingdom)

c MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Speaker diarization; Speaker segmentation and clustering

Indexed keywords

INPUT AUDIO CHANNELS; SIGNAL ENERGY; SPEAKER DIARIZATION; SPEAKER SEGMENTATION AND CLUSTERING;

ACOUSTIC NOISE; BROADCASTING; COMMUNICATION CHANNELS (INFORMATION THEORY); INFORMATION ANALYSIS; SPEECH RECOGNITION;

SPEECH PROCESSING;

EID: 34047261805 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.878256 Document Type: Review

Times cited : (542)

References (58)

1
- 33646380923
- Approaches and applications of audio diarization
- Philadelphia, PA, Mar
- D. A. Reynolds and P. Torres-Carrasquillo, "Approaches and applications of audio diarization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. V, Philadelphia, PA, Mar. 2005, pp. 953-956.
- (2005) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.5 , pp. 953-956
- Reynolds, D.A.¹ Torres-Carrasquillo, P.²

2
- 0029765670
- Real-time discrimination of broadcast speech/music
- Atlanta, GA, May
- J. Saunders, "Real-time discrimination of broadcast speech/music," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. II, Atlanta, GA, May 1996, pp. 993-996.
- (1996) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 993-996
- Saunders, J.¹

3
- 0032181880
- Audio feature extraction and analysis for scene segmentation and classification
- Oct
- Z. Liu, Y. Wang, and T. Chen, "Audio feature extraction and analysis for scene segmentation and classification," J. VLSI Signal Process. Syst., vol. 20, no. 1-2, pp. 61-79, Oct. 1998.
- (1998) J. VLSI Signal Process. Syst , vol.20 , Issue.1-2 , pp. 61-79
- Liu, Z.¹ Wang, Y.² Chen, T.³

4
- 0033677117
- A method for direct audio search with applications to indexing and retrieval
- Istanbul, Turkey, Jun
- S. E. Johnson and P. C. Woodland, "A method for direct audio search with applications to indexing and retrieval," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 3, Istanbul, Turkey, Jun. 2000, pp. 1427-1430.
- (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.3 , pp. 1427-1430
- Johnson, S.E.¹ Woodland, P.C.²

5
- 0141702097
- Toward domain independent clustering
- China, Apr
- Y. Moh, P. Nguyen, and J.-C. Junqua, "Toward domain independent clustering," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. II, China, Apr. 2003, pp. 85-88.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 85-88
- Moh, Y.¹ Nguyen, P.² Junqua, J.-C.³

6
- 33745560829
- Robust speaker segmentation for meetings: The ICSI-SRI Spring 2005 Diarization System
- Edinburgh, U.K, Jul
- X. Anguera, C. Wooters, B. Peskin, and M. Aguiló, "Robust speaker segmentation for meetings: The ICSI-SRI Spring 2005 Diarization System," in Proc. Machine Learning for Multimodal Interaction Workshop (MLMI), Edinburgh, U.K., Jul. 2005, pp. 402-414.
- (2005) Proc. Machine Learning for Multimodal Interaction Workshop (MLMI) , pp. 402-414
- Anguera, X.¹ Wooters, C.² Peskin, B.³ Aguiló, M.⁴

7
- 34047262286
- Online, Available
- Benchmark Tests: Rich Transcription (RT). NIST. [Online]. Available: http://www.nist.gov/speech/tests/rt/
- Benchmark Tests: Rich Transcription (RT). NIST

8
- 34047246425
- Online, Available
- Benchmark Tests: Speaker Recognition. NIST. [Online]. Available: http://www.nist.gov/speech/tests/spk/
- Benchmark Tests: Speaker Recognition. NIST

9
- 85009110150
- Speaker recognition in a multi-speaker environment
- Aalborg, Denmark, Sep
- A. Martin and M. Przybocki, "Speaker recognition in a multi-speaker environment," in Proc. Eur. Conf. Speech Commun. Technol., vol. 2, Aalborg, Denmark, Sep. 2001, pp. 787-790.
- (2001) Proc. Eur. Conf. Speech Commun. Technol , vol.2 , pp. 787-790
- Martin, A.¹ Przybocki, M.²

10
- 33947623018
- Using a-priori information for speaker diarization
- Toledo, Spain, May
- D. Moraru, L. Besacier, and E. Castelli, "Using a-priori information for speaker diarization," in Proc. Odyssey Speaker and Language Recognition Workshop, Toledo, Spain, May 2004, pp. 355-362.
- (2004) Proc. Odyssey Speaker and Language Recognition Workshop , pp. 355-362
- Moraru, D.¹ Besacier, L.² Castelli, E.³

11
- 34047268275
- Toward Robust speaker segmentation: The ICSI-SRI Fall 2004 Diarization System
- Palisades, NY, Nov, Online, Available
- C. Wooters, J. Fung, B. Peskin, and X. Anguera, "Toward Robust speaker segmentation: The ICSI-SRI Fall 2004 Diarization System," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), Palisades, NY, Nov. 2004, [Online]. Available: http://www.icsi.berkeley.edu/cgi-bin/pubs/ publication.pl?ID=000100.
- (2004) Proc. Fall 2004 Rich Transcription Workshop (RT-04)
- Wooters, C.¹ Fung, J.² Peskin, B.³ Anguera, X.⁴

12
- 34047267089
- P. Nguyen, L. Rigazio, Y. Moh, and J. C. Junqua. Rich transcription 2002 site report. Panasonic speech technology laboratory (PSTL). presented at Proc. Rich Transcription Workshop (RT-02). [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2002/presentations/rt02.pdf
- P. Nguyen, L. Rigazio, Y. Moh, and J. C. Junqua. Rich transcription 2002 site report. Panasonic speech technology laboratory (PSTL). presented at Proc. Rich Transcription Workshop (RT-02). [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2002/presentations/rt02.pdf

13
- 85128356454
- Partitioning and transcription of broadcast news data
- Sydney, Australia, Dec
- J.-L. Gauvain, L. Lamel, and G. Adda, "Partitioning and transcription of broadcast news data," in Proc. Int. Conf. Spoken Lang. Process., vol. 4, Sydney, Australia, Dec. 1998, pp. 1335-1338.
- (1998) Proc. Int. Conf. Spoken Lang. Process , vol.4 , pp. 1335-1338
- Gauvain, J.-L.¹ Lamel, L.² Adda, G.³

14
- 33745185104
- Combining speaker identification and BIC for speaker diarization
- Lisbon, Portugal, Sep
- X. Zhu, C. Barras, S. Meignier, and J.-L. Gauvain, "Combining speaker identification and BIC for speaker diarization," in Proc. Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 2441-2444.
- (2005) Proc. Eur. Conf. Speech Commun. Technol , pp. 2441-2444
- Zhu, X.¹ Barras, C.² Meignier, S.³ Gauvain, J.-L.⁴

15
- 34047264090
- The MIT Lincoln Laboratory RT-04F diarization systems: Applications to broadcast audio and telephone conversations
- Palisades, NY, Nov
- D. A. Reynolds and P. Torres-Carrasquillo, "The MIT Lincoln Laboratory RT-04F diarization systems: Applications to broadcast audio and telephone conversations," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), Palisades, NY, Nov. 2004.
- (2004) Proc. Fall 2004 Rich Transcription Workshop (RT-04)
- Reynolds, D.A.¹ Torres-Carrasquillo, P.²

16
- 0002751623
- Segment generation and clustering in the HTK broadcast news transcription system
- presented at, Online, Available
- T. Hain, S. E. Johnson, A. Tuerk, P. C. Woodland, and S. J. Young. Segment generation and clustering in the HTK broadcast news transcription system, presented at Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop. [Online]. Available: http://mi.eng.cam.ac.uk/ reports/abstracts/hain_darpa98.html
- Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop
- Hain, T.¹ Johnson, S.E.² Tuerk, A.³ Woodland, P.C.⁴ Young, S.J.⁵

17
- 33745200276
- The Cambridge University March 2005 speaker diarization system
- Lisbon, Portugal, Sep
- R. Sinha, S. E. Tranter, M. J. F. Gales, and P. C. Woodland, "The Cambridge University March 2005 speaker diarization system," in Proc. Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 2437-2440.
- (2005) Proc. Eur. Conf. Speech Commun. Technol , pp. 2437-2440
- Sinha, R.¹ Tranter, S.E.² Gales, M.J.F.³ Woodland, P.C.⁴

18
- 85119434191
- Fast speaker change detection for broadcast news transcription and indexing
- Budapest, Hungary, Sep
- D. Liu and F. Kubala, "Fast speaker change detection for broadcast news transcription and indexing," in Proc. Eur. Conf. Speech Commun. Technol., vol. III, Budapest, Hungary, Sep. 1999, pp. 1031-1034.
- (1999) Proc. Eur. Conf. Speech Commun. Technol , vol.3 , pp. 1031-1034
- Liu, D.¹ Kubala, F.²

19
- 29044442235
- Step-by-Step and integrated approaches in broadcast news speaker diarization
- to be published, Sep
- S. Meignier, D. Moraru, C. Fredouille, J.-F. Bonastre, and L. Besacier, "Step-by-Step and integrated approaches in broadcast news speaker diarization," Comput. Speech Lang., no. 20, pp. 303-330, Sep. 2005, to be published.
- (2005) Comput. Speech Lang , Issue.20 , pp. 303-330
- Meignier, S.¹ Moraru, D.² Fredouille, C.³ Bonastre, J.-F.⁴ Besacier, L.⁵

20
- 33646779383
- Speaker diarization for broadcast news
- Toledo, Spain, Jun
- S. E. Tranter and D. A. Reynolds, "Speaker diarization for broadcast news," in Proc. Odyssey Speaker and Language Recognition Workshop, Toledo, Spain, Jun. 2004, pp. 337-344.
- (2004) Proc. Odyssey Speaker and Language Recognition Workshop , pp. 337-344
- Tranter, S.E.¹ Reynolds, D.A.²

21
- 4544280424
- Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech
- Montreal, QC, Canada, May
- S. E. Tranter, K. Yu, G. Evermann, and P. C. Woodland, "Generating and evaluating segmentations for automatic speech recognition of conversational telephone speech," in Proc. ICASSP, vol. I, Montreal, QC, Canada, May 2004, pp. 753-756.
- (2004) Proc. ICASSP , vol.1 , pp. 753-756
- Tranter, S.E.¹ Yu, K.² Evermann, G.³ Woodland, P.C.⁴

22
- 4544259164
- A cross-channel modeling approach for automatic segmentation of conversational telephone speech
- St. Thomas, U.S. Virgin Islands, Dec
- D. Liu and F. Kubala, "A cross-channel modeling approach for automatic segmentation of conversational telephone speech," in Proc. IEEE ASRU Workshop, St. Thomas, U.S. Virgin Islands, Dec. 2003, pp. 333-338.
- (2003) Proc. IEEE ASRU Workshop , pp. 333-338
- Liu, D.¹ Kubala, F.²

23
- 34047260199
- The TNO speaker diarization system for NIST RT05s meeting data
- Edinburgh, UK, Jul
- D. A. van Leeuwan, "The TNO speaker diarization system for NIST RT05s meeting data," in Proc. Machine Learning for Multimodal Interaction Workshop (MLMI), Edinburgh, UK, Jul. 2005, pp. 440-449.
- (2005) Proc. Machine Learning for Multimodal Interaction Workshop (MLMI) , pp. 440-449
- van Leeuwan, D.A.¹

24
- 33745572731
- NIST RT'05 evaluation: Preprocessing techniques and speaker diarization on multiple microphone meetings
- Edinburgh, U.K, Jul
- D. Istrate, C. Fredouille, S. Meignier, L. Besacier, and J.-F. Bonastre, "NIST RT'05 evaluation: Preprocessing techniques and speaker diarization on multiple microphone meetings," in Proc. Machine Learning for Multimodal Interaction Workshop (MLMI), Edinburgh, U.K., Jul. 2005, pp. 428-439.
- (2005) Proc. Machine Learning for Multimodal Interaction Workshop (MLMI) , pp. 428-439
- Istrate, D.¹ Fredouille, C.² Meignier, S.³ Besacier, L.⁴ Bonastre, J.-F.⁵

25
- 34047264756
- The macquarie speaker diarization system for RT05s
- Edinburgh, UK, Jul
- S. Cassidy, "The macquarie speaker diarization system for RT05s," in Proc. NIST Spring Rich Transcription Evaluation Workshop (RT-05s), Edinburgh, UK, Jul. 2005.
- (2005) Proc. NIST Spring Rich Transcription Evaluation Workshop (RT-05s)
- Cassidy, S.¹

26
- 0141469852
- Multispeaker speech activity detection for the ICSI meeting recorder
- Trento, Italy, Dec
- T. Pfau, D. Ellis, and A. Stolcke, "Multispeaker speech activity detection for the ICSI meeting recorder," in Proc. IEEE ASRU Workshop, Trento, Italy, Dec. 2001, pp. 107-110.
- (2001) Proc. IEEE ASRU Workshop , pp. 107-110
- Pfau, T.¹ Ellis, D.² Stolcke, A.³

27
- 33745577702
- The rich transcription 2005 spring meeting recogntion evaluation
- Edinburgh, UK, Jul
- J. G. Fiscus, N. Radde, J. S. Garofolo, A. Le, J. Ajot, and C. Laprun, "The rich transcription 2005 spring meeting recogntion evaluation," in Proc. Machine Learning for Multimodal Interaction Workshop (MLMI), Edinburgh, UK, Jul. 2005, pp. 369-389.
- (2005) Proc. Machine Learning for Multimodal Interaction Workshop (MLMI) , pp. 369-389
- Fiscus, J.G.¹ Radde, N.² Garofolo, J.S.³ Le, A.⁴ Ajot, J.⁵ Laprun, C.⁶

28
- 0002595416
- Speaker, environment and channel change detection and clustering via the bayesian information criterion
- Lansdowne, VA
- S. S. Chen and P. S. Gopalakrishnam, "Speaker, environment and channel change detection and clustering via the bayesian information criterion," in Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, VA, 1998, pp. 127-132.
- (1998) Proc. 1998 DARPA Broadcast News Transcription and Understanding Workshop , pp. 127-132
- Chen, S.S.¹ Gopalakrishnam, P.S.²

29
- 85009089453
- Unsupervised audio stream segmentation and clustering via the Bayesian information criterion
- Beijing, China, Oct
- B. Zhou and J. Hansen, "Unsupervised audio stream segmentation and clustering via the Bayesian information criterion," in Proc. Int. Conf. Spoken Language Process., vol. 3, Beijing, China, Oct. 2000, pp. 714-717.
- (2000) Proc. Int. Conf. Spoken Language Process , vol.3 , pp. 714-717
- Zhou, B.¹ Hansen, J.²

30
- 0002782496
- Automatic segmentation, classification and clustering of broadcast news
- Chantilly, VA, Feb
- M. A. Siegler, U. Jain, B. Raj, and R. M. Stem, "Automatic segmentation, classification and clustering of broadcast news," in Proc. DARPA Speech Recognition Workshop, Chantilly, VA, Feb. 1997, pp. 97-99.
- (1997) Proc. DARPA Speech Recognition Workshop , pp. 97-99
- Siegler, M.A.¹ Jain, U.² Raj, B.³ Stem, R.M.⁴

31
- 33745218307
- Improving speaker diarization
- Palisades, NY, Nov, Online, Available
- C. Barras, X. Zhu, S. Meignier, and J.-L. Gauvain, "Improving speaker diarization," in Proc. Fall Rich Transcription Workshop (RT-04), Palisades, NY, Nov. 2004, [Online]. Available: http://www.limsi.fr/ Individu/barras/publis/rt04f_diarization.pdf.
- (2004) Proc. Fall Rich Transcription Workshop (RT-04)
- Barras, C.¹ Zhu, X.² Meignier, S.³ Gauvain, J.-L.⁴

32
- 85009266843
- Unsupervised speaker segmentation of telephone conversations
- Denver, CO, Sep
- A. E. Rosenberg, A. Gorin, Z. Liu, and S. Parthasarathy, "Unsupervised speaker segmentation of telephone conversations," in Proc. Int. Conf. Spoken Language Process., Denver, CO, Sep. 2002, pp. 565-568.
- (2002) Proc. Int. Conf. Spoken Language Process , pp. 565-568
- Rosenberg, A.E.¹ Gorin, A.² Liu, Z.³ Parthasarathy, S.⁴

33
- 4544282389
- Benefits of prior acoustic segmentation for automatic speaker segmentation
- Montreal, QC, Canada, May
- S. Meignier, D. Moraru, C. Fredouille, L. Besacier, and J.-F. Bonastre, "Benefits of prior acoustic segmentation for automatic speaker segmentation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. I, Montreal, QC, Canada, May 2004, pp. 397-400.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 397-400
- Meignier, S.¹ Moraru, D.² Fredouille, C.³ Besacier, L.⁴ Bonastre, J.-F.⁵

34
- 0141814603
- Online speaker clustering
- Hong Kong, China, Apr
- D. Liu and F. Kubala, "Online speaker clustering," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. I, Hong Kong, China, Apr. 2003, pp. 572-575.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 572-575
- Liu, D.¹ Kubala, F.²

35
- 34047268274
- D. Moraru, S. Meignier, L. Besacier, J.-F. Bonastre, and I. Magrin-Chagnolleau. The ELISA consortium approaches in speaker segmentation during the NIST 2002 speaker recognition evaluation, presented at Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.. [Online]. Available: http://www.lia.univ-avignon.fr/fich.art/339-moricassp2003.pdf
- D. Moraru, S. Meignier, L. Besacier, J.-F. Bonastre, and I. Magrin-Chagnolleau. The ELISA consortium approaches in speaker segmentation during the NIST 2002 speaker recognition evaluation, presented at Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.. [Online]. Available: http://www.lia.univ-avignon.fr/fich.art/339-moricassp2003.pdf

36
- 34047263722
- Segmentation, classification and clustering of an Italian corpus
- Paris, France, Apr, Online, Available
- M. Cettolo, "Segmentation, classification and clustering of an Italian corpus," in Proc. Recherche d'Information Assisté par Ordinateur (RIAO), Paris, France, Apr. 2000, [Online]. Available: http://munst.itc.it/people/cettolo/papers/riao00a.ps.gz.
- (2000) Proc. Recherche d'Information Assisté par Ordinateur (RIAO)
- Cettolo, M.¹

37
- 84946742526
- J. Ajmera and C. Wooters, A Robust Speaker Clustering Algorithm, in Proc. IEEE ASRU Workshop, St Thomas, U.S. Virgin Islands, Nov. 2003, pp. 411-416.
- J. Ajmera and C. Wooters, "A Robust Speaker Clustering Algorithm," in Proc. IEEE ASRU Workshop, St Thomas, U.S. Virgin Islands, Nov. 2003, pp. 411-416.

38
- 33745219648
- The development of the Cambridge University RT-04 diarization system
- Palisades, NY, Nov, Online, Available
- S. E. Tranter, M. J. F. Gales, R. Sinha, S. Umesh, and P. C. Woodland, "The development of the Cambridge University RT-04 diarization system," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), Palisades, NY, Nov. 2004, [Online]. Available: http://mi.eng.cam.ac.uk/reports/ abstracts/tranter_rt04.html.
- (2004) Proc. Fall 2004 Rich Transcription Workshop (RT-04)
- Tranter, S.E.¹ Gales, M.J.F.² Sinha, R.³ Umesh, S.⁴ Woodland, P.C.⁵

39
- 0003128649
- Automatic speaker clustering
- Chantilly, VA, Feb
- H. Jin, F. Kubala, and R. Schwartz, "Automatic speaker clustering," in Proc. DARPA Speech Recognition Workshop, Chantilly, VA, Feb. 1997, pp. 108-111.
- (1997) Proc. DARPA Speech Recognition Workshop , pp. 108-111
- Jin, H.¹ Kubala, F.² Schwartz, R.³

40
- 77951283289
- Speaker diarization using bottom-up clustering based on a parameter-derived distance between adapted GMMs
- Jeju Island, Korea, Oct
- M. Ben, M. Betser, F. Bimbot, and G. Gravier, "Speaker diarization using bottom-up clustering based on a parameter-derived distance between adapted GMMs," in Proc. Int. Conf. Spoken Language Processing, Jeju Island, Korea, Oct. 2004, pp. 2329-2332.
- (2004) Proc. Int. Conf. Spoken Language Processing , pp. 2329-2332
- Ben, M.¹ Betser, M.² Bimbot, F.³ Gravier, G.⁴

41
- 0033677065
- Evolutive HMM for multispeaker tracking system
- Istanbul, Turkey, Jun
- S. Meignier, J.-F. Bonastre, C. Fredouille, and T. Merlin, "Evolutive HMM for multispeaker tracking system," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. II, Istanbul, Turkey, Jun. 2000, pp. 1201-1204.
- (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 1201-1204
- Meignier, S.¹ Bonastre, J.-F.² Fredouille, C.³ Merlin, T.⁴

42
- 0141809272
- E-HMM approach for learning and adapting sound models for speaker indexing
- Crete, Greece, Jun
- S. Meignier, J.-F. Bonastre, and S. Igounet, "E-HMM approach for learning and adapting sound models for speaker indexing," in Proc. Odyssey Speaker and Language Recognition Workshop, Crete, Greece, Jun. 2001, pp. 175-180.
- (2001) Proc. Odyssey Speaker and Language Recognition Workshop , pp. 175-180
- Meignier, S.¹ Bonastre, J.-F.² Igounet, S.³

43
- 85128386923
- Blind clustering of speech utterances based on speaker and language characteristics
- Sydney, Australia, Dec
- D. Reynolds, E. Singer, B. Carlson, J. O'Leary, J. McLaughlin, and M. Zissman, "Blind clustering of speech utterances based on speaker and language characteristics," in Proc. Int. Conf. Spoken Language Process., vol. 7, Sydney, Australia, Dec. 1998, pp. 3193-3196.
- (1998) Proc. Int. Conf. Spoken Language Process , vol.7 , pp. 3193-3196
- Reynolds, D.¹ Singer, E.² Carlson, B.³ O'Leary, J.⁴ McLaughlin, J.⁵ Zissman, M.⁶

44
- 85073258179
- Feature warping for Robust speaker verification
- Crete, Greece, Jun
- J. Pelecanos and S. Sridharan, "Feature warping for Robust speaker verification," in Proc. Odyssey Speaker and Language Recognition Workshop, Crete, Greece, Jun. 2001, pp. 213-218.
- (2001) Proc. Odyssey Speaker and Language Recognition Workshop , pp. 213-218
- Pelecanos, J.¹ Sridharan, S.²

45
- 0141702107
- Feature and score normalization for speaker verification of cellular data
- Hong Kong, China, Apr
- C. Barras and J.-L. Gauvain, "Feature and score normalization for speaker verification of cellular data," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. II, Hong Kong, China, Apr. 2003, pp. 49-52.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 49-52
- Barras, C.¹ Gauvain, J.-L.²

46
- 84865749221
- Speaker Diarization from Speech Transcripts
- Jeju Island, Korea, Oct
- L. Canseco-Rodriguez, L. Lamel, and J.-L. Gauvain, "Speaker Diarization from Speech Transcripts," in Proc. Int. Conf. Spoken Language Process., Jeju Island, Korea, Oct. 2004, pp. 1272-1275.
- (2004) Proc. Int. Conf. Spoken Language Process , pp. 1272-1275
- Canseco-Rodriguez, L.¹ Lamel, L.² Gauvain, J.-L.³

47
- 33947677676
- Who really spoke when? - Finding speaker turns and identities in audio
- Toulouse, France, May
- S. E. Tranter, "Who really spoke when? - Finding speaker turns and identities in audio," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. I, Toulouse, France, May 2006, pp. 1013-1016.
- (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 1013-1016
- Tranter, S.E.¹

48
- 34047258805
- Progress in the CU-HTK transcription system
- Sep
- M. J. F. Gales, D. Y. Kim, P. C. Woodland, H. Y. Chan, D. Mrva, R. Sinha, and S. E. Tranter, "Progress in the CU-HTK transcription system," IEEE Trans. Audio, Speech, Lang, Process., vol. 14, no. 5, pp. 1511-1523, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang, Process , vol.14 , Issue.5 , pp. 1511-1523
- Gales, M.J.F.¹ Kim, D.Y.² Woodland, P.C.³ Chan, H.Y.⁴ Mrva, D.⁵ Sinha, R.⁶ Tranter, S.E.⁷

49
- 4544361649
- The ELISA consortium approaches in speaker segmentation during the NIST 2003 Rich Transcription evaluation
- Montreal, QC, Canada, May
- D. Moraru, S. Meignier, C. Fredouille, L. Besacier, and J.-F. Donastre, "The ELISA consortium approaches in speaker segmentation during the NIST 2003 Rich Transcription evaluation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. 1, Montreal, QC, Canada, May 2004, pp. 373-376.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 373-376
- Moraru, D.¹ Meignier, S.² Fredouille, C.³ Besacier, L.⁴ Donastre, J.-F.⁵

50
- 33646790196
- Two-way cluster voting to improve speaker diarization performance
- Philadelphia, PA, Mar
- S. E. Tranter, "Two-way cluster voting to improve speaker diarization performance," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. I, Philadelphia, PA, Mar. 2005, pp. 753-756.
- (2005) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 753-756
- Tranter, S.E.¹

51
- 33745212266
- Online speaker adaptation and tracking for real-time speech recognition
- Lisbon, Portugal, Sep
- D. Liu, D. Kiecza, A. Srivastava, and F. Kubala, "Online speaker adaptation and tracking for real-time speech recognition," in Proc. Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 281-284.
- (2005) Proc. Eur. Conf. Speech Commun. Technol , pp. 281-284
- Liu, D.¹ Kiecza, D.² Srivastava, A.³ Kubala, F.⁴

52
- 33745186465
- Results of the fall 2004 STT and MDE evaluation
- Palisades, NY, Nov
- J. G. Fiscus, J. S. Garofolo, A. Le, A. F. Martin, D. S. Pallett, M. A. Przybocki, and G. Sanders, "Results of the fall 2004 STT and MDE evaluation," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), Palisades, NY, Nov. 2004.
- (2004) Proc. Fall 2004 Rich Transcription Workshop (RT-04)
- Fiscus, J.G.¹ Garofolo, J.S.² Le, A.³ Martin, A.F.⁴ Pallett, D.S.⁵ Przybocki, M.A.⁶ Sanders, G.⁷

53
- 85009080849
- Speaker segmentation and clustering in meetings
- Montreal, QC, Canada, May, Online, Available
- Q. Jin, K. Laskowski, T. Schultz, and A. Waibel, "Speaker segmentation and clustering in meetings," in Proc. ICASSP Meeting Recognition Workshop, Montreal, QC, Canada, May 2004, [Online]. Available: http://isl.ira.uka.de/publications/SchultzJin_NIST04.pdf.
- (2004) Proc. ICASSP Meeting Recognition Workshop
- Jin, Q.¹ Laskowski, K.² Schultz, T.³ Waibel, A.⁴

54
- 33745186675
- Broadcast news speaker tracking for ESTER 2005 campaign
- Lisbon, Portugal, Sep
- D. Istrate, N. Scheffler, C. Fredouille, and J.-F. Bonastre, "Broadcast news speaker tracking for ESTER 2005 campaign," in Proc. Eur. Conf. Speech Commun. Technol., Lisbon, Portugal, Sep. 2005, pp. 2445-2448.
- (2005) Proc. Eur. Conf. Speech Commun. Technol , pp. 2445-2448
- Istrate, D.¹ Scheffler, N.² Fredouille, C.³ Bonastre, J.-F.⁴

55
- 0002871462
- Integrated technologies for indexing spoken language
- Feb
- F. Kubala, S. Colbath, D. Liu, A. Srivastava, and J. Makhoul, "Integrated technologies for indexing spoken language," Commun. ACM, vol. 43, no. 2, pp. 48-56, Feb. 2000.
- (2000) Commun. ACM , vol.43 , Issue.2 , pp. 48-56
- Kubala, F.¹ Colbath, S.² Liu, D.³ Srivastava, A.⁴ Makhoul, J.⁵

56
- 85008020310
- Speechfind: Advances in spoken document retrieval for a national gallery of the spoken word
- Sep
- J. H. L. Hansen, R. Huang, B. Z. M. Seadle, J. J. R. Deller, A. R. Gurijala, M. Kurimo, and P. Angkititrakul, "Speechfind: Advances in spoken document retrieval for a national gallery of the spoken word," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 712-730, Sep. 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.5 , pp. 712-730
- Hansen, J.H.L.¹ Huang, R.² Seadle, B.Z.M.³ Deller, J.J.R.⁴ Gurijala, A.R.⁵ Kurimo, M.⁶ Angkititrakul, P.⁷

57
- 33646807492
- Alize: A free toolkit for speaker recogntion
- Philadelphia, PA, Mar
- J. F. Bonastre, F. Wils, and S. Meignier, "Alize: A free toolkit for speaker recogntion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. I, Philadelphia, PA, Mar. 2005, pp. 737-740.
- (2005) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 737-740
- Bonastre, J.F.¹ Wils, F.² Meignier, S.³

58
- 0141744710
- The superSID project: Exploiting high-level information for high-accuracy speaker recognition
- Hong Kong, China, Apr
- D. Reynolds, W. Andrews, J. Campbell, J. Navratil, B. Peskin, A. Adami, Q. Jin, D. Klusacek, J. Abramson, R. Mihaescu, J. Godfrey, D. Jones, and B. Xiang, "The superSID project: Exploiting high-level information for high-accuracy speaker recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., vol. IV, Hong Kong, China, Apr. 2003, pp. 784-787.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.4 , pp. 784-787
- Reynolds, D.¹ Andrews, W.² Campbell, J.³ Navratil, J.⁴ Peskin, B.⁵ Adami, A.⁶ Jin, Q.⁷ Klusacek, D.⁸ Abramson, J.⁹ Mihaescu, R.¹⁰ Godfrey, J.¹¹ Jones, D.¹² Xiang, B.¹³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.