메뉴 건너뛰기




Volumn 20, Issue 2, 2012, Pages 404-413

Large-Scale Speaker Diarization for Long Recordings and Small Collections

Author keywords

Collection wide diarization; information retrieval; large scale diarization; speaker detection; speaker diarization

Indexed keywords


EID: 85008566664     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2011.2162320     Document Type: Article
Times cited : (28)

References (25)
  • 2
    • 0033738539 scopus 로고    scopus 로고
    • The NIST speaker recognition evaluation--Overview, methodology, systems, results, perspective
    • G. R. Doddington, M. A. Przybocki, A. F. Martin, and D. A. Reynolds, “The NIST speaker recognition evaluation--Overview, methodology, systems, results, perspective,” Speech Commun., vol. 31, pp. 225–254, 2000.
    • (2000) Speech Commun. , vol.31 , pp. 225-254
    • Doddington, G.R.1    Przybocki, M.A.2    Martin, A.F.3    Reynolds, D.A.4
  • 3
    • 47749119617 scopus 로고    scopus 로고
    • The ICSI RT07s speaker diariza-tion system
    • Berlin, Germany: Springer-Verlag, Lecture Notes in Computer Science
    • C. Wooters and M. Huijbregts, “The ICSI RT07s speaker diariza-tion system,” in Multimodal Technologies for Perception of Humans. Berlin, Germany: Springer-Verlag, 2008, vol. 4625, Lecture Notes in Computer Science.
    • (2008) Multimodal Technologies for Perception of Humans , vol.4625
    • Wooters, C.1    Huijbregts, M.2
  • 4
    • 77249176190 scopus 로고    scopus 로고
    • The AMI speaker diarization system for NIST RT06s meeting data
    • Berlin, Germany: Springer-Verlag, October, Lecture Notes in Computer Science
    • D. van Leeuwen and M. Huijbregts, “The AMI speaker diarization system for NIST RT06s meeting data,” in Machine Learning for Multimodal Interaction (MLMI). Berlin, Germany: Springer-Verlag, October 2007, vol. 4299, Lecture Notes in Computer Science, pp. 371–384.
    • (2007) Machine Learning for Multimodal Interaction (MLMI) , vol.4299 , pp. 371-384
    • van Leeuwen, D.1    Huijbregts, M.2
  • 5
    • 72449169479 scopus 로고    scopus 로고
    • Segmentation, diarization and speech transcription: surprise data unraveled
    • Enschede, The Netherlands, Nov.
    • M. Huijbregts, “Segmentation, diarization and speech transcription: surprise data unraveled,” Ph. D. dissertation, Enschede, The Netherlands, Nov. 2008.
    • (2008) Ph. D. dissertation
    • Huijbregts, M.1
  • 6
    • 0033901151 scopus 로고    scopus 로고
    • The NIST 1999 speaker recognition evaluation-an overview
    • A. Martin and M. Przybocki, “The NIST 1999 speaker recognition evaluation-an overview,” Digital Signal Process., vol. 10, no. 1-3, pp. 1–18, 2000.
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 1-18
    • Martin, A.1    Przybocki, M.2
  • 7
    • 0033872977 scopus 로고    scopus 로고
    • Approaches to speaker detection and tracking in conversational speech
    • R. B. Dunn, D. A. Reynolds, and T. F. Quatieri, “Approaches to speaker detection and tracking in conversational speech,” Digital Signal Process., vol. 10, no. 1-3, pp. 93–112, 2000.
    • (2000) Digital Signal Process. , vol.10 , Issue.1-3 , pp. 93-112
    • Dunn, R.B.1    Reynolds, D.A.2    Quatieri, T.F.3
  • 9
    • 38049096154 scopus 로고    scopus 로고
    • A simple but effective approach to speaker tracking in broadcast news
    • L. J. Rodriguez, M. Penagarikano, and G. Bordel, “A simple but effective approach to speaker tracking in broadcast news,” Girona, Spain, 2007, vol. 4478, pp. 48–55.
    • (2007) Girona, Spain , vol.4478 , pp. 48-55
    • Rodriguez, L.J.1    Penagarikano, M.2    Bordel, G.3
  • 10
    • 44649092738 scopus 로고    scopus 로고
    • A system for speaker detection and tracking in audio broadcast news
    • J. Zibert, B. Vesnicer, and F. Mihelic, “A system for speaker detection and tracking in audio broadcast news,” Informatica (Slovenia), vol. 32, no. 1, pp. 51–61, 2008.
    • (2008) Informatica (Slovenia) , vol.32 , Issue.1 , pp. 51-61
    • Zibert, J.1    Vesnicer, B.2    Mihelic, F.3
  • 11
    • 78649270455 scopus 로고    scopus 로고
    • Diarization of telephone conversations using factor analysis
    • Dec.
    • P. Kenny, D. Reynolds, and F. Castaldo, “Diarization of telephone conversations using factor analysis,” IEEE J. Sei. Topics Signal Process., vol. 4, no. 6, pp. 1059–1070, Dec. 2010.
    • (2010) IEEE J. Sei. Topics Signal Process. , vol.4 , Issue.6 , pp. 1059-1070
    • Kenny, P.1    Reynolds, D.2    Castaldo, F.3
  • 14
    • 56149122874 scopus 로고    scopus 로고
    • Filtering the unknown: Speech activity detection in heterogeneous video collections
    • Aug.
    • M. Huijbregts, C. Wooters, and R. Ordelman, “Filtering the unknown: Speech activity detection in heterogeneous video collections,” in Proc. Interspeech, Antwerp, Belgium, Aug. 2007.
    • (2007) Proc. Interspeech, Antwerp, Belgium
    • Huijbregts, M.1    Wooters, C.2    Ordelman, R.3
  • 15
    • 77955821835 scopus 로고    scopus 로고
    • The rich transcription 2007 meeting recognition evaluation
    • Berlin, Germany: Springer-Verlag, Lecture Notes in Computer Science
    • J. G. Fiscus, J. Ajot, and J. S. Garofolo, “The rich transcription 2007 meeting recognition evaluation,” in Multimodal Technologies for Perception ofHumans. Berlin, Germany: Springer-Verlag, 2008, Lecture Notes in Computer Science.
    • (2008) Multimodal Technologies for Perception ofHumans
    • Fiscus, J.G.1    Ajot, J.2    Garofolo, J.S.3
  • 17
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • G. Schwartz, “Estimating the dimension of a model,” Ann. Statist., vol. 6, no. 2, pp. 461–464, 1978.
    • (1978) Ann. Statist. , vol.6 , Issue.2 , pp. 461-464
    • Schwartz, G.1
  • 21
    • 70450183185 scopus 로고    scopus 로고
    • Results of the n-best 2008 dutch speech recognition evaluation
    • Brighton, U. K., Sep.
    • D. van Leeuwen, J. Kessens, E. Sanders, and H. van den Heuvel, “Results of the n-best 2008 dutch speech recognition evaluation,” in Proc. Interspeech, Brighton, U. K., Sep. 2009.
    • (2009) Proc. Interspeech
    • van Leeuwen, D.1    Kessens, J.2    Sanders, E.3    van den Heuvel, H.4
  • 25
    • 44849123928 scopus 로고    scopus 로고
    • Robust speaker diarization for meetings
    • Ph. D. dissertation, Univ. Politecnica De Catalunya
    • X. Anguera, “Robust speaker diarization for meetings,” Ph. D. dissertation, Univ. Politecnica De Catalunya, 2006.
    • (2006)
    • Anguera, X.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.