



Volume 50, Issue 5, 2008, Pages 355-365

Speaker diarization using one-class support vector machines

Author keywords

Kernel Change Detection; One class support vector machine; Speaker diarization; Speaker indexing

Indexed keywords

CLUSTERING ALGORITHMS; COMPUTATIONAL METHODS; COST EFFECTIVENESS; GAUSSIAN DISTRIBUTION; SPEECH ANALYSIS;

EID: 41149119412     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.11.006     Document Type: Article
Times cited: 29

References (33)
  • 1
    • 85009289298
    • Ajmera, J., Bourlard, H., Lapidot, I., Cowan, I.M., 2002. Unknown-multiple speaker clustering using HMM. In: Proc. ICSLP02, Denver, CO, United States of America, September.
  • 2
    • 5844297152
    • Aronszajn N. Theory of reproducing kernels. Trans. Amer. Math. Soc. 68 (1950) 337-404
  • 4
    • 41149127040
    • Ben, M., Betser, M., Bimbot, F., Gravier, G., 2004. Speaker diarization using bottom-up clustering based on a parameter-derived distance between adapted GMMs. In: Proc. ICSLP04, Jeju, Korea, October, pp. 1125-1128.
  • 5
    • 84887001725
    • Canu S., and Smola A. Kernel methods and the exponential family. Proc. ESANN'05 (2005), Brugge, Belgium
  • 6
    • 41149153735
    • Chang, C.-C., Lin, C.-J., 2001. LIBSVM: a library for support vector machines, Department of Computer Science, National Taiwan University. Software available at .
  • 7
    • 41149170538
    • Christensen, H., 2002. Speech recognition using heterogenous information extraction in multi-stream based systems. Ph.D. Thesis, Aalborg University, Denmark.
  • 8
    • 33646950083
    • Davy M., Desobry F., Gretton A., and Doncarli C. An online support vector machine for abnormal events detection. Signal Process. 86 8 (2006) 2009-2025
  • 9
    • 0034273195
    • Delacourt P., and Wellekens C. DISTBIC: a speaker-based segmentation for audio data indexing. Speech Comm. 32 1 (2000) 111-126
  • 10
    • 4544242291
    • Desobry, F., Davy, M., 2004. Dissimilarity measures in feature space. In: Proc. IEEE ICASSP'04, Montreal, Canada.
  • 11
  • 13
    • 0033872977
    • Dunn R., Reynolds D., and Quatieri T. Approaches to speaker detection and tracking in conversational speech. Digital Signal Process. 10 1 (2000) 93-112
  • 14
    • 84862162991
    • Gravier, G., Bonastre, J.-F., Galliano, S., Geoffrois, E., Tait, K.M., Choukri, K., 2004. The ESTER evaluation campaign of rich transcription of French broadcast news. In: Proc. Lang. Evaluation Resources Conf. (LREC2004), Lisbon, Portugal, pp. 885-888.
  • 15
    • 33845951461
    • Hegde R.M. Significance of joint features derived from the modified group delay function in speech processing. EURASIP J. Audio Speech Music Process. (2007) 1-13
  • 16
    • 41149137513
    • Janin, A., Ellis, D., Morgan, N., 1999. Multi-stream speech recognition: ready for prime time. In: Proc. 6th European Conf. on Speech Communication Technology, Budapest, Hungary.
  • 17
    • 41149104768
    • Kartik V., Satish D.S., and Sekhar C.C. Speaker change detection using support vector machines. Proc. NOLISP'05 (2005), Barcelona, Spain
  • 18
    • 27644599375
    • Kwon S., and Narayanan S. Unsupervised speaker indexing using generic models. IEEE Trans. Speech Audio Process. 13 5 (2005) 1004-1013
  • 19
    • 0141814603
    • Liu D., and Kubala F. Online speaker clustering. Proc. IEEE ICASSP'04 (2004), Montreal, Canada
  • 20
    • 41149120936
    • Meignier, S., 2002. Indexation en locuteurs de documents sonores: Segmentation d'un document et appariement d'une collection. Ph.D. Thesis, Université d'Avignon et des Pays de Vaucluse, Laboratoire Informatique d'Avignon.
  • 21
    • 41149098459
    • Meignier, S., Bonastre, J.-F., Fredouille, C., Merlin, T., 2000. Evolutive HMM for speaker tracking systems. In: Proc. IEEE ICASSP'00, Istanbul, Turkey, pp. 1177-1180.
  • 22
    • 85009268831
    • Meignier, S., Bonastre, J.-F., Magrin-Chagnolleau, I., 2002. Speaker utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases. In: Proc. ICSLP 2002, Vol. 1. Denver, CO, United States of America, September, pp. 573-576.
  • 23
    • 29044442235
    • Meignier S., Moraru D., Fredouille C., Bonastre J.-F., and Besacier L. Step-by-step and integrated approaches in broadcast news speaker diarization. Comp. Speech Lang. 20 2-3 (2006) 303-330
  • 24
    • 4544361649
    • Moraru D., Meignier S., Fredouille C., Besacier L., and Bonastre J.-F. The ELISA consortium approaches in broadcast news speaker segmentation during the NIST 2003 rich transcription evaluation. Proc. IEEE ICASSP'04 (2004), Montreal, Canada
  • 25
    • 41149160071
    • Nguyen, P., Junqua, J.-C., 2003. PSTL's speaker diarization. In: Workshop NIST RT03'S Proc.
  • 26
    • 41149129127
    • NIST RT03S The rich transcription spring 2003 (RT-03S) evaluation plan. .
  • 27
    • 0029748333
    • Schmidt, M., Gish, H., 1996. Speaker identification via support vector classifiers. In: Proc. IEEE ICASSP'96, Atlanta, USA.
  • 29
    • 0034843119
    • Seck, M., Magrin-Chagnolleau, I., Bimbot, F., 2001. Experiments on speech tracking in audio documents using Gaussian Mixture Modeling. In: Proc. IEEE Internat. Conf. on Audio, Speech and Signal Processing, Salt Lake City, USA.
  • 30
    • 84889324982
    • Solomonoff, A., Mielke, A., Schmidt, M., Gish, H., 1998. Clustering speakers by their voices. In: Proc. IEEE ICASSP'98.
  • 32
    • 14644412368
    • Wan V., and Renals S. Speaker verification using sequence discriminant support vector machines. IEEE Trans. Speech Audio Process. 13 2 (2005) 203-210
  • 33
    • 41149172472
    • Wooters, C., Fung, J., Peskin, B., Anguera, X., 2004. Toward robust speaker segmentation: The ICSI-SRI fall 2004 diarization system. In: Proc. Fall 2004 Rich Transcription Workshop (RT-04).


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.