메뉴 건너뛰기




Volumn 4625 LNCS, Issue , 2008, Pages 543-553

Speaker diarization for conference room: The UPC RT07s evaluation system

Author keywords

[No Author keywords available]

Indexed keywords

COSINE TRANSFORMS; DISCRETE COSINE TRANSFORMS; SPEECH PROCESSING; SPEECH RECOGNITION; SUPPORT VECTOR MACHINES; TRANSCRIPTION; VITERBI ALGORITHM;

EID: 47749127366     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-68585-2_50     Document Type: Conference Paper
Times cited : (16)

References (20)
  • 2
    • 44949197897 scopus 로고    scopus 로고
    • Anguera, X., Wooters, C., Hernando, J.: Robust speaker diarization for meetings: Icsi rt06s evaluation system. In: ICSLP (2006)
    • Anguera, X., Wooters, C., Hernando, J.: Robust speaker diarization for meetings: Icsi rt06s evaluation system. In: ICSLP (2006)
  • 3
    • 85128356454 scopus 로고    scopus 로고
    • Partitioning and transcription of broadcast news data
    • Gauvain, J., Lamel, L., Adda, G.: Partitioning and transcription of broadcast news data. In: ICSLP, pp. 1335-1338 (1998)
    • (1998) ICSLP , pp. 1335-1338
    • Gauvain, J.1    Lamel, L.2    Adda, G.3
  • 4
    • 29044446864 scopus 로고    scopus 로고
    • Speaker, environment and channel change detection and clustering via the bayesian information criterion
    • Chen, S., Gopalakrishnan, P.: Speaker, environment and channel change detection and clustering via the bayesian information criterion. In: DARPA BNTU Workshop (1998)
    • (1998) DARPA BNTU Workshop
    • Chen, S.1    Gopalakrishnan, P.2
  • 6
    • 85009231870 scopus 로고    scopus 로고
    • Qualcomm-icsi-cgi features for asr
    • Adami, A., et al.: Qualcomm-icsi-cgi features for asr. In: ICSLP, pp. 21-24 (2002)
    • (2002) ICSLP , pp. 21-24
    • Adami, A.1
  • 8
    • 34547526911 scopus 로고    scopus 로고
    • Enhanced SVM Training for Robust Speech Activity Detection
    • Temko, A., Macho, D., Nadeu, C.: Enhanced SVM Training for Robust Speech Activity Detection. In: Proc. ICCASP (2007)
    • (2007) Proc. ICCASP
    • Temko, A.1    Macho, D.2    Nadeu, C.3
  • 9
    • 0031221099 scopus 로고    scopus 로고
    • Filtering the time sequence of spectral parameters for speech recognition
    • Nadeu, C., Paches-Leal, P., Juang, B.H.: Filtering the time sequence of spectral parameters for speech recognition. Speech Communication 22, 315-332 (1997)
    • (1997) Speech Communication , vol.22 , pp. 315-332
    • Nadeu, C.1    Paches-Leal, P.2    Juang, B.H.3
  • 10
    • 0022352370 scopus 로고
    • Computer-steered microphone arrays for sound transduction in large rooms
    • Flanagan, J., Johnson, J., Kahn, R., Elko, G.: Computer-steered microphone arrays for sound transduction in large rooms. ASAJ 78(5), 1508-1518 (1985)
    • (1985) ASAJ , vol.78 , Issue.5 , pp. 1508-1518
    • Flanagan, J.1    Johnson, J.2    Kahn, R.3    Elko, G.4
  • 13
    • 0035789613 scopus 로고    scopus 로고
    • Proximal Support Vector Machine Classifiers
    • Fung, G., Mangasarian, O.: Proximal Support Vector Machine Classifiers. In: Proc. KDDM, pp. 77-86 (2001)
    • (2001) Proc. KDDM , pp. 77-86
    • Fung, G.1    Mangasarian, O.2
  • 14
    • 10044256273 scopus 로고    scopus 로고
    • SVM Training Time Reduction using Vector Quantization
    • Lebrun, G., Charrier, C., Cardot, H.: SVM Training Time Reduction using Vector Quantization. In: Proc. ICPR, pp. 160-163 (2004)
    • (2004) Proc. ICPR , pp. 160-163
    • Lebrun, G.1    Charrier, C.2    Cardot, H.3
  • 15
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis, S.B., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions ASSP (28), 357-366 (1980)
    • (1980) IEEE Transactions ASSP , vol.28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 17
    • 0034817674 scopus 로고    scopus 로고
    • Time and Frequency Filtering of Filter-Bank Energies for Robust Speech Recognition
    • Nadeu, C., Macho, D., Hernando, J.: Time and Frequency Filtering of Filter-Bank Energies for Robust Speech Recognition. Speech Communication 34, 93-114 (2001)
    • (2001) Speech Communication , vol.34 , pp. 93-114
    • Nadeu, C.1    Macho, D.2    Hernando, J.3
  • 18
    • 47749136545 scopus 로고    scopus 로고
    • On the interaction between time and frequency filterinf of speech parameters for robust speech recognition
    • Macho, D., Nadeu, C.: On the interaction between time and frequency filterinf of speech parameters for robust speech recognition. In: ICSLP, 1137 (1999)
    • (1999) ICSLP , vol.1137
    • Macho, D.1    Nadeu, C.2
  • 19
    • 47749124151 scopus 로고    scopus 로고
    • Anguera, X., Hernando, J., Anguita, J.: Xbic: nueva medida para segmentación de locutor hacia el indexado automático de la señal de voz. JTH, 237-242 (2004)
    • Anguera, X., Hernando, J., Anguita, J.: Xbic: nueva medida para segmentación de locutor hacia el indexado automático de la señal de voz. JTH, 237-242 (2004)
  • 20
    • 0005486615 scopus 로고
    • On the Decorrelation of filter-Bank Energies in Speech Recognition
    • Nadeu, C., Hernando, J., Gorricho, M.: On the Decorrelation of filter-Bank Energies in Speech Recognition. In: EuroSpeech, vol. 20, p. 417 (1995)
    • (1995) EuroSpeech , vol.20 , pp. 417
    • Nadeu, C.1    Hernando, J.2    Gorricho, M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.