SCOPUS Information Retrieval Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 4625 LNCS, 2008, Pages 543-553

Speaker diarization for conference room: The UPC RT07s evaluation system

(4) Luque, Jordi (a); Anguera, Xavier (b); Temko, Andrey (a); Hernando, Javier (a)

(a) UNIVERSITAT POLITÈCNICA DE CATALUNYA (Spain)

(b) Multilinguism Group (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

COSINE TRANSFORMS; DISCRETE COSINE TRANSFORMS; SPEECH PROCESSING; SPEECH RECOGNITION; SUPPORT VECTOR MACHINES; TRANSCRIPTION; VITERBI ALGORITHM;

AGGLOMERATIVE CLUSTERING; BAYESIAN CRITERION; COMPLEXITY SELECTION; DISCRETE COSINES; EVALUATION SYSTEMS; FILTER BANKS; FREQUENCY FILTERING; HEIDELBERG (CO); INTERNATIONAL (CO); MEL-CEPSTRUM ANALYSIS; MODULE BASED; MULTI-MODAL; POST-PROCESSING; SPEAKER DIARIZATION; SPEECH DETECTION; VITERBI SEGMENTATION; WIENER FILTERING;

SPEECH;

EID: 47749127366 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-68585-2_50 Document Type: Conference Paper

Times cited: 16

References (20)

1
- 42549103165
- NIST
- NIST: Rich transcription meeting recognition evaluation plan. RT-07s (2007)
- (2007) Rich transcription meeting recognition evaluation plan. RT-07s

2
- 44949197897
- Anguera, X., Wooters, C., Hernando, J.: Robust speaker diarization for meetings: ICSI RT06s evaluation system. In: ICSLP (2006)

3
- 85128356454
- Partitioning and transcription of broadcast news data
- Gauvain, J., Lamel, L., Adda, G.: Partitioning and transcription of broadcast news data. In: ICSLP, pp. 1335-1338 (1998)
- (1998) ICSLP , pp. 1335-1338
- Gauvain, J.¹ Lamel, L.² Adda, G.³

4
- 29044446864
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- Chen, S., Gopalakrishnan, P.: Speaker, environment and channel change detection and clustering via the Bayesian information criterion. In: DARPA BNTU Workshop (1998)
- (1998) DARPA BNTU Workshop
- Chen, S.¹ Gopalakrishnan, P.²

5
- 0026400244
- ICASSP 1991
- Gish, H., Siu, M., Rohlicek, R.: Segregation of speakers for speech recognition and speaker identification. In: ICASSP (1991)
- Segregation of speakers for speech recognition and speaker identification
- Gish, H.¹ Siu, M.² Rohlicek, R.³

6
- 85009231870
- Qualcomm-ICSI-OGI features for ASR
- Adami, A., et al.: Qualcomm-ICSI-OGI features for ASR. In: ICSLP, pp. 21-24 (2002)
- (2002) ICSLP , pp. 21-24
- Adami, A.¹

7
- 47749153688
- Anguera, X.: The acoustic robust beamforming toolkit (2005)
- (2005) The acoustic robust beamforming toolkit
- Anguera, X.¹

8
- 34547526911
- Enhanced SVM Training for Robust Speech Activity Detection
- Temko, A., Macho, D., Nadeu, C.: Enhanced SVM Training for Robust Speech Activity Detection. In: Proc. ICASSP (2007)
- (2007) Proc. ICASSP
- Temko, A.¹ Macho, D.² Nadeu, C.³

9
- 0031221099
- Filtering the time sequence of spectral parameters for speech recognition
- Nadeu, C., Paches-Leal, P., Juang, B.H.: Filtering the time sequence of spectral parameters for speech recognition. Speech Communication 22, 315-332 (1997)
- (1997) Speech Communication , vol.22 , pp. 315-332
- Nadeu, C.¹ Paches-Leal, P.² Juang, B.H.³

10
- 0022352370
- Computer-steered microphone arrays for sound transduction in large rooms
- Flanagan, J., Johnson, J., Kahn, R., Elko, G.: Computer-steered microphone arrays for sound transduction in large rooms. ASAJ 78(5), 1508-1518 (1985)
- (1985) ASAJ , vol.78 , Issue.5 , pp. 1508-1518
- Flanagan, J.¹ Johnson, J.² Kahn, R.³ Elko, G.⁴

11
- 0016990291
- The generalized correlation method for estimation of time delay
- Knapp, C., Carter, G.: The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustic, Speech and Signal Processing 24(4), 320-327 (1976)
- (1976) IEEE Transactions on Acoustic, Speech and Signal Processing , vol.24 , Issue.4 , pp. 320-327
- Knapp, C.¹ Carter, G.²

12
- 0004094721
- MIT Press, Cambridge
- Schölkopf, B., Smola, A.: Learning with Kernels. MIT Press, Cambridge (2002)
- (2002) Learning with Kernels
- Schölkopf, B.¹ Smola, A.²

13
- 0035789613
- Proximal Support Vector Machine Classifiers
- Fung, G., Mangasarian, O.: Proximal Support Vector Machine Classifiers. In: Proc. KDDM, pp. 77-86 (2001)
- (2001) Proc. KDDM , pp. 77-86
- Fung, G.¹ Mangasarian, O.²

14
- 10044256273
- SVM Training Time Reduction using Vector Quantization
- Lebrun, G., Charrier, C., Cardot, H.: SVM Training Time Reduction using Vector Quantization. In: Proc. ICPR, pp. 160-163 (2004)
- (2004) Proc. ICPR , pp. 160-163
- Lebrun, G.¹ Charrier, C.² Cardot, H.³

15
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Davis, S.B., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions ASSP (28), 357-366 (1980)
- (1980) IEEE Transactions ASSP , vol.28 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

16
- 47749106759
- The same book
- Luque, J., Hernando, J.: Robust Speaker Identification for Meetings: UPC CLEAR-07 Meeting Room Evaluation System. In: The same book (2007)
- (2007) Robust Speaker Identification for Meetings: UPC CLEAR-07 Meeting Room Evaluation System
- Luque, J.¹ Hernando, J.²

17
- 0034817674
- Time and Frequency Filtering of Filter-Bank Energies for Robust Speech Recognition
- Nadeu, C., Macho, D., Hernando, J.: Time and Frequency Filtering of Filter-Bank Energies for Robust Speech Recognition. Speech Communication 34, 93-114 (2001)
- (2001) Speech Communication , vol.34 , pp. 93-114
- Nadeu, C.¹ Macho, D.² Hernando, J.³

18
- 47749136545
- On the interaction between time and frequency filtering of speech parameters for robust speech recognition
- Macho, D., Nadeu, C.: On the interaction between time and frequency filtering of speech parameters for robust speech recognition. In: ICSLP, 1137 (1999)
- (1999) ICSLP , pp. 1137
- Macho, D.¹ Nadeu, C.²

19
- 47749124151
- Anguera, X., Hernando, J., Anguita, J.: Xbic: nueva medida para segmentación de locutor hacia el indexado automático de la señal de voz. JTH, 237-242 (2004)

20
- 0005486615
- On the Decorrelation of filter-Bank Energies in Speech Recognition
- Nadeu, C., Hernando, J., Gorricho, M.: On the Decorrelation of filter-Bank Energies in Speech Recognition. In: EuroSpeech, vol. 20, p. 417 (1995)
- (1995) EuroSpeech , vol.20 , pp. 417
- Nadeu, C.¹ Hernando, J.² Gorricho, M.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.