SCOPUS 정보 검색 플랫폼

2007 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2007, Proceedings

Volumn , Issue , 2007, Pages 683-686

Efficient use of overlap information in speaker diarization

(2) Otterson, Scott a Ostendorf, Mari a

a UNIVERSITY OF WASHINGTON (United States)

Author keywords

Diarization; Localization; Overlap; Speaker identification

Indexed keywords

CROSS CORRELATIONS; DIARIZATION; LOCALIZATION; OVERLAP; OVERLAP DETECTIONS; SPEAKER CLUSTERING; SPEAKER DIARIZATION; SPEAKER IDENTIFICATION;

SPEECH RECOGNITION;

EID: 44849101173 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/asru.2007.4430194 Document Type: Conference Paper

Times cited : (45)

References (23)

1
- 33745525361
- The rich transcription 2004 spring meeting recognition evaluation
- J. S. Garofolo, C. D. Laprun, and J. G. Fiscus, "The rich transcription 2004 spring meeting recognition evaluation," in NIST Meeting Recognition Workshop, 2004.
- (2004) NIST Meeting Recognition Workshop
- Garofolo, J.S.¹ Laprun, C.D.² Fiscus, J.G.³

2
- 85009145345
- Observations on overlap: Findings and implications for automatic processing of multi-party conversation
- E. Shriberg, A. Stoicke, and D. Baron, "Observations on overlap: Findings and implications for automatic processing of multi-party conversation," in Eurospeech, 2001.
- (2001) Eurospeech
- Shriberg, E.¹ Stoicke, A.² Baron, D.³

3
- 44849093045
- Progress in the AMIDA speaker diarization system for meeting data
- D. A. van Leeuwen and M. Konecny, "Progress in the AMIDA speaker diarization system for meeting data," in NIST RT07 Workshop, 2007.
- (2007) NIST RT07 Workshop
- van Leeuwen, D.A.¹ Konecny, M.²

4
- 4444262052
- From blind source separation to blind source cancellation in the underdetermined case: A new approach based on time-frequency analysis
- F. Abrard, Y. Deville, and P. White, "From blind source separation to blind source cancellation in the underdetermined case: A new approach based on time-frequency analysis," in Independent Component Analysis and Blind Signal Separation, Intl. Conf. on, 2001.
- (2001) Independent Component Analysis and Blind Signal Separation, Intl. Conf. on
- Abrard, F.¹ Deville, Y.² White, P.³

5
- 0003371952
- Amplitude modulation decorrelation for convolutive blind source separation
- J. Anemüller and B. Kollmeier, "Amplitude modulation decorrelation for convolutive blind source separation," in Independent component analysis and blind signal separation, Intl. Conf. on, pp. 215-220, 2000.
- (2000) Independent component analysis and blind signal separation, Intl. Conf. on , pp. 215-220
- Anemüller, J.¹ Kollmeier, B.²

6
- 0031352137
- Blind source separation and deconvolution by dynamic component analysis
- H. Attias and C. E. Schreiner, "Blind source separation and deconvolution by dynamic component analysis," Neural Networks for Signal Processing, vol. VII, pp. 456-465, 1997.
- (1997) Neural Networks for Signal Processing , vol.7 , pp. 456-465
- Attias, H.¹ Schreiner, C.E.²

7
- 0031145298
- Multichannel speech separation by eigendecomposition and its application to co-talker interference removal
- Y. Cao, S. Sridharan, and M. Moody, "Multichannel speech separation by eigendecomposition and its application to co-talker interference removal," IEEE Trans. on Speech and Audio Proc., vol. 5, no. 3, pp. 209-219, 1997.
- (1997) IEEE Trans. on Speech and Audio Proc , vol.5 , Issue.3 , pp. 209-219
- Cao, Y.¹ Sridharan, S.² Moody, M.³

8
- 0034848298
- Fundamental limitation of frequency domain blind source separation for convolved mixture of speech
- S. Araki, S. Makino, R. Mukai, T. Nishikawa, and H. Saruwatari, "Fundamental limitation of frequency domain blind source separation for convolved mixture of speech," in Independent Components Analysis, Intl. Conf. on, 2001.
- (2001) Independent Components Analysis, Intl. Conf. on
- Araki, S.¹ Makino, S.² Mukai, R.³ Nishikawa, T.⁴ Saruwatari, H.⁵

9
- 84892176648
- Combining time-delayed decorrelation and ICA: Towards solving the cocktail party problem
- T.-W. Lee, A. Ziehe, R. Orglmeister, and T. Sejnowski, "Combining time-delayed decorrelation and ICA: Towards solving the cocktail party problem," in Proc. ICASSP, vol. 2, pp. 1249-1253, 1998.
- (1998) Proc. ICASSP , vol.2 , pp. 1249-1253
- Lee, T.-W.¹ Ziehe, A.² Orglmeister, R.³ Sejnowski, T.⁴

10
- 0011990786
- The meeting project at ICSI
- N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin, T. Pfau, E. Shriberg, and A. Stolcke, "The meeting project at ICSI," in Proc., Human Language Technology Conf., 2001.
- (2001) Proc., Human Language Technology Conf
- Morgan, N.¹ Baron, D.² Edwards, J.³ Ellis, D.⁴ Gelbart, D.⁵ Janin, A.⁶ Pfau, T.⁷ Shriberg, E.⁸ Stolcke, A.⁹

11
- 0035280043
- A comparison of auditory and blind separation techniques for speech segregaton
- A. W. van der Kouwe, D. Wang, and G. J. Brown, "A comparison of auditory and blind separation techniques for speech segregaton," IEEE Trans. on Speech and Audio Proc., vol. 9, pp. 189-195, 2000.
- (2000) IEEE Trans. on Speech and Audio Proc , vol.9 , pp. 189-195
- van der Kouwe, A.W.¹ Wang, D.² Brown, G.J.³

12
- 0034843120
- Use of local kurtosis measure for spotting usable speech segments in cochannel speech
- K. R. Krishnamachari, R. E. Yantorno, J. M. Lovekin, D. S. Benincasa, and S. J. Wendt, "Use of local kurtosis measure for spotting usable speech segments in cochannel speech," in Proc. ICASSP, vol. 1, pp. 649-652, 2001.
- (2001) Proc. ICASSP , vol.1 , pp. 649-652
- Krishnamachari, K.R.¹ Yantorno, R.E.² Lovekin, J.M.³ Benincasa, D.S.⁴ Wendt, S.J.⁵

13
- 44849126533
- Adjacent pitch period comparison (APPC) as a usability measure of speech segments under co-channel conditions
- J. Lovekin, K. R. Krishnanmachari, and R. E. Yantorno, "Adjacent pitch period comparison (APPC) as a usability measure of speech segments under co-channel conditions," in ISPACS, 2001.
- (2001) ISPACS
- Lovekin, J.¹ Krishnanmachari, K.R.² Yantorno, R.E.³

14
- 0037856369
- Usable speech detection using the modified spectral autocorrelation peak to valley ratio using the LPC residual
- N. Chandra and R. E. Yantorno, "Usable speech detection using the modified spectral autocorrelation peak to valley ratio using the LPC residual," in Intl. Conf, Signal and Image Proc., 2002.
- (2002) Intl. Conf, Signal and Image Proc
- Chandra, N.¹ Yantorno, R.E.²

15
- 11144286121
- The spectral autocorrelation peak valley ratio (SAPVR) - a usable speech measure employed as a co-channel detection system
- R. E. Yantorno, K. R. Krishnamachari, D. S. Benincasa, J. M. Lovekin, and S. J. Wenndt, "The spectral autocorrelation peak valley ratio (SAPVR) - a usable speech measure employed as a co-channel detection system," in IEEE Intl. Workshop on Intelligent Signal Processing, 2001.
- (2001) IEEE Intl. Workshop on Intelligent Signal Processing
- Yantorno, R.E.¹ Krishnamachari, K.R.² Benincasa, D.S.³ Lovekin, J.M.⁴ Wenndt, S.J.⁵

16
- 11144232847
- Speech and crosstalk detection in multi-channel audio
- S. N. Wrigley, G. J. Brown, V. Wan, and S. Renals, "Speech and crosstalk detection in multi-channel audio," IEEE Trans., Speech and Audio Proc., vol. 13, no. 1, pp. 84-91, 2005.
- (2005) IEEE Trans., Speech and Audio Proc , vol.13 , Issue.1 , pp. 84-91
- Wrigley, S.N.¹ Brown, G.J.² Wan, V.³ Renals, S.⁴

17
- 0034818605
- Cochannel speaker count labelling based on the use of cepstral and pitch predicton derived features
- M. A. Lewis and R. P. Ramachandran, "Cochannel speaker count labelling based on the use of cepstral and pitch predicton derived features," Pattern Recongnition, vol. 34, pp. 449-507, 2001.
- (2001) Pattern Recongnition , vol.34 , pp. 449-507
- Lewis, M.A.¹ Ramachandran, R.P.²

18
- 44849113330
- Speaker overlap detection with Hough transform pitch features,
- Tech. Rep. 2004-0012, Univesity of Washington
- S. Otterson, S. Furui, and M. Ostendorf, "Speaker overlap detection with Hough transform pitch features," Tech. Rep. 2004-0012, Univesity of Washington, 2004.
- (2004)
- Otterson, S.¹ Furui, S.² Ostendorf, M.³

19
- 0003548585
- DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM.,
- Tech. Rep. NT-STIR 4930, National Institute of Standards and Technology
- J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett, and N. Dahlgren, "DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM.," Tech. Rep. NT-STIR 4930, National Institute of Standards and Technology, 1993.
- (1993)
- Garofolo, J.¹ Lamel, L.² Fisher, W.³ Fiscus, J.⁴ Pallett, D.⁵ Dahlgren, N.⁶

20
- 0033693217
- Multi-source localization in reverberant environments by ROOT-MUSIC and clustering
- E. D. D. Claudio, R. Parisi, and G. Orlandi, "Multi-source localization in reverberant environments by ROOT-MUSIC and clustering," in Proc. ICASSP, vol. 2, pp. 921-924, 2000.
- (2000) Proc. ICASSP , vol.2 , pp. 921-924
- Claudio, E.D.D.¹ Parisi, R.² Orlandi, G.³

21
- 44849118348
- The AMI speaker diarization system for NIST RT06s meeting data
- D. A. van Leeuwen and M. Huijbregts, "The AMI speaker diarization system for NIST RT06s meeting data," in NIST RT06 workshop, 2006.
- (2006) NIST RT06 workshop
- van Leeuwen, D.A.¹ Huijbregts, M.²

22
- 44949197897
- Robust speaker diarization for meetings: ICSI RT06s evaluation system
- X. Anguera, C. Wooters, and J. M. Pardo, "Robust speaker diarization for meetings: ICSI RT06s evaluation system," in Proc. ICSLP, 2006.
- (2006) Proc. ICSLP
- Anguera, X.¹ Wooters, C.² Pardo, J.M.³

23
- 56149094619
- Improved location features for meeting speaker diarization
- S. Otterson, "Improved location features for meeting speaker diarization," in Interspeech, 2007.
- (2007) Interspeech
- Otterson, S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.