SCOPUS Information Search Platform

Volume 2, 2006, Pages 1150-1153

Audio segmentation and speaker localization in meeting videos

Author keywords

[No Author keywords available]

Indexed keywords

FEATURE EXTRACTION; IMAGE RETRIEVAL; IMAGE SEGMENTATION; MULTIMEDIA SYSTEMS; VIDEO STREAMING;

AUDIO SEGMENTATION; CAMERA PANNING; MONOLOGUE DETECTION; VIDEO SOURCES;

SPEECH RECOGNITION;

EID: 34047223614 PISSN: 10514651 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICPR.2006.283 Document Type: Conference Paper

Times cited: 29

References (11)

1
- 0002595416
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- S. Chen and P. Gopalakrishnan. Speaker, environment and channel change detection and clustering via the Bayesian information criterion. In DARPA speech recognition workshop, 1998.
- (1998) DARPA speech recognition workshop
- Chen, S.¹ Gopalakrishnan, P.²

2
- 85009212151
- A sequential metric-based audio segmentation method via the Bayesian information criterion
- S.-S. Cheng and H.-M. Wang. A sequential metric-based audio segmentation method via the Bayesian information criterion. In Proceedings of Eurospeech, pages 945-948, 2003.
- (2003) Proceedings of Eurospeech , pp. 945-948
- Cheng, S.-S.¹ Wang, H.-M.²

3
- 1842830672
- Audio-visual segmentation and "the cocktail party effect"
- T. Darrell, J. Fisher, P. A. Viola, and W. T. Freeman. Audio-visual segmentation and "the cocktail party effect". In ICMI, pages 320, 2000.
- (2000) ICMI , pp. 320
- Darrell, T.¹ Fisher, J.² Viola, P.A.³ Freeman, W.T.⁴

4
- 34047200341
- Rich transcription 2005 spring meeting recognition evaluation
- J. Fiscus, N. Radde, J. Garofolo, A. Le, J. Ajot, and C. Laprun. Rich transcription 2005 spring meeting recognition evaluation. In www.nist.gov/speech/publications/papersrc/rt05sresults.pdf.
- Fiscus, J.¹ Radde, N.² Garofolo, J.³ Le, A.⁴ Ajot, J.⁵ Laprun, C.⁶

5
- 84949961905
- Probabalistic models and informative subspaces for audiovisual correspondence
- J. Fisher and T. Darrell. Probabalistic models and informative subspaces for audiovisual correspondence. In ECCV(3), pages 592-603, 2002.
- (2002) ECCV , vol.3 , pp. 592-603
- Fisher, J.¹ Darrell, T.²

6
- 0009622482
- Audio vision: Using audio-visual synchrony to locate sounds
- J. Hershey and J. R. Movellan. Audio vision: Using audio-visual synchrony to locate sounds. In NIPS, 1999.
- (1999) NIPS
- Hershey, J.¹ Movellan, J.R.²

7
- 24644451644
- Pixels that sound
- E. Kidron, Y. Schechner, and M. Elad. Pixels that sound. In CVPR, pages I: 88-95, 2005.
- (2005) CVPR , vol.I , pp. 88-95
- Kidron, E.¹ Schechner, Y.² Elad, M.³

8
- 20444478554
- Speaker localisation using audio-visual synchrony: An empirical study
- H. J. Nock, G. Iyengar, and C. Neti. Speaker localisation using audio-visual synchrony: An empirical study. In CIVR, 2003.
- (2003) CIVR
- Nock, H.J.¹ Iyengar, G.² Neti, C.³

9
- 85037085294
- Gesture cues for conversational interaction in monocular video
- F. Quek, D. McNeill, R. Ansari, X.-F. Ma, R. Bryll, S. Duncan, and K. E. McCullough. Gesture cues for conversational interaction in monocular video. In RATFG-RTS, 1999.
- (1999) RATFG-RTS
- Quek, F.¹ McNeill, D.² Ansari, R.³ Ma, X.-F.⁴ Bryll, R.⁵ Duncan, S.⁶ McCullough, K.E.⁷

10
- 0034187513
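- Supervised learning of large perceptual organization: Graph spectral partitioning and learning automata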
- S. Sarkar and P. Soundararajan. Supervised learning of large perceptual organization: Graph spectral partitioning and learning automata. 22(5): 504-525, May 2000.
- (2000) Supervised learning of large perceptual organization: Graph spectral partitioning and learning automata , vol.22 , Issue.5 , pp. 504-525
- Sarkar, S.¹ Soundararajan, P.²

11
- 78650540904
- Improved speaker segmentation and segments clustering using the Bayesian information criterion
- A. Tritschler and R. Gopinath. Improved speaker segmentation and segments clustering using the Bayesian information criterion. In Proceedings of Eurospeech, 1999.
- (1999) Proceedings of Eurospeech
- Tritschler, A.¹ Gopinath, R.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.