SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn II, Issue , 2005, Pages

Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams

(6) Xie, Lexing a Kennedy, Lyndon a Chang, Shih Fu a Divakaran, Ajay b Sun, Huifang b Lin, Ching Yung c

a Columbia University ^* (United States)

b MITSUBISHI ELECTRIC RESEARCH LABORATORIES (United States)

c IBM T J WATSON RESEARCH CENTER (United States)

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV MODEL; MODALITIES; MULTI-MODAL STREAMS;

DATABASE SYSTEMS; MARKOV PROCESSES; MATHEMATICAL MODELS; MODAL ANALYSIS; PROBABILISTIC LOGICS; VIDEO SIGNAL PROCESSING;

PATTERN RECOGNITION;

EID: 33646820043 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2005.1415589 Document Type: Conference Paper

Times cited : (12)

References (11)

1
- 0036874999
- Dynamic bayesian networks for audio-visual speech recognition
- November
- A. Nefian, L. Liang, X. Pi, X. Liu, and K. Murphy, "Dynamic bayesian networks for audio-visual speech recognition," EURASIP, Journal of Applied Signal Processing, vol. 2002, p. 1274, November 2002.
- (2002) EURASIP, Journal of Applied Signal Processing , vol.2002 , pp. 1274
- Nefian, A.¹ Liang, L.² Pi, X.³ Liu, X.⁴ Murphy, K.⁵

2
- 11444253586
- Layered representations for learning and inferring office activity from multiple sensory channels
- (Pittsburgh, PA), October
- N. Oliver, E. Horvitz, and A. Garg, "Layered representations for learning and inferring office activity from multiple sensory channels," in Proc. Int. Conf. on Multimodal Interfaces (ICMI'02), (Pittsburgh, PA), October 2002.
- (2002) Proc. Int. Conf. on Multimodal Interfaces (ICMI'02)
- Oliver, N.¹ Horvitz, E.² Garg, A.³

3
- 1542347786
- Automatic image annotation and retrieval using cross-media relevance models
- (Toronto, Canada), July 28-Aug.-1
- J. Jeon, V. Lavrenko, and R. Manmatha, "Automatic image annotation and retrieval using cross-media relevance models," in Proc. SIGIR-03, (Toronto, Canada), pp. 119-126, July 28-Aug.-1 2003.
- (2003) Proc. SIGIR-03 , pp. 119-126
- Jeon, J.¹ Lavrenko, V.² Manmatha, R.³

4
- 85026972772
- Probabilistic latent semantic indexing
- ACM Press
- T. Hofmann, "Probabilistic latent semantic indexing," in Proc. ACM SIGIR, pp. 50-57, ACM Press, 1999.
- (1999) Proc. ACM SIGIR , pp. 50-57
- Hofmann, T.¹

5
- 84870466817
- NIST, "TREC video retrieval evaluation (TRECVID)," 2001-2004. http://www-nlpir.nist.gov/projects/trecvid/.
- (2001) TREC Video Retrieval Evaluation (TRECVID)

6
- 84858880827
- NIST, "Topic detection and tracking (TDT)," 1998-2004. http://www.nist.gov/speech/tests/tdt/.
- (1998) Topic Detection and Tracking (TDT)

7
- 14644416738
- The IBM semantic concept detection framework
- A. Amir, G. lyengar, C.-Y. Lin, C. Dorai, M. Naphade, A. Natsev, C. Neti, H. Nock, I. Sachdev, J. Smith, Y. Wu, B. Tseng, and D. Zhang, "The IBM semantic concept detection framework," in TRECVID Workshop, 2003.
- (2003) TRECVID Workshop
- Amir, A.¹ Lyengar, G.² Lin, C.-Y.³ Dorai, C.⁴ Naphade, M.⁵ Natsev, A.⁶ Neti, C.⁷ Nock, H.⁸ Sachdev, I.⁹ Smith, J.¹⁰ Wu, Y.¹¹ Tseng, B.¹² Zhang, D.¹³

8
- 8844259704
- Discovery and fusion of salient multi-modal features towards news story segmentation
- January
- W. H.-M. Hsu, S.-F. Chang, C.-W. Huang, L. Kennedy, C.-Y. Lin, and G. lyengar, "Discovery and fusion of salient multi-modal features towards news story segmentation," in SPIE Electronic Imaging, January 2004.
- (2004) SPIE Electronic Imaging
- Hsu, W.H.-M.¹ Chang, S.-F.² Huang, C.-W.³ Kennedy, L.⁴ Lin, C.-Y.⁵ Lyengar, G.⁶

9
- 79959829262
- Generation of sports highlights using a combination of supervised and unsupervised learning in audio domain
- R. Radhakrishan, Z. Xiong, A. Divakaran, and Y. Ishikawa, "Generation of sports highlights using a combination of supervised and unsupervised learning in audio domain," in Proc. Pacific Rim Conference on Multimedia, 2003.
- (2003) Proc. Pacific Rim Conference on Multimedia
- Radhakrishan, R.¹ Xiong, Z.² Divakaran, A.³ Ishikawa, Y.⁴

10
- 8844236394
- ch. 10. Kluwer Academic Publishers
- L. Xie, S.-F. Chang, A. Divakaran, and H. Sun, Unsupervised Mining of Statistical Temporal Structures in Video,ch. 10. Kluwer Academic Publishers, 2003.
- (2003) Unsupervised Mining of Statistical Temporal Structures in Video
- Xie, L.¹ Chang, S.-F.² Divakaran, A.³ Sun, H.⁴

11
- 20444449726
- Discover meaningful multimedia patterns with audio-visual concepts and associated text
- October
- L. Xie, L. Kennedy, S.-F. Chang, A. Divakaran, H. Sun, and C.-Y. Lin, "Discover meaningful multimedia patterns with audio-visual concepts and associated text," in Int. Conf. Image Processing (ICIP), October 2004.
- (2004) Int. Conf. Image Processing (ICIP)
- Xie, L.¹ Kennedy, L.² Chang, S.-F.³ Divakaran, A.⁴ Sun, H.⁵ Lin, C.-Y.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.