Volume 4, Issue 5, 2010, Pages 845-856

Online diarization of streaming audio-visual data for smart environments

Author keywords

ambient communication; diarization; middleware

Indexed keywords

AMBIENT COMMUNICATION; AUDIO-VISUAL DATA; CONTEXT INFORMATION; CONTEXTUAL INFORMATION; DIARIZATION; FACE IDENTIFICATION; I/O DEVICE; LATENCY ASPECTS; MICROPHONE ARRAYS; MULTI-MODAL; ONLINE PROCESSING; REMOTE PARTNERS; SERVICE-ORIENTED MIDDLEWARE; SMART ENVIRONMENT; SPEAKER DIARIZATION; SPEAKER LOCALIZATION; TEMPORAL SEGMENTATIONS; VIDEO DATA;

EID: 77956739906     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2050519     Document Type: Article
Times cited : (19)

References (58)
  • 2
    • 77956745776
    • Information Society Technologies Advisory Group Reports, [Online]. Available
    • "Information Society Technologies Advisory Group Reports," 2003 [Online]. Available: http://cordis.europa.eu/fp7/ict/istag/reports-en.html
    • (2003)
  • 3
    • 34047261805
    • An overview of automatic speaker diarization systems
    • Sep
    • S. Tranter and D. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.5, pp. 1557-1565, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1557-1565
    • Tranter, S.1    Reynolds, D.2
  • 4
    • 29044442235
    • Step-by-step and integrated approaches in broadcast news speaker diarization
    • Jul.
    • S. Meignier, D. Moraru, C. Fredouille, J. Bonastre, and L. Besacier, "Step-by-step and integrated approaches in broadcast news speaker diarization," Comput. Speech Lang., vol.20, no.2-3, pp. 303-330, Jul. 2006.
    • (2006) Comput. Speech Lang. , vol.20 , Issue.2-3 , pp. 303-330
    • Meignier, S.1    Moraru, D.2    Fredouille, C.3    Bonastre, J.4    Besacier, L.5
  • 12
    • 34548310397
    • Speaker diarization for multiple-distant-microphone meetings using several sources of information
    • Sep
    • J. Pardo, X. Anguera, and C. Wooters, "Speaker diarization for multiple-distant-microphone meetings using several sources of information," IEEE Trans. Comput., vol.56, no.9, pp. 1212-1224, Sep. 2007.
    • (2007) IEEE Trans. Comput. , vol.56 , Issue.9 , pp. 1212-1224
    • Pardo, J.1    Anguera, X.2    Wooters, C.3
  • 15
  • 16
    • 50449092763
    • Jan. [Online]. Available:
    • "Augmented multi-party interaction," AMI Consortium Jan. 2004 [Online]. Available: http://www.amiproject.org/
    • (2004) Augmented Multi-party Interaction
  • 17
    • 47749089085
    • Acoustic event detection and classification in smart-room environments: Evaluation of CHIL project systems
    • Nov
    • A. Temko, R. Malkin, C. Ziegler, D. Macho, C. Nadeu, and M. Omologo, "Acoustic event detection and classification in smart-room environments: Evaluation of CHIL project systems," J. Tecnologia del Habla, vol.4, pp. 1-6, Nov. 2006.
    • (2006) J. Tecnologia Del Habla , vol.4 , pp. 1-6
    • Temko, A.1    Malkin, R.2    Ziegler, C.3    Macho, D.4    Nadeu, C.5    Omologo, M.6
  • 18
  • 21
    • 4043181855
    • Robust joint audio-video localization in video conferencing using reliability information
    • Aug
    • D. Lo, R. A. Goubran, R. M. Dansereau, G. Thompson, and D. Schulz, "Robust joint audio-video localization in video conferencing using reliability information," IEEE Trans. Instrum. Meas., vol.53, no.4, pp. 1132-1139, Aug. 2004.
    • (2004) IEEE Trans. Instrum. Meas. , vol.53 , Issue.4 , pp. 1132-1139
    • Lo, D.1    Goubran, R.A.2    Dansereau, R.M.3    Thompson, G.4    Schulz, D.5
  • 25
    • 59349097768
    • SOA what?
    • Mar
    • M. J. Carey, "SOA what?," IEEE Trans. Comput., vol.41, no.3, pp. 92-94, Mar. 2008.
    • (2008) IEEE Trans. Comput. , vol.41 , Issue.3 , pp. 92-94
    • Carey, M.J.1
  • 28
    • 0011100220
    • Use of voicing and pitch information for speaker recognition
    • Canberra, Australia, Dec
    • B. Wildermoth and K. Paliwal, "Use of voicing and pitch information for speaker recognition," in Proc. IEEE Conf. Speech Sci. Technol. (SST'00), Canberra, Australia, Dec. 2000, pp. 324-328.
    • (2000) Proc. IEEE Conf. Speech Sci. Technol. (SST'00) , pp. 324-328
    • Wildermoth, B.1    Paliwal, K.2
  • 29
    • 0034273195
    • Distbic: A speaker-based segmentation for audio data indexing
    • Sep
    • P. Delacourt and C. J. Wellekens, "Distbic: A speaker-based segmentation for audio data indexing," Speech Commun., vol.32, no.1-2, pp. 111-126, Sep. 2000.
    • (2000) Speech Commun , vol.32 , Issue.1-2 , pp. 111-126
    • Delacourt, P.1    Wellekens, C.J.2
  • 30
    • 1842476689
    • Efficient voice activity detection algorithms using long-term speech information
    • Apr
    • J. Ramirez and J. Segura, "Efficient voice activity detection algorithms using long-term speech information," Speech Commun., vol.42, no.3-4, pp. 271-287, Apr. 2004.
    • (2004) Speech Commun , vol.42 , Issue.3-4 , pp. 271-287
    • Ramirez, J.1    Segura, J.2
  • 31
    • 0038494980
    • Winscale: An image-scaling algorithm using an area pixel model
    • Jun
    • C. Kim, S. Seong, J. Lee, and L. Kim, "Winscale: An image-scaling algorithm using an area pixel model," IEEE Trans. Circuits Syst. Video Technol., vol.13, no.6, pp. 549-553, Jun. 2003.
    • (2003) IEEE Trans. Circuits Syst. Video Technol. , vol.13 , Issue.6 , pp. 549-553
    • Kim, C.1    Seong, S.2    Lee, J.3    Kim, L.4
  • 34
    • 0031185845
    • Eigenfaces vs. fisherfaces: Recognition using class specific linear projection
    • Jul.
    • P. Belhumeur, J. Hespanha, and D. Kriegman, "Eigenfaces vs. fisherfaces: Recognition using class specific linear projection," IEEE Trans. Pattern Anal. Mach. Intell., vol.19, no.7, pp. 711-720, Jul. 1997.
    • (1997) IEEE Trans. Pattern Anal. Mach. Intell. , vol.19 , Issue.7 , pp. 711-720
    • Belhumeur, P.1    Hespanha, J.2    Kriegman, D.3
  • 35
    • 1842705510
    • Yale University. New Haven, CT, Apr, [Online]. Available
    • "The Yale Face Database," Yale University. New Haven, CT, Apr. 2009 [Online]. Available: http://cvc.yale.edu/projects/yalefaces/yalefaces.html
    • (2009) The Yale Face Database
  • 36
    • 0016990291
    • The generalized correlation method for estimation of time delay
    • Aug
    • C. Knapp and G. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-24, no.4, pp. 320-327, Aug. 1976.
    • (1976) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-24 , Issue.4 , pp. 320-327
    • Knapp, C.1    Carter, G.2
  • 38
    • 0018455820
    • Image method for efficiently simulating small-room acoustics
    • Apr
    • J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol.65, no.4, pp. 943-950, Apr. 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.65 , Issue.4 , pp. 943-950
    • Allen, J.B.1    Berkley, D.A.2
  • 39
    • 38949122754
    • Speaker segmentation and clustering
    • May
    • M. Kotti, V. Moschou, and C. Kotropoulos, "Speaker segmentation and clustering," Signal Process., vol.88, no.5, pp. 1091-1124, May 2007.
    • (2007) Signal Process , vol.88 , Issue.5 , pp. 1091-1124
    • Kotti, M.1    Moschou, V.2    Kotropoulos, C.3
  • 41
    • 0033884858
    • Speaker verification using adapted Gaussian mixture models
    • Jan
    • D. Reynolds, T. Quatieri, and R. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Process., vol.10, no.1-3, pp. 19-41, Jan. 2000.
    • (2000) Digital Signal Process , vol.10 , Issue.1-3 , pp. 19-41
    • Reynolds, D.1    Quatieri, T.2    Dunn, R.3
  • 46
    • 63349092936
    • Jan, [Online]. Available
    • D. Beckett et al., "Resource description framework," World Wide Web Consortium, Jan. 2008 [Online]. Available: http://www.w3.org/RDF/
    • (2008) Resource Description Framework
    • Beckett, D.1
  • 47
    • 0004135793
    • Web Services. [Online]. Available
    • WWW. (2002) Web Services. World Wide Web Consortium. [Online]. Available: http://www.w3.org/2002/ws/
    • (2002) World Wide Web Consortium
  • 48
    • 33845938313
    • Efficient semantic service discovery in pervasive computing environments
    • S. Mokhtar, A. Kaul, N. Georgantas, and V. Issarny, "Efficient semantic service discovery in pervasive computing environments," Lecture Notes in Computer Science, no.4290, pp. 240-259, 2007.
    • (2007) Lecture Notes in Computer Science , vol.4290 , pp. 240-259
    • Mokhtar, S.1    Kaul, A.2    Georgantas, N.3    Issarny, V.4
  • 49
    • 56649104764
    • Jun, [Online]. Available:
    • R. Chinnici et al., "Web services description language," World Wide Web Consortium. Jun. 2007 [Online]. Available: http://www.w3.org/TR/wsdl20/
    • (2007) Web Services Description Language
    • Chinnici, R.1
  • 50
    • 77956730254
    • Web ontology language for web services
    • [Online]. Available
    • DAML. (2006) Web Ontology Language for Web Services. DARPA Agent Markup Language. [Online]. Available: http://www.daml.org/services/owl-s/
    • (2006) DARPA Agent Markup Language
  • 53
    • 70449409320
    • Jan. [Online]. Available:
    • Open Service Gateway Initiative, Jan. 2008 [Online]. Available: http://www.osgi.org
    • (2008) Open Service Gateway Initiative
  • 54
    • 77956762083
    • [Online]. Available
    • F. Ramparany et al., "Amigo software repository," Amigo Consortium, 2009 [Online]. Available: http://amigo.gforge.inria.fr
    • (2009) Amigo Software Repository
    • Ramparany, F.1
  • 56
    • 77956771380
    • [Online]. Available
    • Audio Codec, 2008 [Online]. Available: http://www.speex.org
    • (2008) Audio Codec
  • 58
    • 77956753880
    • [Online]. Available
    • "Theora Codec," 2008 [Online]. Available: http://www.Theora.org
    • (2008)


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.