메뉴 건너뛰기




Volumn 16, Issue 4, 2008, Pages 790-796

Temporal compression of speech: An evaluation

Author keywords

Information retrieval; Speech processing; Text processing; User interfaces

Indexed keywords

ACOUSTIC FEATURES; HYBRID ALGORITHMS; OVER-SPEED; POINTS OF INTERESTS; SEMANTIC FEATURES; SPEECH RATES; SPEECH RECORDINGS; SPEED-UP; TEMPORAL COMPRESSIONS;

EID: 57849129912     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.916527     Document Type: Article
Times cited : (13)

References (30)
  • 1
    • 64949186933 scopus 로고    scopus 로고
    • AMI project, Online, Available
    • "AMI project," 2005 [Online]. Available: http://www.ami.org.
    • (2005)
  • 2
    • 64949203576 scopus 로고    scopus 로고
    • IM2 project, Online, Available
    • "IM2 project," 2005 [Online], Available: http://www.im2.ch.
    • (2005)
  • 3
    • 64949173906 scopus 로고    scopus 로고
    • M4 project, Online, Available
    • "M4 project," 2005 [Online], Available: http://www.m4project. org.
    • (2005)
  • 5
    • 85009168880 scopus 로고    scopus 로고
    • B. Wrede and E. Shriberg. Spotting hot spots in meetings: Human judgements and prosodic cues, in Proc. Eur: Conf. Speech Commun. Technol., 2003, pp. 2805-2808.
    • B. Wrede and E. Shriberg. "Spotting "hot spots" in meetings: Human judgements and prosodic cues," in Proc. Eur: Conf. Speech Commun. Technol., 2003, pp. 2805-2808.
  • 6
    • 24144451280 scopus 로고    scopus 로고
    • S. Tucker and S. Whittaker, Accessing Multimodal Meeting Data: Systems, Problems and Possibilities, in Lecture Notes In Computer Science, B. S. and H. Bourlard, Eds. New York: Springer, 2005, 3361. pp. 1-11.
    • S. Tucker and S. Whittaker, "Accessing Multimodal Meeting Data: Systems, Problems and Possibilities," in Lecture Notes In Computer Science, B. S. and H. Bourlard, Eds. New York: Springer, 2005, vol. 3361. pp. 1-11.
  • 8
    • 0000249046 scopus 로고
    • Techniques, perception, and applications of time-compressed speech
    • Voice I/O Soc, Sep
    • B. Arons, "Techniques, perception, and applications of time-compressed speech," in Proc. Conf., Amer. Voice I/O Soc, Sep. 1992, pp. 169-177.
    • (1992) Proc. Conf., Amer , pp. 169-177
    • Arons, B.1
  • 9
    • 0041911174 scopus 로고    scopus 로고
    • A new approach to automatic speech summarization
    • Sep
    • C. Hori and S. Furui, "A new approach to automatic speech summarization," IEEE Trans. Multimedia, vol. 5, no. 3, pp. 368-378, Sep. 2003.
    • (2003) IEEE Trans. Multimedia , vol.5 , Issue.3 , pp. 368-378
    • Hori, C.1    Furui, S.2
  • 11
    • 79951784751 scopus 로고    scopus 로고
    • Automatic summarisation of broadcast news using structural features
    • S. Maskey and J. Hirschberg, "Automatic summarisation of broadcast news using structural features," in Proc. Euwspeech, 2003, pp. 1173-1176.
    • (2003) Proc. Euwspeech , pp. 1173-1176
    • Maskey, S.1    Hirschberg, J.2
  • 12
    • 0019582545 scopus 로고
    • Time-scale modification of speech based on short-time fourier analysis
    • Jun
    • M. Portnoff, "Time-scale modification of speech based on short-time fourier analysis," IEEE Trans. Acoust., Speech, Signal Process., vol. 29, no. 3, pp. 374-390, Jun. 1981.
    • (1981) IEEE Trans. Acoust., Speech, Signal Process , vol.29 , Issue.3 , pp. 374-390
    • Portnoff, M.1
  • 13
    • 0009608547 scopus 로고
    • Real-time time-scale modification of speech via the synchronized overlap-add algorithm,
    • M.S. thesis. Mass. Inst. Technol. Cambridge
    • D. Hejna, "Real-time time-scale modification of speech via the synchronized overlap-add algorithm," M.S. thesis. Mass. Inst. Technol. Cambridge, 1990.
    • (1990)
    • Hejna, D.1
  • 14
    • 0348225754 scopus 로고
    • Time and frequency altered speech
    • Phonetics, N. Lass, Ed. New York: Academic
    • D. Beasley and J. Maki, "Time and frequency altered speech," in Contemporary Issues In Experimental Phonetics, N. Lass, Ed. New York: Academic, 1976, pp. 419-458.,
    • (1976) Contemporary Issues In Experimental , pp. 419-458
    • Beasley, D.1    Maki, J.2
  • 15
    • 70350293916 scopus 로고    scopus 로고
    • T. Sticht, Comprehension of repeated time-compression recordings, J. Experimental Education, 37, no. 4, pp. 60-62, 1969.
    • T. Sticht, "Comprehension of repeated time-compression recordings," J. Experimental Education, vol. 37, no. 4, pp. 60-62, 1969.
  • 16
    • 0013249653 scopus 로고    scopus 로고
    • An intelligent media browser using automatic multimodal analysis
    • Sep
    • J. Foote, G. Boreczky, and L. Wilcox, "An intelligent media browser using automatic multimodal analysis," ACM Multimedia, pp. 375-380, Sep. 1998.
    • (1998) ACM Multimedia , pp. 375-380
    • Foote, J.1    Boreczky, G.2    Wilcox, L.3
  • 17
    • 0031624947 scopus 로고    scopus 로고
    • Mach 1 for nonuniform time - scale modification of speech
    • M. Covell, M. Withgott, and M. Slaney, "Mach 1 for nonuniform time - scale modification of speech," in Proc. ICASSP, 1998, pp. 349-352.
    • (1998) Proc. ICASSP , pp. 349-352
    • Covell, M.1    Withgott, M.2    Slaney, M.3
  • 19
    • 0022671781 scopus 로고
    • Referring as a collaborative process
    • H. Clark and D. Wilkes-Gibbs, "Referring as a collaborative process," Cognition, vol. 22, pp. 1-39, 1986.
    • (1986) Cognition , vol.22 , pp. 1-39
    • Clark, H.1    Wilkes-Gibbs, D.2
  • 20
    • 0017626208 scopus 로고
    • Application of an LPC distance measure to the voiced-unvoiced-silence detection problem
    • Aug
    • L. Rabiner andM. Sambur, "Application of an LPC distance measure to the voiced-unvoiced-silence detection problem," IEEE Trans. Acoust., Speech, Signal Process., vol. 25, no. 4, pp. 338-343, Aug. 1977, 8.
    • (1977) IEEE Trans. Acoust., Speech, Signal Process , vol.25 , Issue.4
    • Rabiner andM, L.1    Sambur2
  • 21
    • 84953744816 scopus 로고
    • A statistical interpretation of term specificity and it's application in retrieval
    • K. Sparck Jones, "A statistical interpretation of term specificity and it's application in retrieval," J. Documentation, vol. 28, pp. 11-21, 1972.
    • (1972) J. Documentation , vol.28 , pp. 11-21
    • Sparck Jones, K.1
  • 22
    • 33644661135 scopus 로고    scopus 로고
    • A glimpsing model of speech perception in noise
    • M. Cooke, "A glimpsing model of speech perception in noise," J. Acoust. Soc. Amen, vol. 119, pp. 1562-1573, 2006.
    • (2006) J. Acoust. Soc. Amen , vol.119 , pp. 1562-1573
    • Cooke, M.1
  • 23
    • 85143190393 scopus 로고    scopus 로고
    • I. McCowan, S. Bengio, D. Gatica-Perez, G. Lathoud, M. F., D. Moore, P. Wellner, and H. Bourlard, Modeling human interaction in meetings, in Proc. ICASSP, Apr. 2003, pp. IV-748-IV-751.
    • I. McCowan, S. Bengio, D. Gatica-Perez, G. Lathoud, M. F., D. Moore, P. Wellner, and H. Bourlard, "Modeling human interaction in meetings," in Proc. ICASSP, Apr. 2003, pp. IV-748-IV-751.
  • 24
    • 64949116393 scopus 로고    scopus 로고
    • IDIAP, Tech
    • The IDIAP smart meeting room, Rep. ID1AP, COM02-07
    • D. Moore, The IDIAP smart meeting room," IDIAP, Tech. Rep. ID1AP - COM02-07, 2002.
    • (2002)
    • Moore, D.1
  • 25
    • 0022995270 scopus 로고
    • Word integlligibility decrements and die comprehension of time-compressed speech
    • G. Heiman, R. Leio, H. Leighbody, and B. K., "Word integlligibility decrements and die comprehension of time-compressed speech," Percept. Psychophys., vol. 40, no. 6, pp. 407-411, 1986.
    • (1986) Percept. Psychophys , vol.40 , Issue.6 , pp. 407-411
    • Heiman, G.1    Leio, R.2    Leighbody, H.3
  • 26
    • 4544349215 scopus 로고    scopus 로고
    • Improving speech playback using time-compression and speech recognition
    • Apr
    • S. Vemuri, P. DeCamp, W. Bender, and C. Schmandt, "Improving speech playback using time-compression and speech recognition," in Proc. CHI'04, Apr. 2004, pp. 295-302.
    • (2004) Proc. CHI'04 , pp. 295-302
    • Vemuri, S.1    DeCamp, P.2    Bender, W.3    Schmandt, C.4
  • 27
    • 0037480836 scopus 로고    scopus 로고
    • S. Whittaker, J. Hirschberg, B. Amento, L. Stark, M. Bacchiani, P. Isen - hour, L. Stead, G. Zamchick, and A. Rosenberg, SCANmail: A voice - mail interface that makes speech browsable, readable and searchable, in Proc. CHI,02, Apr. 2002, pp. 275-282.
    • S. Whittaker, J. Hirschberg, B. Amento, L. Stark, M. Bacchiani, P. Isen - hour, L. Stead, G. Zamchick, and A. Rosenberg, "SCANmail: A voice - mail interface that makes speech browsable, readable and searchable," in Proc. CHI,02, Apr. 2002, pp. 275-282.
  • 28
    • 85142395273 scopus 로고    scopus 로고
    • Play it again: A study of the factors underlying speech browsing behavior
    • S. Whittaker, J. Hirschberg, and C. Nakatani, "Play it again: A study of the factors underlying speech browsing behavior," in Proc. CHI'98, 1998, pp. 247-248.
    • (1998) Proc. CHI'98 , pp. 247-248
    • Whittaker, S.1    Hirschberg, J.2    Nakatani, C.3
  • 29
    • 46749122831 scopus 로고    scopus 로고
    • Time is of the essence: An evaluation of temporal compression algorithms
    • S. Tucker and S. Whittaker, "Time is of the essence: An evaluation of temporal compression algorithms," in Proc. CHI'06, 2006, pp. 71-80.
    • (2006) Proc. CHI'06 , pp. 71-80
    • Tucker, S.1    Whittaker, S.2
  • 30
    • 4544246861 scopus 로고    scopus 로고
    • Semantic speech editing
    • S. Whittaker and B. Amento, "Semantic speech editing," in Proc. CHI'04, 2004, pp. 527-534.
    • (2004) Proc. CHI'04 , pp. 527-534
    • Whittaker, S.1    Amento, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.