-
1
-
-
64949186933
-
-
AMI project, Online, Available
-
"AMI project," 2005 [Online]. Available: http://www.ami.org.
-
(2005)
-
-
-
2
-
-
64949203576
-
-
IM2 project, Online, Available
-
"IM2 project," 2005 [Online], Available: http://www.im2.ch.
-
(2005)
-
-
-
3
-
-
64949173906
-
-
M4 project, Online, Available
-
"M4 project," 2005 [Online], Available: http://www.m4project. org.
-
(2005)
-
-
-
4
-
-
33745198227
-
Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles
-
B. Schuller, R. Miiller, M. Lang, and G. Rigoll, "Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles," in Proc. INTERSPEECH'05, Special Session: Emotional Speech Analysis and Synthesis: Towards a Multimodal Approach., 2005, pp. 805-809.
-
(2005)
Proc. INTERSPEECH'05, Special Session: Emotional Speech Analysis and Synthesis: Towards a Multimodal Approach
, pp. 805-809
-
-
Schuller, B.1
Miiller, R.2
Lang, M.3
Rigoll, G.4
-
5
-
-
85009168880
-
-
B. Wrede and E. Shriberg. Spotting hot spots in meetings: Human judgements and prosodic cues, in Proc. Eur: Conf. Speech Commun. Technol., 2003, pp. 2805-2808.
-
B. Wrede and E. Shriberg. "Spotting "hot spots" in meetings: Human judgements and prosodic cues," in Proc. Eur: Conf. Speech Commun. Technol., 2003, pp. 2805-2808.
-
-
-
-
6
-
-
24144451280
-
-
S. Tucker and S. Whittaker, Accessing Multimodal Meeting Data: Systems, Problems and Possibilities, in Lecture Notes In Computer Science, B. S. and H. Bourlard, Eds. New York: Springer, 2005, 3361. pp. 1-11.
-
S. Tucker and S. Whittaker, "Accessing Multimodal Meeting Data: Systems, Problems and Possibilities," in Lecture Notes In Computer Science, B. S. and H. Bourlard, Eds. New York: Springer, 2005, vol. 3361. pp. 1-11.
-
-
-
-
7
-
-
33646786066
-
Novel techniques for time-compressing speech: An exploratory study
-
S. Tucker and S. Whittaker, "Novel techniques for time-compressing speech: An exploratory study," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2005. pp. 477-480.
-
(2005)
Proc. Int. Conf. Acoust., Speech, Signal Process
, pp. 477-480
-
-
Tucker, S.1
Whittaker, S.2
-
8
-
-
0000249046
-
Techniques, perception, and applications of time-compressed speech
-
Voice I/O Soc, Sep
-
B. Arons, "Techniques, perception, and applications of time-compressed speech," in Proc. Conf., Amer. Voice I/O Soc, Sep. 1992, pp. 169-177.
-
(1992)
Proc. Conf., Amer
, pp. 169-177
-
-
Arons, B.1
-
9
-
-
0041911174
-
A new approach to automatic speech summarization
-
Sep
-
C. Hori and S. Furui, "A new approach to automatic speech summarization," IEEE Trans. Multimedia, vol. 5, no. 3, pp. 368-378, Sep. 2003.
-
(2003)
IEEE Trans. Multimedia
, vol.5
, Issue.3
, pp. 368-378
-
-
Hori, C.1
Furui, S.2
-
10
-
-
33646786445
-
From text to speech summarization, in Pmc
-
K. McKeown, J. Hirschberg, M. Galley, and S. Maskey, "From text to speech summarization," in Pmc. ICASSP, 2005, pp. V-997-V-1000.
-
(2005)
ICASSP
-
-
McKeown, K.1
Hirschberg, J.2
Galley, M.3
Maskey, S.4
-
11
-
-
79951784751
-
Automatic summarisation of broadcast news using structural features
-
S. Maskey and J. Hirschberg, "Automatic summarisation of broadcast news using structural features," in Proc. Euwspeech, 2003, pp. 1173-1176.
-
(2003)
Proc. Euwspeech
, pp. 1173-1176
-
-
Maskey, S.1
Hirschberg, J.2
-
12
-
-
0019582545
-
Time-scale modification of speech based on short-time fourier analysis
-
Jun
-
M. Portnoff, "Time-scale modification of speech based on short-time fourier analysis," IEEE Trans. Acoust., Speech, Signal Process., vol. 29, no. 3, pp. 374-390, Jun. 1981.
-
(1981)
IEEE Trans. Acoust., Speech, Signal Process
, vol.29
, Issue.3
, pp. 374-390
-
-
Portnoff, M.1
-
13
-
-
0009608547
-
Real-time time-scale modification of speech via the synchronized overlap-add algorithm,
-
M.S. thesis. Mass. Inst. Technol. Cambridge
-
D. Hejna, "Real-time time-scale modification of speech via the synchronized overlap-add algorithm," M.S. thesis. Mass. Inst. Technol. Cambridge, 1990.
-
(1990)
-
-
Hejna, D.1
-
14
-
-
0348225754
-
Time and frequency altered speech
-
Phonetics, N. Lass, Ed. New York: Academic
-
D. Beasley and J. Maki, "Time and frequency altered speech," in Contemporary Issues In Experimental Phonetics, N. Lass, Ed. New York: Academic, 1976, pp. 419-458.,
-
(1976)
Contemporary Issues In Experimental
, pp. 419-458
-
-
Beasley, D.1
Maki, J.2
-
15
-
-
70350293916
-
-
T. Sticht, Comprehension of repeated time-compression recordings, J. Experimental Education, 37, no. 4, pp. 60-62, 1969.
-
T. Sticht, "Comprehension of repeated time-compression recordings," J. Experimental Education, vol. 37, no. 4, pp. 60-62, 1969.
-
-
-
-
16
-
-
0013249653
-
An intelligent media browser using automatic multimodal analysis
-
Sep
-
J. Foote, G. Boreczky, and L. Wilcox, "An intelligent media browser using automatic multimodal analysis," ACM Multimedia, pp. 375-380, Sep. 1998.
-
(1998)
ACM Multimedia
, pp. 375-380
-
-
Foote, J.1
Boreczky, G.2
Wilcox, L.3
-
17
-
-
0031624947
-
Mach 1 for nonuniform time - scale modification of speech
-
M. Covell, M. Withgott, and M. Slaney, "Mach 1 for nonuniform time - scale modification of speech," in Proc. ICASSP, 1998, pp. 349-352.
-
(1998)
Proc. ICASSP
, pp. 349-352
-
-
Covell, M.1
Withgott, M.2
Slaney, M.3
-
19
-
-
0022671781
-
Referring as a collaborative process
-
H. Clark and D. Wilkes-Gibbs, "Referring as a collaborative process," Cognition, vol. 22, pp. 1-39, 1986.
-
(1986)
Cognition
, vol.22
, pp. 1-39
-
-
Clark, H.1
Wilkes-Gibbs, D.2
-
20
-
-
0017626208
-
Application of an LPC distance measure to the voiced-unvoiced-silence detection problem
-
Aug
-
L. Rabiner andM. Sambur, "Application of an LPC distance measure to the voiced-unvoiced-silence detection problem," IEEE Trans. Acoust., Speech, Signal Process., vol. 25, no. 4, pp. 338-343, Aug. 1977, 8.
-
(1977)
IEEE Trans. Acoust., Speech, Signal Process
, vol.25
, Issue.4
-
-
Rabiner andM, L.1
Sambur2
-
21
-
-
84953744816
-
A statistical interpretation of term specificity and it's application in retrieval
-
K. Sparck Jones, "A statistical interpretation of term specificity and it's application in retrieval," J. Documentation, vol. 28, pp. 11-21, 1972.
-
(1972)
J. Documentation
, vol.28
, pp. 11-21
-
-
Sparck Jones, K.1
-
22
-
-
33644661135
-
A glimpsing model of speech perception in noise
-
M. Cooke, "A glimpsing model of speech perception in noise," J. Acoust. Soc. Amen, vol. 119, pp. 1562-1573, 2006.
-
(2006)
J. Acoust. Soc. Amen
, vol.119
, pp. 1562-1573
-
-
Cooke, M.1
-
23
-
-
85143190393
-
-
I. McCowan, S. Bengio, D. Gatica-Perez, G. Lathoud, M. F., D. Moore, P. Wellner, and H. Bourlard, Modeling human interaction in meetings, in Proc. ICASSP, Apr. 2003, pp. IV-748-IV-751.
-
I. McCowan, S. Bengio, D. Gatica-Perez, G. Lathoud, M. F., D. Moore, P. Wellner, and H. Bourlard, "Modeling human interaction in meetings," in Proc. ICASSP, Apr. 2003, pp. IV-748-IV-751.
-
-
-
-
24
-
-
64949116393
-
IDIAP, Tech
-
The IDIAP smart meeting room, Rep. ID1AP, COM02-07
-
D. Moore, The IDIAP smart meeting room," IDIAP, Tech. Rep. ID1AP - COM02-07, 2002.
-
(2002)
-
-
Moore, D.1
-
25
-
-
0022995270
-
Word integlligibility decrements and die comprehension of time-compressed speech
-
G. Heiman, R. Leio, H. Leighbody, and B. K., "Word integlligibility decrements and die comprehension of time-compressed speech," Percept. Psychophys., vol. 40, no. 6, pp. 407-411, 1986.
-
(1986)
Percept. Psychophys
, vol.40
, Issue.6
, pp. 407-411
-
-
Heiman, G.1
Leio, R.2
Leighbody, H.3
-
26
-
-
4544349215
-
Improving speech playback using time-compression and speech recognition
-
Apr
-
S. Vemuri, P. DeCamp, W. Bender, and C. Schmandt, "Improving speech playback using time-compression and speech recognition," in Proc. CHI'04, Apr. 2004, pp. 295-302.
-
(2004)
Proc. CHI'04
, pp. 295-302
-
-
Vemuri, S.1
DeCamp, P.2
Bender, W.3
Schmandt, C.4
-
27
-
-
0037480836
-
-
S. Whittaker, J. Hirschberg, B. Amento, L. Stark, M. Bacchiani, P. Isen - hour, L. Stead, G. Zamchick, and A. Rosenberg, SCANmail: A voice - mail interface that makes speech browsable, readable and searchable, in Proc. CHI,02, Apr. 2002, pp. 275-282.
-
S. Whittaker, J. Hirschberg, B. Amento, L. Stark, M. Bacchiani, P. Isen - hour, L. Stead, G. Zamchick, and A. Rosenberg, "SCANmail: A voice - mail interface that makes speech browsable, readable and searchable," in Proc. CHI,02, Apr. 2002, pp. 275-282.
-
-
-
-
28
-
-
85142395273
-
Play it again: A study of the factors underlying speech browsing behavior
-
S. Whittaker, J. Hirschberg, and C. Nakatani, "Play it again: A study of the factors underlying speech browsing behavior," in Proc. CHI'98, 1998, pp. 247-248.
-
(1998)
Proc. CHI'98
, pp. 247-248
-
-
Whittaker, S.1
Hirschberg, J.2
Nakatani, C.3
-
29
-
-
46749122831
-
Time is of the essence: An evaluation of temporal compression algorithms
-
S. Tucker and S. Whittaker, "Time is of the essence: An evaluation of temporal compression algorithms," in Proc. CHI'06, 2006, pp. 71-80.
-
(2006)
Proc. CHI'06
, pp. 71-80
-
-
Tucker, S.1
Whittaker, S.2
-
30
-
-
4544246861
-
Semantic speech editing
-
S. Whittaker and B. Amento, "Semantic speech editing," in Proc. CHI'04, 2004, pp. 527-534.
-
(2004)
Proc. CHI'04
, pp. 527-534
-
-
Whittaker, S.1
Amento, B.2
|