-
3
-
-
0032646977
-
An overview of audio information retrieval
-
J. T. Foote, "An overview of audio information retrieval," Multimedia Syst., vol. 7, pp. 2-10, 1999.
-
(1999)
Multimedia Syst
, vol.7
, pp. 2-10
-
-
Foote, J.T.1
-
4
-
-
0037622306
-
Enhancing sonic browsing using audio information retrieval
-
presented at the, Kyoto, Japan, unpublished
-
E. Brazil, M. Fernstrom, G. Tzanetakis, and P. Cook, "Enhancing sonic browsing using audio information retrieval," presented at the Int. Conf. Auditory Display (ICAD), Kyoto, Japan, 2002, unpublished.
-
(2002)
Int. Conf. Auditory Display (ICAD)
-
-
Brazil, E.1
Fernstrom, M.2
Tzanetakis, G.3
Cook, P.4
-
5
-
-
0037491736
-
Audio Information Retrieval (AIR) tools
-
presented at the, Music Information Retrieval ISMIR, Plymouth, MA
-
G. Tzanetakis and P. Cook, "Audio Information Retrieval (AIR) tools," presented at the Int. Symp. Music Information Retrieval (ISMIR), Plymouth, MA, 2000.
-
(2000)
Int. Symp
-
-
Tzanetakis, G.1
Cook, P.2
-
6
-
-
0141867819
-
Concept framework for audio information retrieval: ARF
-
G. H. Li, D. F.Wu, and J. Zhang, "Concept framework for audio information retrieval: ARF," J. Comput. Sci. Technol., vol. 18, pp. 667-673, 2003.
-
(2003)
J. Comput. Sci. Technol
, vol.18
, pp. 667-673
-
-
Li, G.H.1
Wu, D.F.2
Zhang, J.3
-
7
-
-
34547550979
-
Soundspotter-a prototype system for content based audio retrieval
-
presented at the, Hamburg, Germany
-
C. Spevak and E. Favreau, "Soundspotter-a prototype system for content based audio retrieval," presented at the Int. Conf. Digital Audio Effects (DAFx-02), Hamburg, Germany, 2002.
-
(2002)
Int. Conf. Digital Audio Effects (DAFx-02)
-
-
Spevak, C.1
Favreau, E.2
-
9
-
-
0024899342
-
Spotting Japanese CV-syllables and phonemes using the time-delay neural networks
-
presented at the, ICASSP-, Glasgow, U.K
-
H. Sawai, A.Waibel, M. Miyatake, and K. Shikano, "Spotting Japanese CV-syllables and phonemes using the time-delay neural networks," presented at the Int. Conf. Acoust., Speech, Signal Process. (ICASSP- 89), Glasgow, U.K., 1989.
-
(1989)
Int. Conf. Acoust., Speech, Signal Process
, pp. 89
-
-
Sawai, H.1
Waibel, A.2
Miyatake, M.3
Shikano, K.4
-
10
-
-
78651256562
-
Selective phoneme spotting for realisation of an /s, z, C, t/ transpose
-
Linz, Austria: Springer, 2398
-
D. Bauer, A. Plinge, and M. Finke, "Selective phoneme spotting for realisation of an /s, z, C, t/ transpose," in Lecture Notes in Computer Science-ICHHP 2002. Linz, Austria: Springer, 2002, vol. 2398.
-
(2002)
Lecture Notes in Computer Science-ICHHP
, vol.2002
-
-
Bauer, D.1
Plinge, A.2
Finke, M.3
-
11
-
-
64149093868
-
Introducing restoration of selectivity in hearing instrument design through phoneme spotting
-
Assistive Technology: Shaping the Future, G. M. Craddock, L. P. McCormack, R. B. Reilly, and H. Knops, Eds. Amsterdam, The Netherlands: IOS Press
-
A. Plinge and D. Bauer, "Introducing restoration of selectivity in hearing instrument design through phoneme spotting," in Assistive Technology: Shaping the Future, ser. Assistive Technology Research Series, G. M. Craddock, L. P. McCormack, R. B. Reilly, and H. Knops, Eds. Amsterdam, The Netherlands: IOS Press, 2003, vol. 11.
-
(2003)
ser. Assistive Technology Research Series
, vol.11
-
-
Plinge, A.1
Bauer, D.2
-
12
-
-
33745214121
-
Laughter detection in meetings
-
presented at the, Montreal, QC, Canada
-
L. Kennedy and D. Ellis, "Laughter detection in meetings," presented at the NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, QC, Canada, 2004.
-
(2004)
NIST ICASSP 2004 Meeting Recognition Workshop
-
-
Kennedy, L.1
Ellis, D.2
-
13
-
-
50249092288
-
Prosody and parsing
-
presented at the, Cape Cod, MA
-
P. J. Price, M. Ostendorf, and C. W.Wightman, "Prosody and parsing," presented at the DARPA Workshop on Speech and Natural Language, Cape Cod, MA, 1989.
-
(1989)
DARPA Workshop on Speech and Natural Language
-
-
Price, P.J.1
Ostendorf, M.2
Wightman, C.W.3
-
14
-
-
0031624947
-
Mach1: Nonuniform time-scale modification of speech
-
presented at the
-
M. Covell, M. Withgott, and M. Slaney, "Mach1: Nonuniform time-scale modification of speech," presented at the IEEE ICASSP-98, Seattle, WA, 1998.
-
(1998)
IEEE ICASSP-98, Seattle, WA
-
-
Covell, M.1
Withgott, M.2
Slaney, M.3
-
16
-
-
64149128912
-
Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording,
-
U.S. Patent 6 161 087, Dec. 12
-
C. W.Wightman and J. Bachenko, "Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording," U.S. Patent 6 161 087, Dec. 12, 2000.
-
(2000)
-
-
Wightman, C.W.1
Bachenko, J.2
-
17
-
-
77951016396
-
Detection of clicks in audio signals using warped linear prediction
-
presented at the, Santorini, Greece
-
P. A. A. Esquef, M. Karjalainen, and V. Välimäki, "Detection of clicks in audio signals using warped linear prediction," presented at the 14th IEEE Int. Conf. Digital Signal Process. (DSP-02), Santorini, Greece, 2002.
-
(2002)
14th IEEE Int. Conf. Digital Signal Process. (DSP-02)
-
-
Esquef, P.A.A.1
Karjalainen, M.2
Välimäki, V.3
-
18
-
-
0040283968
-
Spontaneous speech effects in large vocabulary speech recognition applications
-
presented at the, New York
-
J. Butzberger, H. Murveit, E. Shriberg, and P. Price, "Spontaneous speech effects in large vocabulary speech recognition applications," presented at the Workshop on Speech and Natural Language, Harimman, New York, 1992.
-
(1992)
Workshop on Speech and Natural Language, Harimman
-
-
Butzberger, J.1
Murveit, H.2
Shriberg, E.3
Price, P.4
-
19
-
-
0028518062
-
Automatic labeling of prosodic patterns
-
Oct
-
C. W. Wightman and M. Ostendorf, "Automatic labeling of prosodic patterns," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 469-481, Oct. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.4
, pp. 469-481
-
-
Wightman, C.W.1
Ostendorf, M.2
-
20
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Aug
-
S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
-
(1980)
IEEE Trans. Acoust., Speech, Signal Process
, vol.ASSP-28
, Issue.4
, pp. 357-366
-
-
Davis, S.B.1
Mermelstein, P.2
-
21
-
-
0030247355
-
Robust speaker recognition-A feature-based approach
-
Sep
-
R. Mammone, X. Zhang, and R. Ramachandran, "Robust speaker recognition-A feature-based approach," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 58-71, Sep. 1996.
-
(1996)
IEEE Signal Process. Mag
, vol.13
, Issue.5
, pp. 58-71
-
-
Mammone, R.1
Zhang, X.2
Ramachandran, R.3
-
24
-
-
0029355999
-
Speaker identification and verification using Gaussian mixture speaker models
-
D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," Speech Commun., vol. 17, pp. 91-108, 1995.
-
(1995)
Speech Commun
, vol.17
, pp. 91-108
-
-
Reynolds, D.A.1
-
25
-
-
0002400882
-
Simplified support vector decision rules
-
presented at the, Bari, Italy
-
C. J. C. Burges, "Simplified support vector decision rules," presented at the 13th Int. Conf. Machine Learning, Bari, Italy, 1996.
-
(1996)
13th Int. Conf. Machine Learning
-
-
Burges, C.J.C.1
-
26
-
-
11244272075
-
Highlight sound effects detection in audio stream
-
presented at the, Baltimore, MD
-
R. Cai, L. Lu, H. J. Zhang, and L. H. Cai, "Highlight sound effects detection in audio stream," presented at the 4th IEEE Int. Conf. Multimedia and Expo, Baltimore, MD, 2003.
-
(2003)
4th IEEE Int. Conf. Multimedia and Expo
-
-
Cai, R.1
Lu, L.2
Zhang, H.J.3
Cai, L.H.4
-
27
-
-
0030364785
-
Automatic transcription of general audio data: Preliminary analyses
-
presented at the, Philadelphia, PA
-
M. Spina and V. Zue, "Automatic transcription of general audio data: preliminary analyses," presented at the Int. Conf. Spoken Lang. Process., Philadelphia, PA, 1996.
-
(1996)
Int. Conf. Spoken Lang. Process
-
-
Spina, M.1
Zue, V.2
-
31
-
-
0016470107
-
An algorithm for determining the endpoints of isolated utterances
-
L. R. Rabiner and M. R. Sambur, "An algorithm for determining the endpoints of isolated utterances," Bell Syst. Tech. J., vol. 54, pp. 297-315, 1975.
-
(1975)
Bell Syst. Tech. J
, vol.54
, pp. 297-315
-
-
Rabiner, L.R.1
Sambur, M.R.2
-
32
-
-
0026368470
-
Automatic recognition of prosodic phrases
-
presented at the, Toronto, ON, Canada
-
C. W. Wightman and M. Ostendorf, "Automatic recognition of prosodic phrases," presented at the IEEE Int. Conf Acoust., Speech, Signal Process., Toronto, ON, Canada, 1991.
-
(1991)
IEEE Int. Conf Acoust., Speech, Signal Process
-
-
Wightman, C.W.1
Ostendorf, M.2
|