SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 15, Issue 3, 2007, Pages 838-850

An effective algorithm for automatic detection and exact demarcation of breath sounds in speech and song signals

(2) Ruinskiy, Dima a,b Lavner, Yizhar a

a TEL HAI COLLEGE (Israel)

b WEIZMANN INSTITUTE OF SCIENCE (Israel)

Author keywords

Breath detection; Event spotting in speech and audio; Mel frequency cepstral coefficient (MFCC)

Indexed keywords

AESTHETIC QUALITIES; AUDIO SIGNALS; AUTOMATIC ALGORITHMS; AUTOMATIC DETECTIONS; BREATH DETECTION; BREATH SOUNDS; EDGE DETECTION ALGORITHMS; EFFECTIVE ALGORITHMS; EVENT SPOTTING IN SPEECH AND AUDIO; FALSE DETECTIONS; FREQUENCY-DOMAIN PARAMETERS; IDENTIFICATION RATES; MEL FREQUENCY CEPSTRAL COEFFICIENT (MFCC); MUSIC INDUSTRIES; SINGULAR VALUES; THREE PHASIS; TIME DOMAINS; TIME FRAMES;

ALGORITHMS; EDGE DETECTION; SIGNAL PROCESSING; SINGULAR VALUE DECOMPOSITION; SPEECH RECOGNITION; SPEECH TRANSMISSION; TEMPLATE MATCHING;

AUDIO ACOUSTICS;

EID: 64149099357 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.889750 Document Type: Article

Times cited : (64)

References (32)

1
- 0003786003
- Cambridge, MA: MIT Press
- F. Jelinek, Statistical Methods for Speech Recognition. Cambridge, MA: MIT Press, 1998.
- (1998) Statistical Methods for Speech Recognition
- Jelinek, F.¹

2
- 0004244302
- Englewood Cliffs, NJ: Prentice-Hall
- L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.H.²

3
- 0032646977
- An overview of audio information retrieval
- J. T. Foote, "An overview of audio information retrieval," Multimedia Syst., vol. 7, pp. 2-10, 1999.
- (1999) Multimedia Syst , vol.7 , pp. 2-10
- Foote, J.T.¹

4
- 0037622306
- Enhancing sonic browsing using audio information retrieval
- presented at the, Kyoto, Japan, unpublished
- E. Brazil, M. Fernstrom, G. Tzanetakis, and P. Cook, "Enhancing sonic browsing using audio information retrieval," presented at the Int. Conf. Auditory Display (ICAD), Kyoto, Japan, 2002, unpublished.
- (2002) Int. Conf. Auditory Display (ICAD)
- Brazil, E.¹ Fernstrom, M.² Tzanetakis, G.³ Cook, P.⁴

5
- 0037491736
- Audio Information Retrieval (AIR) tools
- presented at the, Music Information Retrieval ISMIR, Plymouth, MA
- G. Tzanetakis and P. Cook, "Audio Information Retrieval (AIR) tools," presented at the Int. Symp. Music Information Retrieval (ISMIR), Plymouth, MA, 2000.
- (2000) Int. Symp. Music Information Retrieval (ISMIR)
- Tzanetakis, G.¹ Cook, P.²

6
- 0141867819
- Concept framework for audio information retrieval: ARF
- G. H. Li, D. F. Wu, and J. Zhang, "Concept framework for audio information retrieval: ARF," J. Comput. Sci. Technol., vol. 18, pp. 667-673, 2003.
- (2003) J. Comput. Sci. Technol , vol.18 , pp. 667-673
- Li, G.H.¹ Wu, D.F.² Zhang, J.³

7
- 34547550979
- Soundspotter-a prototype system for content based audio retrieval
- presented at the, Hamburg, Germany
- C. Spevak and E. Favreau, "Soundspotter-a prototype system for content based audio retrieval," presented at the Int. Conf. Digital Audio Effects (DAFx-02), Hamburg, Germany, 2002.
- (2002) Int. Conf. Digital Audio Effects (DAFx-02)
- Spevak, C.¹ Favreau, E.²

8
- 0029725603
- Keyword spotting for video soundtrack indexing
- presented at the
- P. Gelin and C. J. Wellekens, "Keyword spotting for video soundtrack indexing," presented at the IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP-96), 1996.
- (1996) IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP-96)
- Gelin, P.¹ Wellekens, C.J.²

9
- 0024899342
- Spotting Japanese CV-syllables and phonemes using the time-delay neural networks
- presented at the, ICASSP-, Glasgow, U.K
- H. Sawai, A. Waibel, M. Miyatake, and K. Shikano, "Spotting Japanese CV-syllables and phonemes using the time-delay neural networks," presented at the Int. Conf. Acoust., Speech, Signal Process. (ICASSP-89), Glasgow, U.K., 1989.
- (1989) Int. Conf. Acoust., Speech, Signal Process. (ICASSP-89)
- Sawai, H.¹ Waibel, A.² Miyatake, M.³ Shikano, K.⁴

10
- 78651256562
- Selective phoneme spotting for realisation of an /s, z, C, t/ transpose
- Linz, Austria: Springer, 2398
- D. Bauer, A. Plinge, and M. Finke, "Selective phoneme spotting for realisation of an /s, z, C, t/ transpose," in Lecture Notes in Computer Science-ICHHP 2002. Linz, Austria: Springer, 2002, vol. 2398.
- (2002) Lecture Notes in Computer Science-ICHHP 2002 , vol.2398
- Bauer, D.¹ Plinge, A.² Finke, M.³

11
- 64149093868
- Introducing restoration of selectivity in hearing instrument design through phoneme spotting
- Assistive Technology: Shaping the Future, G. M. Craddock, L. P. McCormack, R. B. Reilly, and H. Knops, Eds. Amsterdam, The Netherlands: IOS Press
- A. Plinge and D. Bauer, "Introducing restoration of selectivity in hearing instrument design through phoneme spotting," in Assistive Technology: Shaping the Future, ser. Assistive Technology Research Series, G. M. Craddock, L. P. McCormack, R. B. Reilly, and H. Knops, Eds. Amsterdam, The Netherlands: IOS Press, 2003, vol. 11.
- (2003) ser. Assistive Technology Research Series , vol.11
- Plinge, A.¹ Bauer, D.²

12
- 33745214121
- Laughter detection in meetings
- presented at the, Montreal, QC, Canada
- L. Kennedy and D. Ellis, "Laughter detection in meetings," presented at the NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, QC, Canada, 2004.
- (2004) NIST ICASSP 2004 Meeting Recognition Workshop
- Kennedy, L.¹ Ellis, D.²

13
- 50249092288
- Prosody and parsing
- presented at the, Cape Cod, MA
- P. J. Price, M. Ostendorf, and C. W. Wightman, "Prosody and parsing," presented at the DARPA Workshop on Speech and Natural Language, Cape Cod, MA, 1989.
- (1989) DARPA Workshop on Speech and Natural Language
- Price, P.J.¹ Ostendorf, M.² Wightman, C.W.³

14
- 0031624947
- Mach1: Nonuniform time-scale modification of speech
- presented at the
- M. Covell, M. Withgott, and M. Slaney, "Mach1: Nonuniform time-scale modification of speech," presented at the IEEE ICASSP-98, Seattle, WA, 1998.
- (1998) IEEE ICASSP-98, Seattle, WA
- Covell, M.¹ Withgott, M.² Slaney, M.³

15
- 84946748692
- Pitch-based emphasis detection for characterization of meeting recordings
- presented at the
- L. Kennedy and D. Ellis, "Pitch-based emphasis detection for characterization of meeting recordings," presented at the Automatic Speech Recognition Understanding Workshop (IEEE ASRU 2003), St. Thomas, VI, 2003.
- (2003) Automatic Speech Recognition Understanding Workshop (IEEE ASRU 2003), St. Thomas, VI
- Kennedy, L.¹ Ellis, D.²

16
- 64149128912
- Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording
- U.S. Patent 6 161 087, Dec. 12
- C. W. Wightman and J. Bachenko, "Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording," U.S. Patent 6 161 087, Dec. 12, 2000.
- (2000)
- Wightman, C.W.¹ Bachenko, J.²

17
- 77951016396
- Detection of clicks in audio signals using warped linear prediction
- presented at the, Santorini, Greece
- P. A. A. Esquef, M. Karjalainen, and V. Välimäki, "Detection of clicks in audio signals using warped linear prediction," presented at the 14th IEEE Int. Conf. Digital Signal Process. (DSP-02), Santorini, Greece, 2002.
- (2002) 14th IEEE Int. Conf. Digital Signal Process. (DSP-02)
- Esquef, P.A.A.¹ Karjalainen, M.² Välimäki, V.³

18
- 0040283968
- Spontaneous speech effects in large vocabulary speech recognition applications
- presented at the, New York
- J. Butzberger, H. Murveit, E. Shriberg, and P. Price, "Spontaneous speech effects in large vocabulary speech recognition applications," presented at the Workshop on Speech and Natural Language, Harriman, New York, 1992.
- (1992) Workshop on Speech and Natural Language, Harriman
- Butzberger, J.¹ Murveit, H.² Shriberg, E.³ Price, P.⁴

19
- 0028518062
- Automatic labeling of prosodic patterns
- Oct
- C. W. Wightman and M. Ostendorf, "Automatic labeling of prosodic patterns," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 469-481, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 469-481
- Wightman, C.W.¹ Ostendorf, M.²

20
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

21
- 0030247355
- Robust speaker recognition-A feature-based approach
- Sep
- R. Mammone, X. Zhang, and R. Ramachandran, "Robust speaker recognition-A feature-based approach," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 58-71, Sep. 1996.
- (1996) IEEE Signal Process. Mag , vol.13 , Issue.5 , pp. 58-71
- Mammone, R.¹ Zhang, X.² Ramachandran, R.³

22
- 0003927842
- Upper Saddle River, NJ: Prentice-Hall
- T. F. Quatieri, Discrete-Time Speech Signal Processing. Upper Saddle River, NJ: Prentice-Hall, 2001.
- (2001) Discrete-Time Speech Signal Processing
- Quatieri, T.F.¹

23
- 0003798635
- Cambridge, U.K, Cambridge Univ. Press
- N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. Cambridge, U.K.: Cambridge Univ. Press, 2000.
- (2000) An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods
- Cristianini, N.¹ Shawe-Taylor, J.²

24
- 0029355999
- Speaker identification and verification using Gaussian mixture speaker models
- D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," Speech Commun., vol. 17, pp. 91-108, 1995.
- (1995) Speech Commun , vol.17 , pp. 91-108
- Reynolds, D.A.¹

25
- 0002400882
- Simplified support vector decision rules
- presented at the, Bari, Italy
- C. J. C. Burges, "Simplified support vector decision rules," presented at the 13th Int. Conf. Machine Learning, Bari, Italy, 1996.
- (1996) 13th Int. Conf. Machine Learning
- Burges, C.J.C.¹

26
- 11244272075
- Highlight sound effects detection in audio stream
- presented at the, Baltimore, MD
- R. Cai, L. Lu, H. J. Zhang, and L. H. Cai, "Highlight sound effects detection in audio stream," presented at the 4th IEEE Int. Conf. Multimedia and Expo, Baltimore, MD, 2003.
- (2003) 4th IEEE Int. Conf. Multimedia and Expo
- Cai, R.¹ Lu, L.² Zhang, H.J.³ Cai, L.H.⁴

27
- 0030364785
- Automatic transcription of general audio data: Preliminary analyses
- presented at the, Philadelphia, PA
- M. Spina and V. Zue, "Automatic transcription of general audio data: preliminary analyses," presented at the Int. Conf. Spoken Lang. Process., Philadelphia, PA, 1996.
- (1996) Int. Conf. Spoken Lang. Process
- Spina, M.¹ Zue, V.²

28
- 0004172718
- London, U.K, Academic
- S. Theodoridis and K. Koutroumbas, Pattern Recognition. London, U.K.: Academic, 1999.
- (1999) Pattern Recognition
- Theodoridis, S.¹ Koutroumbas, K.²

29
- 0003425258
- Englewood Cliffs, NJ: Prentice-Hall
- L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1978.
- (1978) Digital Processing of Speech Signals
- Rabiner, L.R.¹ Schafer, R.W.²

30
- 0003922190
- 2nd ed. New York: Wiley
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, 2nd ed. New York: Wiley, 2001.
- (2001) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

31
- 0016470107
- An algorithm for determining the endpoints of isolated utterances
- L. R. Rabiner and M. R. Sambur, "An algorithm for determining the endpoints of isolated utterances," Bell Syst. Tech. J., vol. 54, pp. 297-315, 1975.
- (1975) Bell Syst. Tech. J , vol.54 , pp. 297-315
- Rabiner, L.R.¹ Sambur, M.R.²

32
- 0026368470
- Automatic recognition of prosodic phrases
- presented at the, Toronto, ON, Canada
- C. W. Wightman and M. Ostendorf, "Automatic recognition of prosodic phrases," presented at the IEEE Int. Conf. Acoust., Speech, Signal Process., Toronto, ON, Canada, 1991.
- (1991) IEEE Int. Conf. Acoust., Speech, Signal Process
- Wightman, C.W.¹ Ostendorf, M.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.