메뉴 건너뛰기




Volumn 15, Issue 3, 2007, Pages 838-850

An effective algorithm for automatic detection and exact demarcation of breath sounds in speech and song signals

Author keywords

Breath detection; Event spotting in speech and audio; Mel frequency cepstral coefficient (MFCC)

Indexed keywords

AESTHETIC QUALITIES; AUDIO SIGNALS; AUTOMATIC ALGORITHMS; AUTOMATIC DETECTIONS; BREATH DETECTION; BREATH SOUNDS; EDGE DETECTION ALGORITHMS; EFFECTIVE ALGORITHMS; EVENT SPOTTING IN SPEECH AND AUDIO; FALSE DETECTIONS; FREQUENCY-DOMAIN PARAMETERS; IDENTIFICATION RATES; MEL FREQUENCY CEPSTRAL COEFFICIENT (MFCC); MUSIC INDUSTRIES; SINGULAR VALUES; THREE PHASIS; TIME DOMAINS; TIME FRAMES;

EID: 64149099357     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.889750     Document Type: Article
Times cited : (64)

References (32)
  • 3
    • 0032646977 scopus 로고    scopus 로고
    • An overview of audio information retrieval
    • J. T. Foote, "An overview of audio information retrieval," Multimedia Syst., vol. 7, pp. 2-10, 1999.
    • (1999) Multimedia Syst , vol.7 , pp. 2-10
    • Foote, J.T.1
  • 4
    • 0037622306 scopus 로고    scopus 로고
    • Enhancing sonic browsing using audio information retrieval
    • presented at the, Kyoto, Japan, unpublished
    • E. Brazil, M. Fernstrom, G. Tzanetakis, and P. Cook, "Enhancing sonic browsing using audio information retrieval," presented at the Int. Conf. Auditory Display (ICAD), Kyoto, Japan, 2002, unpublished.
    • (2002) Int. Conf. Auditory Display (ICAD)
    • Brazil, E.1    Fernstrom, M.2    Tzanetakis, G.3    Cook, P.4
  • 5
    • 0037491736 scopus 로고    scopus 로고
    • Audio Information Retrieval (AIR) tools
    • presented at the, Music Information Retrieval ISMIR, Plymouth, MA
    • G. Tzanetakis and P. Cook, "Audio Information Retrieval (AIR) tools," presented at the Int. Symp. Music Information Retrieval (ISMIR), Plymouth, MA, 2000.
    • (2000) Int. Symp
    • Tzanetakis, G.1    Cook, P.2
  • 6
    • 0141867819 scopus 로고    scopus 로고
    • Concept framework for audio information retrieval: ARF
    • G. H. Li, D. F.Wu, and J. Zhang, "Concept framework for audio information retrieval: ARF," J. Comput. Sci. Technol., vol. 18, pp. 667-673, 2003.
    • (2003) J. Comput. Sci. Technol , vol.18 , pp. 667-673
    • Li, G.H.1    Wu, D.F.2    Zhang, J.3
  • 7
    • 34547550979 scopus 로고    scopus 로고
    • Soundspotter-a prototype system for content based audio retrieval
    • presented at the, Hamburg, Germany
    • C. Spevak and E. Favreau, "Soundspotter-a prototype system for content based audio retrieval," presented at the Int. Conf. Digital Audio Effects (DAFx-02), Hamburg, Germany, 2002.
    • (2002) Int. Conf. Digital Audio Effects (DAFx-02)
    • Spevak, C.1    Favreau, E.2
  • 9
    • 0024899342 scopus 로고
    • Spotting Japanese CV-syllables and phonemes using the time-delay neural networks
    • presented at the, ICASSP-, Glasgow, U.K
    • H. Sawai, A.Waibel, M. Miyatake, and K. Shikano, "Spotting Japanese CV-syllables and phonemes using the time-delay neural networks," presented at the Int. Conf. Acoust., Speech, Signal Process. (ICASSP- 89), Glasgow, U.K., 1989.
    • (1989) Int. Conf. Acoust., Speech, Signal Process , pp. 89
    • Sawai, H.1    Waibel, A.2    Miyatake, M.3    Shikano, K.4
  • 10
    • 78651256562 scopus 로고    scopus 로고
    • Selective phoneme spotting for realisation of an /s, z, C, t/ transpose
    • Linz, Austria: Springer, 2398
    • D. Bauer, A. Plinge, and M. Finke, "Selective phoneme spotting for realisation of an /s, z, C, t/ transpose," in Lecture Notes in Computer Science-ICHHP 2002. Linz, Austria: Springer, 2002, vol. 2398.
    • (2002) Lecture Notes in Computer Science-ICHHP , vol.2002
    • Bauer, D.1    Plinge, A.2    Finke, M.3
  • 11
    • 64149093868 scopus 로고    scopus 로고
    • Introducing restoration of selectivity in hearing instrument design through phoneme spotting
    • Assistive Technology: Shaping the Future, G. M. Craddock, L. P. McCormack, R. B. Reilly, and H. Knops, Eds. Amsterdam, The Netherlands: IOS Press
    • A. Plinge and D. Bauer, "Introducing restoration of selectivity in hearing instrument design through phoneme spotting," in Assistive Technology: Shaping the Future, ser. Assistive Technology Research Series, G. M. Craddock, L. P. McCormack, R. B. Reilly, and H. Knops, Eds. Amsterdam, The Netherlands: IOS Press, 2003, vol. 11.
    • (2003) ser. Assistive Technology Research Series , vol.11
    • Plinge, A.1    Bauer, D.2
  • 16
    • 64149128912 scopus 로고    scopus 로고
    • Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording,
    • U.S. Patent 6 161 087, Dec. 12
    • C. W.Wightman and J. Bachenko, "Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording," U.S. Patent 6 161 087, Dec. 12, 2000.
    • (2000)
    • Wightman, C.W.1    Bachenko, J.2
  • 19
    • 0028518062 scopus 로고
    • Automatic labeling of prosodic patterns
    • Oct
    • C. W. Wightman and M. Ostendorf, "Automatic labeling of prosodic patterns," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 469-481, Oct. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.4 , pp. 469-481
    • Wightman, C.W.1    Ostendorf, M.2
  • 20
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 21
    • 0030247355 scopus 로고    scopus 로고
    • Robust speaker recognition-A feature-based approach
    • Sep
    • R. Mammone, X. Zhang, and R. Ramachandran, "Robust speaker recognition-A feature-based approach," IEEE Signal Process. Mag., vol. 13, no. 5, pp. 58-71, Sep. 1996.
    • (1996) IEEE Signal Process. Mag , vol.13 , Issue.5 , pp. 58-71
    • Mammone, R.1    Zhang, X.2    Ramachandran, R.3
  • 24
    • 0029355999 scopus 로고
    • Speaker identification and verification using Gaussian mixture speaker models
    • D. A. Reynolds, "Speaker identification and verification using Gaussian mixture speaker models," Speech Commun., vol. 17, pp. 91-108, 1995.
    • (1995) Speech Commun , vol.17 , pp. 91-108
    • Reynolds, D.A.1
  • 25
    • 0002400882 scopus 로고    scopus 로고
    • Simplified support vector decision rules
    • presented at the, Bari, Italy
    • C. J. C. Burges, "Simplified support vector decision rules," presented at the 13th Int. Conf. Machine Learning, Bari, Italy, 1996.
    • (1996) 13th Int. Conf. Machine Learning
    • Burges, C.J.C.1
  • 27
    • 0030364785 scopus 로고    scopus 로고
    • Automatic transcription of general audio data: Preliminary analyses
    • presented at the, Philadelphia, PA
    • M. Spina and V. Zue, "Automatic transcription of general audio data: preliminary analyses," presented at the Int. Conf. Spoken Lang. Process., Philadelphia, PA, 1996.
    • (1996) Int. Conf. Spoken Lang. Process
    • Spina, M.1    Zue, V.2
  • 31
    • 0016470107 scopus 로고
    • An algorithm for determining the endpoints of isolated utterances
    • L. R. Rabiner and M. R. Sambur, "An algorithm for determining the endpoints of isolated utterances," Bell Syst. Tech. J., vol. 54, pp. 297-315, 1975.
    • (1975) Bell Syst. Tech. J , vol.54 , pp. 297-315
    • Rabiner, L.R.1    Sambur, M.R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.