메뉴 건너뛰기




Volumn 24, Issue 2, 2010, Pages 341-357

A wavelet-based parameterization for speech/music discrimination

Author keywords

Dynamic parameters; Long term parameters; Segmentation; Speech music discrimination; Static parameters; Wavelets

Indexed keywords

DYNAMIC PARAMETERS; LONG-TERM PARAMETERS; SEGMENTATION; SPEECH/MUSIC DISCRIMINATION; STATIC PARAMETERS; WAVELETS;

EID: 70349238685     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2009.05.003     Document Type: Article
Times cited : (43)

References (51)
  • 1
    • 0037401304 scopus 로고    scopus 로고
    • Speech/music discrimination using entropy and dynamism features in a HMM classification framework
    • Ajmera J., McCowan I., and Bourlard H. Speech/music discrimination using entropy and dynamism features in a HMM classification framework. Speech Communication 40 (2003) 351-363
    • (2003) Speech Communication , vol.40 , pp. 351-363
    • Ajmera, J.1    McCowan, I.2    Bourlard, H.3
  • 2
    • 33947319848 scopus 로고    scopus 로고
    • Application of fisher linear discriminant analysis to speech/music classification
    • Alexandre-Cortizo, E., Rosa-Zurera, M., Lopez-Ferreras, F., 2005. Application of fisher linear discriminant analysis to speech/music classification. In: IEEE Eurocon, pp. 1666-1669.
    • (2005) IEEE Eurocon , pp. 1666-1669
    • Alexandre-Cortizo, E.1    Rosa-Zurera, M.2    Lopez-Ferreras, F.3
  • 12
    • 33745182925 scopus 로고    scopus 로고
    • Automatic music genre classification using second-order statistical measures for the prescriptive approach
    • Ezzaidi, H., Rouat, J., 2005. Automatic music genre classification using second-order statistical measures for the prescriptive approach. In: Proc. European Conf. on Speech Communication and Technology, pp. 141-144.
    • (2005) Proc. European Conf. on Speech Communication and Technology , pp. 141-144
    • Ezzaidi, H.1    Rouat, J.2
  • 13
    • 70349263032 scopus 로고    scopus 로고
    • Segmentation en macro-classes acoustiques d'Tmissions radiophoniques dans le cadre d'ESTER
    • Fredouille, C., Matrouf, D., Linares, G., Nocera, P., 2004. Segmentation en macro-classes acoustiques d'Tmissions radiophoniques dans le cadre d'ESTER. In: JournTes d'Etude sur la Parole - JEP04.
    • (2004) JournTes d'Etude sur la Parole
    • Fredouille, C.1    Matrouf, D.2    Linares, G.3    Nocera, P.4
  • 15
    • 0036567851 scopus 로고    scopus 로고
    • The LIMSI broadcast news transcription system
    • Gauvain J.-L., Lamel L., and Adda G. The LIMSI broadcast news transcription system. Speech Communication 37 1 (2002) 89-108
    • (2002) Speech Communication , vol.37 , Issue.1 , pp. 89-108
    • Gauvain, J.-L.1    Lamel, L.2    Adda, G.3
  • 24
    • 34247239858 scopus 로고    scopus 로고
    • Speech/music discrimination based on spectral peak analysis and multi-layer perceptron
    • Keum, J.S., Lee, H.S., 2006. Speech/music discrimination based on spectral peak analysis and multi-layer perceptron. In: International Conference on Hybrid Information Technology, vol. 2, pp. 56-61.
    • (2006) International Conference on Hybrid Information Technology , vol.2 , pp. 56-61
    • Keum, J.S.1    Lee, H.S.2
  • 25
    • 33746879922 scopus 로고    scopus 로고
    • Machine learning-based classification of speech and music
    • Khan M., and Al-Khatib W.G. Machine learning-based classification of speech and music. Multi-Media Systems 12 (2006) 55-67
    • (2006) Multi-Media Systems , vol.12 , pp. 55-67
    • Khan, M.1    Al-Khatib, W.G.2
  • 26
    • 0034845044 scopus 로고    scopus 로고
    • Kim, I.J., Yang, S.I., Kwon, Y., 2001. Speech Enhancement using Adaptive Wavelet Shrinkage. In: ISIE-2001, 1, pp. 501-504.
    • Kim, I.J., Yang, S.I., Kwon, Y., 2001. Speech Enhancement using Adaptive Wavelet Shrinkage. In: ISIE-2001, vol. 1, pp. 501-504.
  • 34
    • 13144306118 scopus 로고    scopus 로고
    • A speech/music discriminator based on RMS and zero-crossings
    • Panagiotakis, C., Tziritas, G., 2005. A speech/music discriminator based on RMS and zero-crossings. In: IEEE Transaction on Multimedia, vol. 7(1), pp. 155-166.
    • (2005) IEEE Transaction on Multimedia , vol.7 , Issue.1 , pp. 155-166
    • Panagiotakis, C.1    Tziritas, G.2
  • 39
    • 27644502441 scopus 로고    scopus 로고
    • Image compression from DCT to wavelets: a review
    • Saha S. Image compression from DCT to wavelets: a review. ACM Crossroads 6 3 (2000) 644-651
    • (2000) ACM Crossroads , vol.6 , Issue.3 , pp. 644-651
    • Saha, S.1
  • 40
    • 0033688848 scopus 로고    scopus 로고
    • High resolution speech feature parameterization for monophone-based stressed speech recognition
    • Sarikaya R., and Hansen J.H.L. High resolution speech feature parameterization for monophone-based stressed speech recognition. IEEE Signal Processing Letters 7 7 (2000) 182-185
    • (2000) IEEE Signal Processing Letters , vol.7 , Issue.7 , pp. 182-185
    • Sarikaya, R.1    Hansen, J.H.L.2
  • 43
    • 45849121392 scopus 로고    scopus 로고
    • Detection of speech and music based on spectral tracking
    • Taniguchi, T., Tohyama, M., Katsuhiko, S., 2008. Detection of speech and music based on spectral tracking. In: Speech Communication, vol. 50, pp. 547-563.
    • (2008) Speech Communication , vol.50 , pp. 547-563
    • Taniguchi, T.1    Tohyama, M.2    Katsuhiko, S.3
  • 45
    • 84892175859 scopus 로고    scopus 로고
    • Automatic speech recognition based on ceptral coefficients and a mel-based discrete energy operator
    • ICASSP, pp
    • Tolba, H., O'Shaughnessy, D., 1998. Automatic speech recognition based on ceptral coefficients and a mel-based discrete energy operator. In: Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing, ICASSP, pp. 973-976.
    • (1998) Proc. IEEE Int. Conf. on Acoustic, Speech and Signal Processing , pp. 973-976
    • Tolba, H.1    O'Shaughnessy, D.2
  • 47
    • 16244420091 scopus 로고    scopus 로고
    • Multigroup classification of audio signals using time-frequency parameters
    • Umapathy K., Krishnan S., and Jimaa S. Multigroup classification of audio signals using time-frequency parameters. IEEE Transaction on Multimedia 7 2 (2005) 308-315
    • (2005) IEEE Transaction on Multimedia , vol.7 , Issue.2 , pp. 308-315
    • Umapathy, K.1    Krishnan, S.2    Jimaa, S.3
  • 51
    • 0035340677 scopus 로고    scopus 로고
    • Audio content analysis for online audiovisual data segmentation and classification
    • Zhang, T., Kuo, C.-C.J., 2001. Audio content analysis for online audiovisual data segmentation and classification. In: IEEE Transactions on Speech and Audio Processing, vol. 9(4), pp. 441-457.
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.4 , pp. 441-457
    • Zhang, T.1    Kuo, C.-C.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.