메뉴 건너뛰기




Volumn 53, Issue 5, 2011, Pages 726-735

Discrimination of speech from nonspeeech in broadcast news based on modulation frequency features

Author keywords

Higher order singular value decomposition; Modulation spectrum; Mutual information; Speech discrimination

Indexed keywords

A-FRAMES; ACOUSTIC CONDITIONS; AUDIO CONTENT ANALYSIS; AUDIO SIGNAL; BROADCAST NEWS; CEPSTRAL FEATURES; COMPACT SETS; CONTENT-BASED; DETECTION EXPERIMENTS; HIGHER ORDER SINGULAR VALUE DECOMPOSITION; HIGHER ORDER SVD; MODULATION FREQUENCIES; MODULATION SPECTRUM; MUTUAL INFORMATIONS; PRINCIPAL AXES; PROCESSING STEPS; SEGMENT-BASED; SPEAKER SEGMENTATIONS; SPECTRAL FEATURE; SPEECH DISCRIMINATION; SPEECH SEGMENTATION; SPEECH TRANSCRIPTIONS; SVM CLASSIFIERS; TARGET CLASS; VOICE ACTIVITY;

EID: 79953649657     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2010.08.007     Document Type: Article
Times cited : (17)

References (29)
  • 1
    • 34547525965 scopus 로고    scopus 로고
    • Segmental modeling for audio segmentation
    • Hawaii, USA
    • Aronowitz, H., 2007. Segmental modeling for audio segmentation. In: Proceedings of ICASSP 2007, Hawaii, USA, pp. 393-396.
    • (2007) Proceedings of ICASSP 2007 , pp. 393-396
    • Aronowitz, H.1
  • 7
    • 0030711174 scopus 로고    scopus 로고
    • The modulation spectrogram: In pursuit of an invariant representation of speech
    • Greenberg, S., Kingsbury, B., 1997. The modulation spectrogram: in pursuit of an invariant representation of speech. In: Proceedings of ICASSP 1997, Vol. 3, pp. 1647-1650.
    • (1997) Proceedings of ICASSP 1997 , vol.3 , pp. 47-1650
    • Greenberg, S.1    Kingsbury, B.2
  • 8
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • DOI 10.1121/1.399423
    • H. Hermansky Perceptual linear predictive (PLP) analysis of speech JASA 87 4 1990 1738 1752 (Pubitemid 20256470)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 9
    • 25144471298 scopus 로고    scopus 로고
    • Score normalization in multimodal biometric systems
    • DOI 10.1016/j.patcog.2005.01.012, PII S0031320305000592
    • A. Jain, K. Nandakumar, and A. Ross Score normalization in multimodal biometric systems Pattern Recognition 38 2005 2270 2285 (Pubitemid 41336698)
    • (2005) Pattern Recognition , vol.38 , Issue.12 , pp. 2270-2285
    • Jain, A.1    Nandakumar, K.2    Ross, A.3
  • 13
    • 0037708486 scopus 로고    scopus 로고
    • Content-based audio classification and segmentation by using support vector machines
    • L. Lu, H.J. Zhang, and S. Li Content-based audio classification and segmentation by using support vector machines Multimedia Systems 8 2003 482 492
    • (2003) Multimedia Systems , vol.8 , pp. 482-492
    • Lu, L.1    Zhang, H.J.2    Li, S.3
  • 14
    • 33646819697 scopus 로고    scopus 로고
    • Automatic dysphonia recognition using biologically inspired amplitude-modulation features
    • Malyska, N., Quatieri, T.F., Sturim, D., 2005. Automatic dysphonia recognition using biologically inspired amplitude-modulation features. In: Proceedings of ICASSP 2005, pp. 873-876.
    • (2005) Proceedings of ICASSP 2005 , pp. 873-876
    • Malyska, N.1    Quatieri, T.F.2    Sturim, D.3
  • 15
    • 77950996631 scopus 로고    scopus 로고
    • Using modulation spectra for voice pathology detection and classification
    • Markaki, M., Stylianou, Y., 2009. Using modulation spectra for voice pathology detection and classification. In: Proceedings of IEEE EMBC'09.
    • (2009) Proceedings of IEEE EMBC'09
    • Markaki, M.1    Stylianou, Y.2
  • 17
    • 34047272330 scopus 로고    scopus 로고
    • Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations
    • DOI 10.1109/TSA.2005.858055
    • N. Mesgarani, M. Slaney, and S.A. Shamma Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations IEEE Trans. Audio Speech Lang. Process. 14 2006 920 930 (Pubitemid 46547653)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.3 , pp. 920-930
    • Mesgarani, N.1    Slaney, M.2    Shamma, S.A.3
  • 18
    • 24344458137 scopus 로고    scopus 로고
    • Feature selection based on mutual information: Criteria of Max-Dependency, Max-Relevance, and Min-Redundancy
    • DOI 10.1109/TPAMI.2005.159
    • H. Peng, F. Long, and C. Ding Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy IEEE Trans. Pattern Anal. Machine Intell. 27 8 2005 1226 1238 (Pubitemid 41245053)
    • (2005) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.27 , Issue.8 , pp. 1226-1238
    • Peng, H.1    Long, F.2    Ding, C.3
  • 22
    • 0029765670 scopus 로고    scopus 로고
    • Real-time discrimination of broadcast speech/music
    • Saunders, J., 1996. Real-time discrimination of broadcast speech/music. In: Proceedings of ICASSP 1996, pp. 993-996.
    • (1996) Proceedings of ICASSP 1996 , pp. 993-996
    • Saunders, J.1
  • 23
    • 0030648077 scopus 로고    scopus 로고
    • Construction and evaluation of a robust multifeature music/speech discriminator
    • Scheirer, E., Slaney, M., 1997. Construction and evaluation of a robust multifeature music/speech discriminator. In: Proceedings of ICASSP 1997, pp. 1331-1334.
    • (1997) Proceedings of ICASSP 1997 , pp. 1331-1334
    • Scheirer, E.1    Slaney, M.2
  • 24
    • 34547546128 scopus 로고    scopus 로고
    • Feasibility of single channel speaker separation based on modulation frequency analysis
    • Schimmel, S.M., Atlas, L.E., Nie., K., 2007. Feasibility of single channel speaker separation based on modulation frequency analysis. In: Proceedings of ICASSP 2007, pp. 605-608.
    • (2007) Proceedings of ICASSP 2007 , pp. 605-608
    • Schimmel, S.M.1    Atlas, L.E.2    Nie., K.3
  • 26
    • 0030364785 scopus 로고    scopus 로고
    • Automatic transcription of general audio data: Preliminary analysis
    • Spina, M.S., Zue, V.W., 1996. Automatic transcription of general audio data: preliminary analysis. In: Proceedings of ICSLP 1996, pp. 594-597.
    • (1996) Proceedings of ICSLP 1996 , pp. 594-597
    • Spina, M.S.1    Zue, V.W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.