Volume 20, Issue 4, 2006, Pages 468-494

A study in machine learning from imbalanced data for sentence boundary detection in speech

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL METHODS; DECISION THEORY; LEARNING SYSTEMS; MARKOV PROCESSES; MATHEMATICAL MODELS; SPEECH ANALYSIS

EID: 33746529930     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2005.06.002     Document Type: Article
Times cited: 96

References (62)
  • 1
    • 84892144790
    • Beeferman, D., Berger, A., Lafferty, J., 1998. Cyberpunc: A lightweight punctuation annotation system for speech. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, 1998.
  • 3
    • 0031191630
    • The use of the area under the ROC curve in the evaluation of machine learning algorithms
    • Bradley A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30 6 (1997) 1145-1159
    • (1997) Pattern Recognition , vol.30 , Issue.6 , pp. 1145-1159
    • Bradley, A.P.1
  • 4
    • 0030211964
    • Bagging predictors
    • Breiman L. Bagging predictors. Machine Learning 24 2 (1996) 123-140
    • (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 6
    • 33746579432
    • Campbell, W.N., 1993. Durational cues to prominence and grouping. In: Proceedings of ECSA Workshop on Prosody, Lund, Sweden, pp. 33-41.
  • 7
    • 33746502764
    • Chan, P., Stolfo, S., 1998. Toward scalable learning with non-uniform class and cost distributions: A case study in credit card fraud detection. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp. 164-168.
  • 9
    • 33746488491
    • Chawla, N.V., Japkowicz, N., Kolcz, A., (August 2003). Workshop on learning from imbalanced datasets II. In: Proceedings of the 20th International Conference on Machine Learning.
  • 11
    • 33746546924
    • Chawla, N.V., 2003. C4.5 and imbalanced datasets: Investigating the effect of sampling method, probabilistic estimate, and decision tree structure. In: Proceedings of the ICML'03 Workshop on Class Imbalances.
  • 12
    • 33746527167
    • Chen, C.J., 1999. Speech recognition with automatic punctuation. In: Proceedings of the European Conference on Speech Communication and Technology, pp. 447-450.
  • 13
    • 33746514709
    • Christensen, H., Gotoh, Y., Renals, S., 2001. Punctuation annotation using statistical prosody models. In: ISCA Workshop on Prosody in Speech Recognition and Understanding.
  • 14
    • 33746559398
    • DARPA, 2003. Information Processing Technology Office, Effective, Affordable, Reusable Speech-to-text (EARS). Available from: .
  • 15
    • 0028088647
    • On the perceptual strength of prosodic boundaries and its relation to suprasegmental cues
    • De Pijper J.R., and Sanderman A.A. On the perceptual strength of prosodic boundaries and its relation to suprasegmental cues. Journal of the Acoustical Society of America 96 4 (1994) 2037-2047
    • (1994) Journal of the Acoustical Society of America , vol.96 , Issue.4 , pp. 2037-2047
    • De Pijper, J.R.1    Sanderman, A.A.2
  • 16
    • 0034250160
    • An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization
    • Dietterich T.G. An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization. Machine Learning 40 2 (2000) 139-157
    • (2000) Machine Learning , vol.40 , Issue.2 , pp. 139-157
    • Dietterich, T.G.1
  • 17
    • 33746493052
    • Drummond, C., Holte, R., 2003. C4.5, class imbalance, and cost sensitivity: Why under-sampling beats over-sampling. In: Proceedings of ICML'03 Workshop on Learning from Imbalanced Datasets.
  • 19
    • 33746513427
    • Ferrer, L., 2002. Prosodic features for the Switchboard database, Tech. rep., SRI International.
  • 20
    • 33746506317
    • Freund, Y., Schapire, R., 1996. Experiments with a new boosting algorithm. In: Machine Learning: Proceedings of the Thirteenth National Conference, pp. 148-156.
  • 21
    • 58149321460
    • Boosting a weak learning algorithm by majority
    • Freund Y. Boosting a weak learning algorithm by majority. Information and Computation (1996) 256-285
    • (1996) Information and Computation , pp. 256-285
    • Freund, Y.1
  • 22
    • 33746512245
    • Gotoh, Y., Renals, S., 2000. Sentence boundary detection in broadcast speech transcripts. In: Proceedings of ISCA Workshop: Automatic Speech Recognition: Challenges for the New Millennium ASR-2000, pp. 228-235.
  • 24
    • 33746563699
    • Hirst, D., 1993. Peak, boundary and cohesion characteristics of prosodic grouping. In: Proceedings of ECSA Workshop on Prosody, Lund, Sweden, pp. 32-37.
  • 25
    • 85009291541
    • Huang, J., Zweig, G., 2002. Maximum entropy model for punctuation annotation from speech. In: Proceedings of the International Conference on Spoken Language Processing, pp. 917-920.
  • 26
    • 33845536164
    • The class imbalance problem: A systematic study
    • Japkowicz N., and Stephen S. The class imbalance problem: A systematic study. Intelligent Data Analysis 6 5 (2002) 429-450
    • (2002) Intelligent Data Analysis , vol.6 , Issue.5 , pp. 429-450
    • Japkowicz, N.1    Stephen, S.2
  • 27
    • 84919457977
    • Kim, J., Woodland, P.C., 2001. The use of prosody in a combined system for punctuation generation and speech recognition. In: Proceedings of the European Conference on Speech Communication and Technology, pp. 2757-2760.
  • 29
    • 33746553975
    • Kubat, M., Matwin, S., 1997. Addressing the curse of imbalanced training sets. In: Proceedings of the International Conference on Machine Learning, pp. 179-186.
  • 30
    • 84947918649
    • Kubat, M., Holte, R., Matwin, S., 1997. Learning when negative examples abound. In: Proceedings of European Conference on Machine Learning, pp. 146-153.
  • 31
    • 33746544432
    • Laurikkala, J., 2001. Improving identification of difficult small classes by balancing class distribution, Tech. rep., Department of Computer and Information Science, University of Tampere, Finland.
  • 32
    • 0034726260
    • Noisy replication in skewed binary classification
    • Lee S. Noisy replication in skewed binary classification. Computational Statistics and Data Analysis 34 (2000) 165-191
    • (2000) Computational Statistics and Data Analysis , vol.34 , pp. 165-191
    • Lee, S.1
  • 33
    • 0030355756
    • Lickley, R., Bard, E., 1996. On not recognizing disfluencies in dialog. In: Proceedings of the International Conference on Spoken Language Processing, pp. 1876-1879.
  • 34
    • 33746508105
    • Ling, C., Li, C., 1998. Data mining for direct marketing: Problems and solutions. In: Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pp. 73-79.
  • 35
    • 85009223733
    • Liu, Y., Shriberg, E., Stolcke, A., 2003. Automatic disfluency identification in conversational speech using multiple knowledge sources. In: Proceedings of the European Conference on Speech Communication and Technology, pp. 957-960.
  • 36
    • 33746494662
    • Liu, Y., Stolcke, A., Shriberg, E., Harper, M., 2004. Comparing and combining generative and posterior probability models: Some advances in sentence boundary detection in speech. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing.
  • 37
    • 33746510236
    • Liu, Y., Shriberg, E., Stolcke, A., Peskin, B., Harper, M., 2004. The ICSI/SRI/UW RT04 structural metadata extraction system. In: EARS Rich Transcription Workshop.
  • 38
    • 85009142186
    • Liu, Y., Shriberg, E., Stolcke, A., Harper, M., 2004. Using machine learning to cope with imbalanced classes in natural speech: Evidence from sentence boundary and disfluency detection. In: Proceedings of the International Conference on Spoken Language Processing.
  • 39
    • 33746563134
    • Liu, Y., 2004. Structural event detection for rich transcription of speech, Ph.D. thesis, Purdue University.
  • 41
    • 33746549964
    • National Institute of Standards and Technology, 2003. RT-03F evaluation, http://www.nist.gov/speech/tests/rt/rt2003/fall/rt03f-evaldiscdoc/index.htm.
  • 42
    • 33746481099
    • National Institute of Standards and Technology, (Nov. 2003) RT-03F workshop agenda and presentations, http://www.nist.gov/speech/tests/rt/rt2003/fall/presentations/.
  • 43
    • 33746482392
    • Ostendorf, M., Hillard, D., 2004. Scoring structural MDE: Towards more meaningful error rates. In: EARS Rich Transcription Workshop.
  • 44
    • 33746477166
    • Palmer, D.D., Hearst, M.A., 1994. Adaptive sentence boundary disambiguation. In: Proceedings of the Fourth ACL Conference on Applied Natural Language Processing, pp. 78-83.
  • 45
    • 33746520295
    • Potisuk, S., 1995. Prosodic disambiguation in automatic speech understanding of Thai, Ph.D. thesis, Purdue University.
  • 47
    • 0035283313
    • Robust classification for imprecise environments
    • Provost F., and Fawcett T. Robust classification for imprecise environments. Machine Learning 42 3 (2001) 203-231
    • (2001) Machine Learning , vol.42 , Issue.3 , pp. 203-231
    • Provost, F.1    Fawcett, T.2
  • 48
    • 0022594196
    • An introduction to hidden Markov models
    • Rabiner L.R., and Juang B.H. An introduction to hidden Markov models. IEEE ASSP Magazine 3 1 (1986) 4-16
    • (1986) IEEE ASSP Magazine , vol.3 , Issue.1 , pp. 4-16
    • Rabiner, L.R.1    Juang, B.H.2
  • 49
    • 33746484164
    • Reynar, J., Ratnaparkhi, A., 1997. A maximum entropy approach to identifying sentence boundaries. In: Proceedings of the Fifth Conference on Applied Natural Language Processing, Washington, DC, pp. 16-19.
  • 50
    • 33746496974
    • Schmid, H., 2000. Unsupervised learning of period disambiguation for tokenization, University of Stuttgart, Internal Report.
  • 51
    • 0020121734
    • Duration as a cue to the perception of a phrase boundary
    • Scott D.R. Duration as a cue to the perception of a phrase boundary. Journal of the Acoustical Society of America 71 4 (1982) 996-1007
    • (1982) Journal of the Acoustical Society of America , vol.71 , Issue.4 , pp. 996-1007
    • Scott, D.R.1
  • 52
    • 0034275920
    • Prosody-based automatic segmentation of speech into sentences and topics
    • Shriberg E., Stolcke A., Hakkani-Tur D., and Tur G. Prosody-based automatic segmentation of speech into sentences and topics. Speech Communication (2000) 127-154
    • (2000) Speech Communication , pp. 127-154
    • Shriberg, E.1    Stolcke, A.2    Hakkani-Tur, D.3    Tur, G.4
  • 53
    • 33746556270
    • Sonmez, K., Shriberg, E., Heck, L., Weintraub, M., 1998. Modeling dynamic prosodic variation for speaker verification. In: Proceedings of the International Conference on Spoken Language Processing, pp. 3189-3192.
  • 54
    • 33746551891
    • Stevenson, M., Gaizauskas, R., 2000. Experiments on sentence boundary detection. In: Proceedings of the North American Chapter of the Association for Computational Linguistics annual meeting, pp. 24-30.
  • 55
    • 0030374907
    • Stolcke, A., Shriberg, E., 1996. Automatic linguistic segmentation of conversational speech. In: Proceedings of the International Conference on Spoken Language Processing, pp. 1005-1008.
  • 56
    • 33746558309
    • Stolcke, A., Bratt, H., Butzberger, J., Franco, H., Rao Gadde, V.R., Plauché, M., Richey, C., Shriberg, E., Sönmez, K., Weng, F., Zheng, J., 2000. The SRI March 2000 Hub-5 conversational speech transcription system. In: Proceedings of NIST Speech Transcription Workshop, College Park, MD, 2000. URL http://www.nist.gov/speech/publications/tw00/html/cts80/cts80.htm.
  • 57
    • 33746475457
    • Strassel, S., Walker, C., 2003. Data and annotation issues in RT-03. In: EARS Rich Transcription Workshop.
  • 58
    • 33746568103
    • Strassel, S., 2003. Simple Metadata Annotation Specification V5.0, Linguistic Data Consortium. URL http://www.ldc.upenn.edu/projects/MDE/Guidelines/SimpleMDE_V5.0.pdf.
  • 59
    • 0031033301
    • Prosodic features at discourse boundaries of different strength
    • Swerts M. Prosodic features at discourse boundaries of different strength. Journal of the Acoustical Society of America 101 1 (1997) 514-521
    • (1997) Journal of the Acoustical Society of America , vol.101 , Issue.1 , pp. 514-521
    • Swerts, M.1
  • 60
    • 4544316886
    • Wang, D., Narayanan, S.S., 2004. A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues. In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing.
  • 61
    • 1442275185
    • Learning when training data are costly: The effect of class distribution on tree induction
    • Weiss G., and Provost F. Learning when training data are costly: The effect of class distribution on tree induction. Journal of Artificial Intelligence Research (2003) 315-354
    • (2003) Journal of Artificial Intelligence Research , pp. 315-354
    • Weiss, G.1    Provost, F.2
  • 62
    • 85009168880
    • Wrede, B., Shriberg, E., 2003. Spotting "hotspots" in meetings: Human judgments and prosodic cues. In: Proceedings of the European Conference on Speech Communication and Technology, pp. 2805-2808.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.