Volume 3445 LNAI, 2005, Pages 261-290

Text independent methods for speech segmentation

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; SPEECH ANALYSIS; SPEECH CODING; SPEECH SYNTHESIS;

EID: 26844465977     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11520153_12     Document Type: Conference Paper
Times cited : (33)

References (95)
  • 2
    • 0023831656
    • A new statistical approach for the automatic segmentation of continuous speech signals
    • Andre-Obrecht, R.: A New Statistical Approach for the Automatic Segmentation of Continuous Speech Signals. IEEE Transactions on Acoustics, Speech, and Signal Processing, Vol. 36 (1988) 29-40
    • (1988) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.36 , pp. 29-40
    • Andre-Obrecht, R.1
  • 5
    • 84942892779
    • Automatic parameter estimation for a context-independent speech segmentation algorithm
    • Sojka, P., Kopecek, I., Pala, K. (eds.): Text Speech and Dialogue, Lecture Notes in Artificial Intelligence. Springer-Verlag
    • Aversano, G., Esposito, A.: Automatic Parameter Estimation for a Context-Independent Speech Segmentation Algorithm. In Sojka, P., Kopecek, I., Pala, K. (eds.): Text Speech and Dialogue, 5th International Conference, Lecture Notes in Artificial Intelligence. Springer-Verlag (2002) 293-300
    • (2002) 5th International Conference , pp. 293-300
    • Aversano, G.1    Esposito, A.2
  • 9
    • 0024925404
    • Distance measures for signal processing and pattern recognition
    • Basseville, M.: Distance Measures for Signal Processing and Pattern Recognition. Signal Processing, Vol. 18 (1989) 349-369
    • (1989) Signal Processing , vol.18 , pp. 349-369
    • Basseville, M.1
  • 10
    • 23044534737
    • Advances in very low bit-rate speech coding using recognition and synthesis
    • Sojka, P., Kopecek, I., Pala, K. (eds.): Text Speech and Dialogue. Lecture Notes in Artificial Intelligence. Springer-Verlag, Berlin Heidelberg New York
    • Baudoin, G., Capman, F., Cernocky, J., El Chami, F., Charbit, M., Chollet, G., Petrovska-Delacretaz, D.: Advances in Very Low Bit-rate Speech Coding using Recognition and Synthesis. In: Sojka, P., Kopecek, I., Pala, K. (eds.): Text Speech and Dialogue, 5th International Conference. Lecture Notes in Artificial Intelligence. Springer-Verlag, Berlin Heidelberg New York (2002) 269-276
    • (2002) 5th International Conference , pp. 269-276
    • Baudoin, G.1    Capman, F.2    Cernocky, J.3    El Chami, F.4    Charbit, M.5    Chollet, G.6    Petrovska-Delacretaz, D.7
  • 18
    • 0027646354
    • Automatic segmentation and labeling of speech based on hidden Markov models
    • Brugnara, F., Falavigna, D., Omologo, M.: Automatic Segmentation and Labeling of Speech Based on Hidden Markov Models. Speech Communication, Vol. 12 (1993) 357-370
    • (1993) Speech Communication , vol.12 , pp. 357-370
    • Brugnara, F.1    Falavigna, D.2    Omologo, M.3
  • 23
    • 0002302475
    • Off-line statistical analysis in change-point models using non-parametric and likelihood methods
    • M. Basseville, A. Benveniste (eds): Springer-Verlag, New York
    • Deshayes, J., Picard, D.: Off-line Statistical Analysis in Change-point Models Using Non-parametric and Likelihood Methods. In M. Basseville, A. Benveniste (eds): Detection of Abrupt Changes in Signals and Dynamical Systems, Springer-Verlag, New York (1986)
    • (1986) Detection of Abrupt Changes in Signals and Dynamical Systems
    • Deshayes, J.1    Picard, D.2
  • 25
    • 0039099780
    • Consistency of judgments in manual labeling of phonetic segments: The distinction between clear and unclear cases
    • Banff, Canada
    • Eisen, B., Tillman, H. G.: Consistency of Judgments in Manual Labeling of Phonetic Segments: The Distinction between Clear and Unclear Cases. Proceedings of ICSLP '92. Banff, Canada (1992) 871-874
    • (1992) Proceedings of ICSLP '92 , pp. 871-874
    • Eisen, B.1    Tillman, H.G.2
  • 27
    • 26844505908
    • The importance of data for training intelligent devices
    • B. Apolloni, F. Kurfess (eds.): Kluwer Academic/Plenum Publishers
    • Esposito, A.: The Importance of Data for Training Intelligent Devices. In B. Apolloni, F. Kurfess (eds.): From Synapses to Rules: Discovering Symbolic Rules from Neural Processed Data. Kluwer Academic/Plenum Publishers (2002) 229-250
    • (2002) From Synapses to Rules: Discovering Symbolic Rules from Neural Processed Data , pp. 229-250
    • Esposito, A.1
  • 28
    • 26844572990
    • Speech segmentation by parametric filtering: Two new distortion measures and experimental evaluation
    • International Institute for Advanced Scientific Studies, Vietri sul Mare (SA), Italy
    • Esposito, A., Pannacci, L., Perfetti, R., Russo, R.C.: Speech Segmentation by Parametric Filtering: Two New Distortion Measures and Experimental Evaluation, Technical Report n. IIASS-1-00, International Institute for Advanced Scientific Studies, Vietri sul Mare (SA), Italy (2000)
    • (2000) Technical Report N. IIASS-1-00 , vol.IIASS-1-00
    • Esposito, A.1    Pannacci, L.2    Perfetti, R.3    Russo, R.C.4
  • 31
    • 26844489731
    • Automatic speech segmentation using neural network and phonetic transcription
    • Finster, H.: Automatic speech segmentation using neural network and phonetic transcription. In Proceedings of International Conference on Neural Networks, Vol.4 (1992) 734-736
    • (1992) Proceedings of International Conference on Neural Networks , vol.4 , pp. 734-736
    • Finster, H.1
  • 33
    • 0034270090
    • Speech recognition using stochastic phonemic segment model based on phoneme segmentation
    • Furuichi, C., Aizawa, K., Inoue, K.: Speech Recognition Using Stochastic Phonemic Segment Model Based on Phoneme Segmentation. Systems and Computers in Japan, Vol. 31(10) (2000) 1111-1119
    • (2000) Systems and Computers in Japan , vol.31 , Issue.10 , pp. 1111-1119
    • Furuichi, C.1    Aizawa, K.2    Inoue, K.3
  • 36
    • 0038359548
    • A probabilistic framework for segment-based speech recognition
    • Glass, J. R.: A Probabilistic Framework for Segment-Based Speech Recognition. Computer Speech and Language, Vol. 17 (2003) 137-152
    • (2003) Computer Speech and Language , vol.17 , pp. 137-152
    • Glass, J.R.1
  • 38
    • 84951850917
    • Automatic segmentation of speech at the phonetic level
    • T. Caelli et al. (eds)
    • Gómez, J.A., Castro, M. J.: Automatic Segmentation of Speech at the Phonetic Level. In T. Caelli et al. (eds): Lecture Notes in Computer Science, Vol. 2396 (2002) 672-680
    • (2002) Lecture Notes in Computer Science , vol.2396 , pp. 672-680
    • Gómez, J.A.1    Castro, M.J.2
  • 42
    • 26844484699
    • The Switchboard transcription project
    • Center for Language and Speech Processing, Johns Hopkins University, Baltimore USA
    • Greenberg, S.: The Switchboard Transcription Project. Technical Report # 24, Center for Language and Speech Processing, Johns Hopkins University, Baltimore USA (1997)
    • (1997) Technical Report # 24 , vol.24
    • Greenberg, S.1
  • 43
    • 26844548787
    • Analysis in automatic recognition of speech
    • Chollet, G., Di Benedetto M., Esposito, A., Marinaro M. (eds.): Speech Processing, Recognition and Artificial Neural Networks, Springer-Verlag, Berlin Heidelberg New York
    • Hermansky, H.: Analysis in Automatic Recognition of Speech. In: Chollet, G., Di Benedetto M., Esposito, A., Marinaro M. (eds.): Speech Processing, Recognition and Artificial Neural Networks, 3rd International School on Neural Nets "Eduardo R. Caianiello". Springer-Verlag, Berlin Heidelberg New York (1999) 115-137
    • (1999) 3rd International School on Neural Nets "Eduardo R. Caianiello" , pp. 115-137
    • Hermansky, H.1
  • 46
    • 0025041264
    • Perceptual linear predictive (PLP) analysis of speech
    • Hermansky, H.: Perceptual Linear Predictive (PLP) Analysis of Speech. Journal of the Acoustical Society of America, Vol. 87(4) (1990) 1738-1752
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 47
    • 0038133213
    • Automatic speech segmentation based on DTW with the application of the Czech TTS system
    • E. Keller, G. Bailly, A. Monaghan, J. Terken, M. Huckvale (eds.): John Wiley and Sons Ltd.
    • Horak, P.: Automatic Speech Segmentation Based on DTW with the Application of the Czech TTS System. In E. Keller, G. Bailly, A. Monaghan, J. Terken, M. Huckvale (eds.): Improvements in Speech Synthesis. John Wiley and Sons Ltd. (2001) 331-340
    • (2001) Improvements in Speech Synthesis , pp. 331-340
    • Horak, P.1
  • 48
    • 0025680225
    • NTIMIT: A phonetically balanced, continuous speech, telephone bandwidth speech database
    • Jankowski, C., Kalyanswamy, A., Basson, S., Spitz, J.: NTIMIT: A Phonetically Balanced, Continuous Speech, Telephone Bandwidth Speech Database. Proceedings of ICASSP (1990) 109-112
    • (1990) Proceedings of ICASSP , pp. 109-112
    • Jankowski, C.1    Kalyanswamy, A.2    Basson, S.3    Spitz, J.4
  • 49
    • 0030412422
    • Automatic phone segmentation and labeling of continuous speech
    • Jeong, C. G., Jeong, H.: Automatic Phone Segmentation and Labeling of Continuous Speech. Speech Communication, Vol. 20 (1997) 291-311
    • (1997) Speech Communication , vol.20 , pp. 291-311
    • Jeong, C.G.1    Jeong, H.2
  • 51
    • 84904240311
    • Preprocessing and segmentation of the speech signal in the frequency domain for speech recognition
    • Kolokolov, A.S.: Preprocessing and Segmentation of the Speech Signal in the Frequency Domain for Speech Recognition. Automation and Remote Control, Vol.64(6) (2003) 985-994
    • (2003) Automation and Remote Control , vol.64 , Issue.6 , pp. 985-994
    • Kolokolov, A.S.1
  • 52
    • 0003772719
    • Time and pitch scale modification of audio signals
    • M. Kahrs, K. Brandenburg (eds.): Kluwer Academic Publishers
    • Laroche, J.: Time and Pitch Scale Modification of Audio Signals. In M. Kahrs, K. Brandenburg (eds.): Applications of Digital Signal Processing to Audio and Acoustics. Kluwer Academic Publishers (1998)
    • (1998) Applications of Digital Signal Processing to Audio and Acoustics
    • Laroche, J.1
  • 54
    • 0027541355
    • Detection of changes in the spectrum of multidimensional process
    • Lavielle, M.: Detection of Changes in the Spectrum of Multidimensional Process. IEEE Transactions on Signal Processing, Vol. 41 (1993) 742-749
    • (1993) IEEE Transactions on Signal Processing , vol.41 , pp. 742-749
    • Lavielle, M.1
  • 59
    • 0032679043
    • Consonant/vowel segmentation for Mandarin syllable recognition
    • Lin, M.-T., Lee, C.-K., Lin, C.-Y.: Consonant/Vowel Segmentation for Mandarin Syllable Recognition. Computer Speech and Language, Vol. 23 (1999) 207-222
    • (1999) Computer Speech and Language , vol.23 , pp. 207-222
    • Lin, M.-T.1    Lee, C.-K.2    Lin, C.-Y.3
  • 60
    • 0037850986
    • Phonetic alignment: Speech synthesis-based vs. Viterbi-based
    • Malfrère, F., Deroo, O., Dutoit, T., Ris, C.: Phonetic Alignment: Speech Synthesis-Based vs. Viterbi-Based. Speech Communication, Vol. 40(4) (2003) 503-515
    • (2003) Speech Communication , vol.40 , Issue.4 , pp. 503-515
    • Malfrère, F.1    Deroo, O.2    Dutoit, T.3    Ris, C.4
  • 61
    • 0016519041
    • Spectral linear prediction: Properties and applications
    • Makhoul, J.: Spectral Linear Prediction: Properties and Applications. IEEE Transactions ASSP, Vol. 23(5) (1975) 283-296
    • (1975) IEEE Transactions ASSP , vol.23 , Issue.5 , pp. 283-296
    • Makhoul, J.1
  • 62
    • 0004119130
    • A multi-band approach to automatic speech recognition
    • Ph.D. thesis, University of California, Berkeley, December, chap. 4. Reprinted Berkeley, CA
    • Mirghafori, N.: A Multi-Band Approach to Automatic Speech Recognition. Ph.D. thesis, University of California, Berkeley, December 1988, chap. 4. Reprinted as ICSI Technical Report, TR-99-04, Berkeley, CA (1999)
    • (1988) ICSI Technical Report , vol.TR-99-04
    • Mirghafori, N.1
  • 64
    • 0033351870
    • Automatic speech synthesis unit generation with MLP based postprocessor against auto-segmented phoneme errors
    • Park, E.-Y., Kim, S.-H., Chung, J.-H.: Automatic Speech Synthesis Unit Generation with MLP Based Postprocessor against Auto-segmented Phoneme Errors. In Proceedings of International Joint Conference on Neural Networks, Vol.5 (1999) 2985-2990
    • (1999) Proceedings of International Joint Conference on Neural Networks , vol.5 , pp. 2985-2990
    • Park, E.-Y.1    Kim, S.-H.2    Chung, J.-H.3
  • 67
    • 0032139769
    • Automatic segmentation of speech recorded in unknown noisy channel characteristics
    • Pellom, B. L., Hansen, J. H. L.: Automatic Segmentation of Speech Recorded in Unknown Noisy Channel Characteristics. Speech Communication, Vol. 25 (1998) 97-116
    • (1998) Speech Communication , vol.25 , pp. 97-116
    • Pellom, B.L.1    Hansen, J.H.L.2
  • 70
    • 0025465111
    • Continuous speech recognition using hidden Markov models
    • Picone, J.: Continuous Speech Recognition Using Hidden Markov Models. IEEE ASSP Magazine (1990) 26-41
    • (1990) IEEE ASSP Magazine , pp. 26-41
    • Picone, J.1
  • 71
    • 1842475640
    • Automatic segmentation of continuous speech using phase group delay functions
    • Prasad, V. K., Nagarajan, T., Murthy, H. A.: Automatic Segmentation of Continuous Speech Using Phase Group Delay Functions. Speech Communication, Vol.42 (2004) 429-446
    • (2004) Speech Communication , vol.42 , pp. 429-446
    • Prasad, V.K.1    Nagarajan, T.2    Murthy, H.A.3
  • 74
  • 75
    • 85009251302
    • An analysis of transcription consistency in spontaneous speech from the Buckeye corpus
    • Denver, USA
    • Raymond, W. D. et al.: An Analysis of Transcription Consistency in Spontaneous Speech from the Buckeye Corpus. Proceedings of ICSLP '02. Denver, USA (2002)
    • (2002) Proceedings of ICSLP '02
    • Raymond, W.D.1
  • 80
    • 0025460605
    • The application of dynamic programming to connected speech recognition
    • Silverman, H. F., Morgan, D. P.: The Application of Dynamic Programming to Connected Speech Recognition. IEEE ASSP Magazine (1990) 6-25
    • (1990) IEEE ASSP Magazine , pp. 6-25
    • Silverman, H.F.1    Morgan, D.P.2
  • 86
    • 0009617005
    • A review and new approaches for automatic segmentation of continuous speech signals
    • L. Torres et al. (eds): Elsevier Publisher, New York
    • Vidal, E., Marzal, A.: A Review and New Approaches for Automatic Segmentation of Continuous Speech Signals. In L. Torres et al. (eds): Signal Processing V: Theories and Applications, Elsevier Publisher, New York (1990) 43-53
    • (1990) Signal Processing V: Theories and Applications , pp. 43-53
    • Vidal, E.1    Marzal, A.2
  • 87
    • 0030264759
    • Automatic segmentation and labeling of multi-lingual speech data
    • Vorstermans, A., Martens, J.P., Van Coile, B.: Automatic Segmentation and Labeling of Multi-lingual Speech Data. Speech Communication, Vol. 19(4) (1996) 271-293
    • (1996) Speech Communication , vol.19 , Issue.4 , pp. 271-293
    • Vorstermans, A.1    Martens, J.P.2    Van Coile, B.3
  • 88
    • 0037380322
    • A new discrete spectral modeling method and an application to CELP coding
    • Wei, B., Gibson, J.D.: A New Discrete Spectral Modeling Method and an Application to CELP Coding. IEEE Signal Processing Letters, Vol. 10(4) (2003) 101-103
    • (2003) IEEE Signal Processing Letters , vol.10 , Issue.4 , pp. 101-103
    • Wei, B.1    Gibson, J.D.2
  • 91
    • 0030362971
    • Estimating the quality of phonetic transcriptions and segmentations of speech signals
    • Philadelphia, USA
    • Wesenick, M.B., Kipp, A.: Estimating the Quality of Phonetic Transcriptions and Segmentations of Speech Signals. Proceedings of ICSLP'96. Philadelphia, USA (1996) 129-132
    • (1996) Proceedings of ICSLP'96 , pp. 129-132
    • Wesenick, M.B.1    Kipp, A.2
  • 94
    • 0028530231
    • State clustering in hidden Markov model-based continuous speech recognition
    • Young, S. J., Woodland, P. C.: State Clustering in Hidden Markov Model-Based Continuous Speech Recognition. Computer Speech and Language, Vol.8 (1994) 369-383
    • (1994) Computer Speech and Language , vol.8 , pp. 369-383
    • Young, S.J.1    Woodland, P.C.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.