SCOPUS information retrieval platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 3445 LNAI, 2005, Pages 261-290

Text independent methods for speech segmentation

(2) Esposito, Anna a,b,c Aversano, Guido b,c,d

a SECOND UNIVERSITY OF NAPLES (Italy)

b IIASS (Italy)

c INFM (Italy)

d TELECOM PARISTECH (France)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; SPEECH ANALYSIS; SPEECH CODING; SPEECH SYNTHESIS;

AUTOMATIC SEGMENTATION; PARAMETRIC SEGMENTATION; SPEECH CORPORA ANNOTATION; SPEECH SEGMENTATION;

SPEECH RECOGNITION;

EID: 26844465977 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/11520153_12 Document Type: Conference Paper

Times cited : (33)

References (95)

1
- 0023834348
- Event-based multiple resolution analysis of speech signals
- New-York
- Altosaar, T., Karjalainen, M.: Event-Based Multiple Resolution Analysis of Speech Signals. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing, New-York (1988) 327-330
- (1988) Proceedings of International Conference on Acoustics, Speech, and Signal Processing , pp. 327-330
- Altosaar, T.¹ Karjalainen, M.²

2
- 0023831656
- A new statistical approach for the automatic segmentation of continuous speech signals
- Andre-Obrecht R.: A New Statistical Approach for the Automatic Segmentation of Continuous Speech Signals. IEEE Transactions on Acoustics, Speech Signal Processing, Vol. 36 (1988) 29-40
- (1988) IEEE Transactions on Acoustics, Speech Signal Processing , vol.36 , pp. 29-40
- Andre-Obrecht, R.¹

3
- 0020602364
- Efficient coding of LPC parameters by temporal decomposition
- Atal, B. S.: Efficient Coding of LPC Parameters by Temporal Decomposition. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing (1983) 81-84
- (1983) Proceedings of International Conference on Acoustics, Speech, and Signal Processing , pp. 81-84
- Atal, B.S.¹

4
- 26844470504
- Phone level automatic speech segmentation
- Ph.D. Thesis, Università di Salerno - Italy
- Aversano, G.: Phone Level Automatic Speech Segmentation. A Text-Independent Segmentation Algorithm and a Software Tool for Speech Annotation and Analysis. Ph.D. Thesis, Università di Salerno - Italy (2004)
- (2004) A Text-Independent Segmentation Algorithm and A Software Tool for Speech Annotation and Analysis
- Aversano, G.¹

5
- 84942892779
- Automatic parameter estimation for a context-independent speech segmentation algorithm
- Sojka, P., Kopecek, I., Pala, K. (eds.): Text Speech and Dialogue, Lecture Notes in Artificial Intelligence. Springer-Verlag
- Aversano, G., Esposito, A.: Automatic Parameter Estimation for a Context-Independent Speech Segmentation Algorithm. In Sojka, P., Kopecek, I., Pala, K. (eds.): Text Speech and Dialogue, 5th International Conference, Lecture Notes in Artificial Intelligence. Springer-Verlag (2002) 293-300
- (2002) 5th International Conference , pp. 293-300
- Aversano, G.¹ Esposito, A.²

6
- 0035574930
- A new text-independent method for phoneme segmentation
- R. L. Ewing et al. (eds)
- Aversano G., Esposito A., Esposito A., Marinaro M.: A New Text-Independent Method for Phoneme Segmentation. In R. L. Ewing et al. (eds): Proceedings of the IEEE International Workshop on Circuits and Systems, Vol.2 (2001) 516-519
- (2001) Proceedings of the IEEE International Workshop on Circuits and Systems , vol.2 , pp. 516-519
- Aversano, G.¹ Esposito, A.² Esposito, A.³ Marinaro, M.⁴

7
- 0036079747
- Automatic language identification in broadcast news
- Backfried, G., Rainoldi, R., Riedler, J.: Automatic Language Identification in Broadcast News. In Proceedings of International Joint Conference on Neural Networks, Vol.2 (2002) 1406-1410
- (2002) Proceedings of International Joint Conference on Neural Networks , vol.2 , pp. 1406-1410
- Backfried, G.¹ Rainoldi, R.² Riedler, J.³

8
- 0003937919
- Englewood Cliffs, NJ, Prentice Hall
- Basseville M., Nikiforov I. V.: Detection of Abrupt Changes: Theory and Applications. Englewood Cliffs, NJ, Prentice Hall (1993)
- (1993) Detection of Abrupt Changes: Theory and Applications
- Basseville, M.¹ Nikiforov, I.V.²

9
- 0024925404
- Distance measures for signal processing and pattern recognition
- Basseville, M.: Distance Measures for Signal Processing and Pattern Recognition. Signal Processing, Vol. 18 (1989) 349-369
- (1989) Signal Processing , vol.18 , pp. 349-369
- Basseville, M.¹

10
- 23044534737
- Advances in very low bit-rate speech coding using recognition and synthesis
- Sojka, P., Kopecek, I., Pala, K. (eds.): Text Speech and Dialogue. Lecture Notes in Artificial Intelligence. Springer-Verlag, Berlin Heidelberg New York
- Baudoin, G., Capman F., Cernocky, J., El Chami, F., Charbit, M., Chollet, G., Petrovska-Delacretaz, D.: Advances in Very Low Bit-rate Speech Coding using Recognition and Synthesis. In: Sojka, P., Kopecek, I., Pala, K. (eds.): Text Speech and Dialogue, 5th International Conference. Lecture Notes in Artificial Intelligence. Springer-Verlag, Berlin Heidelberg New York (2002) 269-276
- (2002) 5th International Conference , pp. 269-276
- Baudoin, G.¹ Capman, F.² Cernocky, J.³ El Chami, F.⁴ Charbit, M.⁵ Chollet, G.⁶ Petrovska-Delacretaz, D.⁷

11
- 85009086783
- Regional pronunciation variants for automatic segmentation
- Athens, Greece
- Beringer, N., Neff, M.: Regional Pronunciation Variants for Automatic Segmentation. In Proceedings of the 2nd International Conference on Language Resources and Evaluation. Athens, Greece (2000)
- (2000) Proceedings of the 2nd International Conference on Language Resources and Evaluation
- Beringer, N.¹ Neff, M.²

12
- 85009112389
- The quality of multilingual automatic segmentation using German MAUS
- Beijing, China
- Beringer, N., Schiel, F.: The Quality of Multilingual Automatic Segmentation Using German MAUS. In Proceedings of the 6th Int. Conference on Spoken Language Processing. Beijing, China (2000) 728-731
- (2000) Proceedings of the 6th Int. Conference on Spoken Language Processing , pp. 728-731
- Beringer, N.¹ Schiel, F.²

13
- 26844560874
- Independent automatic segmentation of speech by pronunciation modeling
- San Francisco
- Beringer, N., Schiel, F.: Independent Automatic Segmentation of Speech by Pronunciation Modeling. In Proceedings of the 14th Int. Congress of Phonetic Sciences. San Francisco (1999) 1653-1656
- (1999) Proceedings of the 14th Int. Congress of Phonetic Sciences , pp. 1653-1656
- Beringer, N.¹ Schiel, F.²

14
- 84892159684
- Automatic question generation for decision tree based state tying
- Beulen, K., Ney, H.: Automatic Question Generation for Decision Tree Based State Tying. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (1998) 805-808
- (1998) Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing , pp. 805-808
- Beulen, K.¹ Ney, H.²

15
- 85135168826
- State tying for context dependent phoneme models
- Beulen, K., Bransch, E., Ney, H.: State Tying for Context Dependent Phoneme Models. In Proceedings of European Conference on Speech Communication and Technology (1997) 1179-1182
- (1997) Proceedings of European Conference on Speech Communication and Technology , pp. 1179-1182
- Beulen, K.¹ Bransch, E.² Ney, H.³

16
- 26844568056
- How to improve human and machine transcriptions of spontaneous speech
- Tokyo
- Binnenpoorte, D., Goddijn, S., Cucchiarini, C.: How to Improve Human and Machine Transcriptions of Spontaneous Speech. ISCA/IEEE Workshop on Spontaneous Speech Processing and Recognition. Tokyo (2003) 147-150
- (2003) ISCA/IEEE Workshop on Spontaneous Speech Processing and Recognition , pp. 147-150
- Binnenpoorte, D.¹ Goddijn, S.² Cucchiarini, C.³

17
- 0003487601
- Clarendon Press
- Bishop, C. M.: Neural Networks for Pattern Recognition, Clarendon Press (1995)
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.M.¹

18
- 0027646354
- Automatic segmentation and labeling of speech based on hidden Markov models
- Brugnara F., Falavigna, D., Omologo, M.: Automatic Segmentation and Labeling of Speech Based on Hidden Markov Models. Speech Communication, Vol. 12 (1993) 357-370
- (1993) Speech Communication , vol.12 , pp. 357-370
- Brugnara, F.¹ Falavigna, D.² Omologo, M.³

19
- 26844510057
- Improved connected digit recognition using spectral variation functions
- Brugnara F., De Mori A., Giuliani D., Omologo M.: Improved Connected Digit Recognition Using Spectral Variation Functions. In Proceedings of International Conference on Spoken Language Processing (1992) 627-630
- (1992) Proceedings of International Conference on Spoken Language Processing , pp. 627-630
- Brugnara, F.¹ De Mori, A.² Giuliani, D.³ Omologo, M.⁴

20
- 85009142181
- Automatic phonetic transcription of spontaneous speech (American English)
- Beijing, China
- Chang, S., Shastri, L., Greenberg, S.: Automatic Phonetic Transcription of Spontaneous Speech (American English). Proceedings of the 6th International Conference on Spoken Language Processing. Beijing, China (2000) 330-333
- (2000) Proceedings of the 6th International Conference on Spoken Language Processing , pp. 330-333
- Chang, S.¹ Shastri, L.² Greenberg, S.³

21
- 85009205059
- Speech and language processing: Where have we been and where are we going?
- Geneva, Switzerland
- Church, K. W.: Speech and Language Processing: Where Have We Been and Where Are We Going? Proceedings of the 8th European Conference on Speech Communication and Technology - Eurospeech '03. Geneva, Switzerland (2003) 1-4
- (2003) Proceedings of the 8th European Conference on Speech Communication and Technology - Eurospeech '03 , pp. 1-4
- Church, K.W.¹

22
- 84962892151
- Continuous multi-band speech recognition using Bayesian networks
- Trento, Italy
- Daoudi, K., Fohr, D., Antoine, C.: Continuous Multi-Band Speech Recognition using Bayesian Networks. Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop, Trento, Italy (2001)
- (2001) Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop
- Daoudi, K.¹ Fohr, D.² Antoine, C.³

23
- 0002302475
- Off-line statistical analysis in change-point models using non-parametric and likelihood methods
- M. Basseville, A. Benveniste (eds): Springer-Verlag, New-York
- Deshayes, J., Picard, D.: Off-line Statistical Analysis in Change-point Models Using Non-parametric and Likelihood Methods. In M. Basseville, A. Benveniste (eds): Detection of Abrupt Changes in Signals and Dynamical Systems, Springer-Verlag, New-York (1986)
- (1986) Detection of Abrupt Changes in Signals and Dynamical Systems
- Deshayes, J.¹ Picard, D.²

24
- 26844452199
- Very-low-rate speech compression by indexation of polyphones
- Geneva, Switzerland
- du Jeu, C., Charbit, M., Chollet, G.: Very-low-rate Speech Compression by Indexation of Polyphones. Proceedings of the 8th European Conference on Speech Communication and Technology - Eurospeech '03. Geneva, Switzerland (2003) 1085-1088
- (2003) Proceedings of the 8th European Conference on Speech Communication and Technology - Eurospeech '03 , pp. 1085-1088
- Du Jeu, C.¹ Charbit, M.² Chollet, G.³

25
- 0039099780
- Consistency of judgments in manual labeling of phonetic segments: The distinction between clear and unclear cases
- Banff, Canada
- Eisen, B., Tillman, H. G.: Consistency of Judgments in Manual Labeling of Phonetic Segments: The Distinction between Clear and Unclear Cases. Proceedings of ICSLP '92. Banff, Canada (1992) 871-874
- (1992) Proceedings of ICSLP '92 , pp. 871-874
- Eisen, B.¹ Tillman, H.G.²

26
- 0348018640
- Reliability of speech segmentation and labeling at different levels of transcription
- Berlin, Germany
- Eisen, B.: Reliability of Speech Segmentation and Labeling at Different Levels of Transcription. Proceedings of the 3rd European Conference on Speech Communication and Technology. Eurospeech '93. Berlin, Germany (1991) 673-676
- (1991) Proceedings of the 3rd European Conference on Speech Communication and Technology. Eurospeech '93 , pp. 673-676
- Eisen, B.¹

27
- 26844505908
- The importance of data for training intelligent devices
- B. Apolloni, F. Kurfess (eds.): Kluwer Academic/Plenum Publishers
- Esposito, A.: The Importance of Data for Training Intelligent Devices. In B. Apolloni, F. Kurfess (eds.): From Synapses to Rules: Discovering Symbolic Rules from Neural Processed Data. Kluwer Academic/Plenum Publishers (2002) 229-250
- (2002) From Synapses to Rules: Discovering Symbolic Rules from Neural Processed Data , pp. 229-250
- Esposito, A.¹

28
- 26844572990
- Speech segmentation by parametric filtering: Two new distortion measures and experimental evaluation
- International Institute for Advanced Scientific Studies, Vietri sul Mare (SA), Italy
- Esposito, A., Pannacci, L., Perfetti, R., Russo, R.C.: Speech Segmentation by Parametric Filtering: Two New Distortion Measures and Experimental Evaluation, Technical Report n. IIASS-1-00, International Institute for Advanced Scientific Studies, Vietri sul Mare (SA), Italy (2000)
- (2000) Technical Report N. IIASS-1-00 , vol.IIASS-1-00
- Esposito, A.¹ Pannacci, L.² Perfetti, R.³ Russo, R.C.⁴

29
- 0006563543
- Method for time or frequency compression expansion of speech
- Fairbanks, G., Everitt, W. and Jaeger, R.: Method for Time or Frequency Compression Expansion of Speech. IEEE Transactions on Audio and Electro-acoustics, AU-2 (1954) 7-12
- (1954) IEEE Transactions on Audio and Electro-acoustics , vol.AU-2 , pp. 7-12
- Fairbanks, G.¹ Everitt, W.² Jaeger, R.³

30
- 26844481974
- Speech segmentation using multilevel hybrid filters
- Faundez-Zanuy, M., Vallverdù-Bayes, F.: Speech Segmentation Using Multilevel Hybrid Filters. In Proceedings of European Signal Processing Conference (EUSIPCO) (1996) 1003-1006
- (1996) Proceedings of European Signal Processing Conference (EUSIPCO) , pp. 1003-1006
- Faundez-Zanuy, M.¹ Vallverdù-Bayes, F.²

31
- 26844489731
- Automatic speech segmentation using neural network and phonetic transcription
- Finster, H.: Automatic speech segmentation using neural network and phonetic transcription. In Proceedings of International Conference on Neural Networks, Vol.4 (1992) 734-736
- (1992) Proceedings of International Conference on Neural Networks , vol.4 , pp. 734-736
- Finster, H.¹

32
- 0348206344
- Segment based variable frame rate speech analysis and recognition using spectral variation function
- Flammia, G., Dalsgaard, P., Andersen, O., Lindberg, B.: Segment Based Variable Frame Rate Speech Analysis and Recognition Using Spectral Variation Function. In Proceedings of International Conference on Spoken Language Processing (1992) 983-986
- (1992) Proceedings of International Conference on Spoken Language Processing , pp. 983-986
- Flammia, G.¹ Dalsgaard, P.² Andersen, O.³ Lindberg, B.⁴

33
- 0034270090
- Speech recognition using stochastic phonemic segment model based on phoneme segmentation
- Furuichi, C., Aizawa, K., Inoue, K.: Speech Recognition Using Stochastic Phonemic Segment Model Based on Phoneme Segmentation. Systems and Computers in Japan, Vol. 31(10) (2000) 1111-1119
- (2000) Systems and Computers in Japan , vol.31 , Issue.10 , pp. 1111-1119
- Furuichi, C.¹ Aizawa, K.² Inoue, K.³

34
- 34248183857
- The DARPA TIMIT acoustic-phonetic continuous speech corpus
- NTIS order number PB91-100354
- Garofolo, J. S., Lamel, L. F., Fisher, W. M., Fiscus, J. G., Pallett, D. S., Dahlgren, N. L.: The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus. CDROM (1992) NTIS order number PB91-100354
- (1992) CDROM
- Garofolo, J.S.¹ Lamel, L.F.² Fisher, W.M.³ Fiscus, J.G.⁴ Pallett, D.S.⁵ Dahlgren, N.L.⁶

35
- 0033697852
- CSELT hybrid HMM/neural networks technology for continuous speech recognition
- Gemello, R., Albesano, D., Mana, F.: CSELT Hybrid HMM/Neural Networks Technology for Continuous Speech Recognition. In Proceedings of IEEE-INNS-ENNS International Joint Conference on Neural Networks, Vol. 5 (2000) 103-108
- (2000) Proceedings of IEEE-INNS-ENNS International Joint Conference on Neural Networks , vol.5 , pp. 103-108
- Gemello, R.¹ Albesano, D.² Mana, F.³

36
- 0038359548
- A probabilistic framework for segment-based speech recognition
- Glass, J. R.: A Probabilistic Framework for Segment-Based Speech Recognition. Computer Speech and Language, Vol. 17 (2003) 137-152
- (2003) Computer Speech and Language , vol.17 , pp. 137-152
- Glass, J.R.¹

37
- 0023776395
- Multilevel acoustic segmentation of continuous speech
- Glass, J. R., Zue, V.W.: Multilevel Acoustic Segmentation of Continuous Speech. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing (1988) 429-432
- (1988) Proceedings of International Conference on Acoustics, Speech, and Signal Processing , pp. 429-432
- Glass, J.R.¹ Zue, V.W.²

38
- 84951850917
- Automatic segmentation of speech at the phonetic level
- T. Caelli et al. (eds)
- Gómez, J.A., Castro, M. J.: Automatic Segmentation of Speech at the Phonetic Level. In T. Caelli et al. (eds): Lecture Notes in Computer Science, Vol. 2396 (2002) 672-680
- (2002) Lecture Notes in Computer Science , vol.2396 , pp. 672-680
- Gómez, J.A.¹ Castro, M.J.²

39
- 0019050955
- Distortion measures for speech processing
- Gray, R.M., Buzo, A., Gray, A., Matsuyama, Y.: Distortion Measures for Speech Processing. IEEE Transactions on Acoustics, Speech Signal Processing, Vol. 28 (1980) 367-376
- (1980) IEEE Transactions on Acoustics, Speech Signal Processing , vol.28 , pp. 367-376
- Gray, R.M.¹ Buzo, A.² Gray, A.³ Matsuyama, Y.⁴

40
- 0003407830
- John Wiley and Sons
- Green, D., Swets, J.: Signal Detection Theory and Psychophysics. John Wiley and Sons (1996)
- (1996) Signal Detection Theory and Psychophysics
- Green, D.¹ Swets, J.²

41
- 85009179135
- Strategies for automatic multi-tier annotation of spoken language corpora
- Geneva, Switzerland
- Greenberg, S.: Strategies for Automatic Multi-Tier Annotation of Spoken Language Corpora. In Proceedings of the 8th European Conference on Speech Communication and Technology - Eurospeech '03. Geneva, Switzerland (2003) 45-48
- (2003) Proceedings of the 8th European Conference on Speech Communication and Technology - Eurospeech '03 , pp. 45-48
- Greenberg, S.¹

42
- 26844484699
- The switchboard transcription project
- Center for Language and Speech Processing, Johns Hopkins University, Baltimore USA
- Greenberg, S.: The Switchboard Transcription Project. Technical Report # 24, Center for Language and Speech Processing, Johns Hopkins University, Baltimore USA (1997)
- (1997) Technical Report # 24 , vol.24
- Greenberg, S.¹

43
- 26844548787
- Analysis in automatic recognition of speech
- Chollet, G., Di Benedetto M., Esposito, A., Marinaro M. (eds.): Speech Processing, Recognition and Artificial Neural Networks, Springer-Verlag, Berlin Heidelberg New York
- Hermansky, H.: Analysis in Automatic Recognition of Speech. In: Chollet, G., Di Benedetto M., Esposito, A., Marinaro M. (eds.): Speech Processing, Recognition and Artificial Neural Networks, 3rd International School on Neural Nets "Eduardo R. Caianiello". Springer-Verlag, Berlin Heidelberg New York (1999) 115-137
- (1999) 3rd International School on Neural Nets "Eduardo R. Caianiello" , pp. 115-137
- Hermansky, H.¹

44
- 0346126937
- Auditory modeling in automatic recognition of speech
- Keele, UK
- Hermansky, H.: Auditory Modeling in Automatic Recognition of Speech. Proceedings of the ESCA Workshop on the Auditory Basis of Speech Perception. Keele, UK (1996)
- (1996) Proceedings of the ESCA Workshop on the Auditory Basis of Speech Perception
- Hermansky, H.¹

45
- 0028517164
- RASTA processing of speech
- Hermansky H., Morgan N.: RASTA Processing of Speech. IEEE Transactions. Speech and Audio Processing, Vol. 2(4) (1994) 578-589
- (1994) IEEE Transactions. Speech and Audio Processing , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

46
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Hermansky H.: Perceptual Linear Predictive (PLP) Analysis of Speech. Journal of Acoustical Society of America, Vol. 87(4) (1990) 1738-1752
- (1990) Journal of Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

47
- 0038133213
- Automatic speech segmentation based on DTW with the application of the Czech TTS system
- E. Keller, G. Bailly, A. Monaghan, J. Terken, M. Huckvale (eds.): John Wiley and Sons Ltd.
- Horak, P.: Automatic Speech Segmentation Based on DTW with the Application of the Czech TTS System. In E. Keller, G. Bailly, A. Monaghan, J. Terken, M. Huckvale (eds.): Improvements in Speech Synthesis. John Wiley and Sons Ltd. (2001) 331-340
- (2001) Improvements in Speech Synthesis , pp. 331-340
- Horak, P.¹

48
- 0025680225
- NTIMIT: A phonetically balanced, continuous speech, telephone bandwidth speech database
- Jankowski, C., Kalyanswamy, A., Basson, S., Spitz, J.: NTIMIT: A Phonetically Balanced, Continuous Speech, Telephone Bandwidth Speech Database. Proceedings of ICASSP (1990) 109-112
- (1990) Proceedings of ICASSP , pp. 109-112
- Jankowski, C.¹ Kalyanswamy, A.² Basson, S.³ Spitz, J.⁴

49
- 0030412422
- Automatic phone segmentation and labeling of continuous speech
- Jeong, C. G., Jeong, H.: Automatic Phone Segmentation and Labeling of Continuous Speech. Speech Communication, Vol. 20 (1997) 291-311
- (1997) Speech Communication , vol.20 , pp. 291-311
- Jeong, C.G.¹ Jeong, H.²

50
- 84893657385
- Multilingual acoustic modeling using graphemes
- Kanthak, S., Ney, H.: Multilingual Acoustic Modeling Using Graphemes. In Proceedings of European Conference on Speech Communication and Technology, Vol.2 (2003) 1145-1148
- (2003) Proceedings of European Conference on Speech Communication and Technology , vol.2 , pp. 1145-1148
- Kanthak, S.¹ Ney, H.²

51
- 84904240311
- Preprocessing and segmentation of the speech signal in the frequency domain for speech recognition
- Kolokolov, A.S.: Preprocessing and Segmentation of the Speech Signal in the Frequency Domain for Speech Recognition. Automation and Remote Control, Vol.64(6) (2003) 985-994
- (2003) Automation and Remote Control , vol.64 , Issue.6 , pp. 985-994
- Kolokolov, A.S.¹

52
- 0003772719
- Time and pitch scale modification of audio signals
- M. Kahrs, K. Brandenburg (eds.): Kluwer Academic Publishers
- Laroche, J.: Time and Pitch Scale Modification of Audio Signals. In M. Kahrs, K. Brandenburg (eds.): Applications of Digital Signal Processing to Audio and Acoustics. Kluwer Academic Publishers (1998)
- (1998) Applications of Digital Signal Processing to Audio and Acoustics
- Laroche, J.¹

53
- 64149129299
- An improved algorithm for the automatic segmentation of speech corpora
- M. González Rodriguez, C. Paz Suárez Araujo (eds.)
- Laureys, T., Demuynck, K., Duchateau, J., Wambacq, P.: An Improved Algorithm for the Automatic Segmentation of Speech Corpora. In M. González Rodriguez, C. Paz Suárez Araujo (eds.): Proceedings of Third International Conference on Language Resources and Evaluation (2002) 1564-1567
- (2002) Proceedings of Third International Conference on Language Resources and Evaluation , pp. 1564-1567
- Laureys, T.¹ Demuynck, K.² Duchateau, J.³ Wambacq, P.⁴

54
- 0027541355
- Detection of changes in the spectrum of multidimensional process
- Lavielle, M.: Detection of Changes in the Spectrum of Multidimensional Process. IEEE Transactions on Signal Processing, Vol. 41 (1993) 742-749
- (1993) IEEE Transactions on Signal Processing , vol.41 , pp. 742-749
- Lavielle, M.¹

55
- 26844462286
- Pseudo-segment based speech recognition using neural recurrent whole-word recognizers
- Le Cerf, P., Demuynck, K., Duchateau, J., Van Compernolle, D.: Pseudo-Segment Based Speech Recognition Using Neural Recurrent Whole-Word Recognizers. In Proceedings of International Conference on Acoustics, Speech and Signal Processing, Vol.1 (1994) 609-612
- (1994) Proceedings of International Conference on Acoustics, Speech and Signal Processing , vol.1 , pp. 609-612
- Le Cerf, P.¹ Demuynck, K.² Duchateau, J.³ Van Compernolle, D.⁴

56
- 26844576363
- A comparative study of speech segmentation for automatic multi-lingual recognition
- Li, B. N.L., Liu, J.N.K.: A Comparative Study of Speech Segmentation for Automatic Multi-Lingual Recognition. In Proceedings of Second ACM Hong Kong Postgraduate Research Conference (1999) http://www.cse.cuhk.edu.hk/~acm-hk/activity/pg/polyu-nlli.pdf
- (1999) Proceedings of Second ACM Hong Kong Postgraduate Research Conference
- Li, B.N.L.¹ Liu, J.N.K.²

57
- 0030143452
- Speech analysis and segmentation by parametric filtering
- Li T. H., Gibson J. D.: Speech Analysis and Segmentation by Parametric Filtering. IEEE Transactions on Speech and Audio Processing, Vol. 4(3) (1996) 203-213
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.3 , pp. 203-213
- Li, T.H.¹ Gibson, J.D.²

58
- 0029707026
- Time-correlation analysis of non-stationary signals with application to speech processing
- Paris, France
- Li T. H., Gibson J. D.: Time-Correlation Analysis of Non-stationary Signals with Application to Speech Processing. In Proceedings of International Symposium on Time-Frequency & Time-Scale Analysis, Paris, France (1996) 449-452
- (1996) Proceedings of International Symposium on Time-Frequency & Time-Scale Analysis , pp. 449-452
- Li, T.H.¹ Gibson, J.D.²

59
- 0032679043
- Consonant/vowel segmentation for Mandarin syllable recognition
- Lin, M.-T., Lee, C.-K., Lin, C.-Y.: Consonant/Vowel Segmentation for Mandarin Syllable Recognition. Computer Speech and Language, Vol. 23 (1999) 207-222
- (1999) Computer Speech and Language , vol.23 , pp. 207-222
- Lin, M.-T.¹ Lee, C.-K.² Lin, C.-Y.³

60
- 0037850986
- Phonetic alignment: Speech synthesis-based vs. Viterbi-based
- Malfrère, F., Deroo, O., Dutoit, T., Ris, C.: Phonetic Alignment: Speech Synthesis-Based vs. Viterbi-Based. Speech Communication, Vol. 40(4) (2003) 503-515
- (2003) Speech Communication , vol.40 , Issue.4 , pp. 503-515
- Malfrère, F.¹ Deroo, O.² Dutoit, T.³ Ris, C.⁴

61
- 0016519041
- Spectral linear prediction: Properties and applications
- Makhoul, J.: Spectral Linear Prediction: Properties and Applications. IEEE Transactions ASSP, Vol. 23(5) (1975) 283-296
- (1975) IEEE Transactions ASSP , vol.23 , Issue.5 , pp. 283-296
- Makhoul, J.¹

62
- 0004119130
- A multi-band approach to automatic speech recognition
- Ph.D. thesis, University of California, Berkeley, December, chap. 4. Reprinted Berkeley, CA
- Mirghafori, N.: A Multi-Band Approach to Automatic Speech Recognition. Ph.D. thesis, University of California, Berkeley, December 1988, chap. 4. Reprinted as ICSI Technical Report, TR-99-04, Berkeley, CA (1999)
- (1988) ICSI Technical Report , vol.TR-99-04
- Mirghafori, N.¹

63
- 0028996888
- Using explicit segmentation to improve HMM phone recognition
- Mitchell C. D., Harper M. P., Jamieson L. H.: Using Explicit Segmentation to Improve HMM Phone Recognition. In Proceedings of International Conference on Acoustic, Speech and Signal Processing (1995) 229-232
- (1995) Proceedings of International Conference on Acoustic, Speech and Signal Processing , pp. 229-232
- Mitchell, C.D.¹ Harper, M.P.² Jamieson, L.H.³

64
- 0033351870
- Automatic speech synthesis unit generation with MLP based postprocessor against auto-segmented phoneme errors
- Park, E.-Y., Kim, S.-H., Chung, J.-H.: Automatic Speech Synthesis Unit Generation with MLP Based Postprocessor against Auto-segmented Phoneme Errors. In Proceedings of International Joint Conference on Neural Networks, Vol.5 (1999) 2985-2990
- (1999) Proceedings of International Joint Conference on Neural Networks , vol.5 , pp. 2985-2990
- Park, E.-Y.¹ Kim, S.-H.² Chung, J.-H.³

65
- 0004274888
- McGraw-Hill, New-York
- Parson, T.: Voice and Speech Processing. McGraw-Hill, New-York (1986)
- (1986) Voice and Speech Processing
- Parson, T.¹

66
- 0041959901
- Time series, statistics and information
- D. Brillinger, P. Caines, J. Geweke, E. Parzen, M. Rosenblatt, M.S. Taqqu (eds): Springer Verlag, New York
- Parzen, E.: Time Series, Statistics and Information. In D. Brillinger, P. Caines, J. Geweke, E. Parzen, M. Rosenblatt, M.S. Taqqu (eds): New Directions in Time Series Analysis, Part I. The IMA Volumes in Mathematics and its Applications, Vol. 45, Springer Verlag, New York (1992)
- (1992) New Directions in Time Series Analysis, Part I. The IMA Volumes in Mathematics and its Applications , vol.45
- Parzen, E.¹

67
- 0032139769
- Automatic segmentation of speech recorded in unknown noisy channel characteristics
- Pellom B. L., Hansen J. H. L.: Automatic Segmentation of Speech Recorded in Unknown Noisy Channel Characteristics. Speech Communication, Vol. 25 (1998) 97-116
- (1998) Speech Communication , vol.25 , pp. 97-116
- Pellom, B.L.¹ Hansen, J.H.L.²

68
- 0030374906
- On the robust automatic segmentation of spontaneous speech
- Petek B., Andersen O., Dalsgaard P.: On the Robust Automatic Segmentation of Spontaneous Speech. In Proceedings of International Conference on Spoken Language Processing (1996) 913-916
- (1996) Proceedings of International Conference on Spoken Language Processing , pp. 913-916
- Petek, B.¹ Andersen, O.² Dalsgaard, P.³

69
- 84937349279
- The theory of signal detectability
- Peterson, W., Birdsall, T., Fox, W.: The Theory of Signal Detectability. IEEE Transactions on Information Theory, Vol. 4(4) (1954) 171-212
- (1954) IEEE Transactions on Information Theory , vol.4 , Issue.4 , pp. 171-212
- Peterson, W.¹ Birdsall, T.² Fox, W.³

70
- 0025465111
- Continuous speech recognition using hidden Markov models
- Picone J.: Continuous Speech Recognition Using Hidden Markov Models. IEEE ASSP Magazine (1990) 26-41
- (1990) IEEE ASSP Magazine , pp. 26-41
- Picone, J.¹

71
- 1842475640
- Automatic segmentation of continuous speech using phase group delay functions
- Prasad, V. K., Nagarajan, T., Murthy, H. A.: Automatic Segmentation of Continuous Speech Using Phase Group Delay Functions. Speech Communication, Vol.42 (2004) 429-446
- (2004) Speech Communication , vol.42 , pp. 429-446
- Prasad, V.K.¹ Nagarajan, T.² Murthy, H.A.³

72
- 0003560513
- Prentice Hall, Englewood Cliffs
- Quackenbush, S. R., Barnwell, T. P., Clements, M. A.: Objective Measures of Speech Quality, Prentice Hall, Englewood Cliffs (1988)
- (1988) Objective Measures of Speech Quality
- Quackenbush, S.R.¹ Barnwell, T.P.² Clements, M.A.³

73
- 0004244302
- Prentice-Hall, Inc. Upper Saddle River, NJ
- Rabiner L., Juang B.-H.: Fundamentals of Speech Recognition. Prentice-Hall, Inc. Upper Saddle River, NJ (1993)
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

74
- 0022594196
- An introduction to hidden Markov models
- Rabiner, L.R., Juang, B. H.: An Introduction to Hidden Markov Models. IEEE ASSP Magazine (1986) 4-16
- (1986) IEEE ASSP Magazine , pp. 4-16
- Rabiner, L.R.¹ Juang, B.H.²

75
- 85009251302
- An analysis of transcription consistency in spontaneous speech from the Buckeye corpus
- Denver, USA
- Raymond W. D. et al.: An Analysis of Transcription Consistency in Spontaneous Speech from the Buckeye Corpus. Proceedings of ICSLP '02. Denver, USA (2002).
- (2002) Proceedings of ICSLP '02.
- Raymond, W.D.¹

76
- 23944443102
- Automatic phonetic transcription of non-prompted speech
- San Francisco
- Schiel, F.: Automatic Phonetic Transcription of Non-Prompted Speech. Proceedings of the 14th International Congress on Phonetic Sciences. San Francisco (1999) 607-610
- (1999) Proceedings of the 14th International Congress on Phonetic Sciences , pp. 607-610
- Schiel, F.¹

77
- 51449102009
- Grapheme based recognition for large vocabularies
- Schillo, C., Fink, G. A., Kummert, F.: Grapheme Based Recognition for Large Vocabularies. In Proceedings of International Conference on Spoken Processing (2000) 129-132
- (2000) Proceedings of International Conference on Spoken Processing , pp. 129-132
- Schillo, C.¹ Fink, G.A.² Kummert, F.³

78
- 0029228688
- Automatic speech segmentation using neural tree networks
- Sharma, M., Mammone, R.: Automatic Speech Segmentation Using Neural Tree Networks. In Proceedings of IEEE Workshop on Neural Networks for Signal Processing (1995) 282-290
- (1995) Proceedings of IEEE Workshop on Neural Networks for Signal Processing , pp. 282-290
- Sharma, M.¹ Mammone, R.²

79
- 26844505109
- Signal segmentation into spectral homogeneous units
- Segura-Luna J. C., Soler J. M., Peinado A. M., Sanchez V., Rubio A.: Signal Segmentation into Spectral Homogeneous Units. In Proceedings of European Signal Processing Conference (1990) 1251-1254
- (1990) Proceedings of European Signal Processing Conference , pp. 1251-1254
- Segura-Luna, J.C.¹ Soler, J.M.² Peinado, A.M.³ Sanchez, V.⁴ Rubio, A.⁵

80
- 0025460605
- The application of dynamic programming to connected speech recognition
- Silverman, H. F., Morgan, D. P.: The Application of Dynamic Programming to Connected Speech Recognition. IEEE ASSP Magazine (1990) 6-25
- (1990) IEEE ASSP Magazine , pp. 6-25
- Silverman, H.F.¹ Morgan, D.P.²

81
- 0001263636
- The relation of pitch to frequency
- Stevens, S. S., Volkmann, J.: The Relation of Pitch to Frequency. American Journal of Psychology, Vol. 53(3) (1940) 329-353
- (1940) American Journal of Psychology , vol.53 , Issue.3 , pp. 329-353
- Stevens, S.S.¹ Volkmann, J.²

82
- 34249287540
- Decision tree based text-to-phoneme mapping for speech recognition
- Suontausta, J., Hakkinen, J.: Decision Tree Based Text-to-Phoneme Mapping for Speech Recognition. In Proceedings of International Conference on Spoken Language Processing (2000) 199-202
- (2000) Proceedings of International Conference on Spoken Language Processing , pp. 199-202
- Suontausta, J.¹ Hakkinen, J.²

83
- 0023211850
- On automatic segmentation of speech signals
- Dallas
- Svendsen, T., Soong, F. K.: On Automatic Segmentation of Speech Signals. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Dallas (1987) 77-80
- (1987) Proceedings of International Conference on Acoustics, Speech, and Signal Processing , pp. 77-80
- Svendsen, T.¹ Soong, F.K.²

84
- 0027189330
- An efficient way to learn english grapheme-to-phoneme rules automatically
- Torkkola, K.: An Efficient Way to Learn English Grapheme-to-Phoneme Rules Automatically. In Proceedings of International Conference on Acoustics, Speech and Signal Processing, Vol. 2 (1993) 199-202
- (1993) Proceedings of International Conference on Acoustics, Speech and Signal Processing , vol.2 , pp. 199-202
- Torkkola, K.¹

85
- 0026140140
- Automatic segmentation of speech
- van Hemert, J.P.: Automatic Segmentation of Speech. IEEE Transactions on Signal Processing, Vol. 39 (4) (1991) 1008-1012
- (1991) IEEE Transactions on Signal Processing , vol.39 , Issue.4 , pp. 1008-1012
- Van Hemert, J.P.¹

86
- 0009617005
- A review and new approaches for automatic segmentation of continuous speech signals
- L. Torres et al. (eds): Elsevier Publisher, New-York
- Vidal, E., Marzal, A.: A Review and New Approaches for Automatic Segmentation of Continuous Speech Signals. In L. Torres et al. (eds): Signal Processing V: Theories and Applications, Elsevier Publisher, New-York (1990) 43-53
- (1990) Signal Processing V: Theories and Applications , pp. 43-53
- Vidal, E.¹ Marzal, A.²

87
- 0030264759
- Automatic segmentation and labeling of multi-lingual speech data
- Vorstermans, A., Martens, J.P., Van Coile, B.: Automatic Segmentation and Labeling of Multi-lingual Speech Data. Speech Communication, Vol. 19(4) (1996) 271-293
- (1996) Speech Communication , vol.19 , Issue.4 , pp. 271-293
- Vorstermans, A.¹ Martens, J.P.² Van Coile, B.³

88
- 0037380322
- A new discrete spectral modeling method and an application to CELP coding
- Wei, B., Gibson, J.D.: A New Discrete Spectral Modeling Method and an Application to CELP Coding. IEEE Signal Processing Letters, Vol. 10(4) (2003) 101-103
- (2003) IEEE Signal Processing Letters , vol.10 , Issue.4 , pp. 101-103
- Wei, B.¹ Gibson, J.D.²

89
- 0012715410
- Comparison of distance measure in discrete spectral modeling
- Wei, B., Gibson, J.D.: Comparison of Distance Measure in Discrete Spectral Modeling. In Proceedings of IEEE Digital Signal Processing Workshop (2000) 1-4
- (2000) Proceedings of IEEE Digital Signal Processing Workshop , pp. 1-4
- Wei, B.¹ Gibson, J.D.²

90
- 0029696998
- Pitch determination and speech segmentation using the discrete wavelet transform
- Wendt, C., Petropulu, A.P.: Pitch Determination and Speech Segmentation Using the Discrete Wavelet Transform. In Proceedings of IEEE International Symposium on Circuits and Systems, Vol. 2 (1996) 45-48
- (1996) Proceedings of IEEE International Symposium on Circuits and Systems , vol.2 , pp. 45-48
- Wendt, C.¹ Petropulu, A.P.²

91
- 0030362971
- Estimating the quality of phonetic transcriptions and segmentations of speech signals
- Philadelphia, USA
- Wesenick, M.B., Kipp, A.: Estimating the Quality of Phonetic Transcriptions and Segmentations of Speech Signals. Proceedings of ICSLP'96. Philadelphia, USA (1996) 129-132
- (1996) Proceedings of ICSLP'96 , pp. 129-132
- Wesenick, M.B.¹ Kipp, A.²

92
- 26844495285
- Comparison between expert listeners and continuous speech recognizers in selecting pronunciation variants
- San Francisco
- Wester, M., Kessens, J. M., Cucchiarini, C., Strik, H.: Comparison between Expert Listeners and Continuous Speech Recognizers in Selecting Pronunciation Variants. Proceedings of the 14th Int. Congress of Phonetic Sciences. San Francisco (1999) 723-726
- (1999) Proceedings of the 14th Int. Congress of Phonetic Sciences , pp. 723-726
- Wester, M.¹ Kessens, J.M.² Cucchiarini, C.³ Strik, H.⁴

93
- 26844499938
- Corpus based evaluation of entropy rate speech segmentation
- Wokurek, W.: Corpus Based Evaluation of Entropy Rate Speech Segmentation. In Proceedings of 14th International Congress of Phonetic Sciences (1999) 1217-1220
- (1999) Proceedings of 14th International Congress of Phonetic Sciences , pp. 1217-1220
- Wokurek, W.¹

94
- 0028530231
- State clustering in hidden markov model-based continuous speech recognition
- Young, S. J., Woodland, P. C.: State Clustering in Hidden Markov Model-Based Continuous Speech Recognition. Computer Speech and Language, Vol.8 (1994) 369-383
- (1994) Computer Speech and Language , vol.8 , pp. 369-383
- Young, S.J.¹ Woodland, P.C.²

95
- 0024922971
- Acoustic segmentation and phonetic classification in the summit system
- Zue, V.W., Glass, J. R., Phillips, M., Seneff, S.: Acoustic Segmentation and Phonetic Classification in the Summit System. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing (1989) 389-392
- (1989) Proceedings of International Conference on Acoustics, Speech, and Signal Processing , pp. 389-392
- Zue, V.W.¹ Glass, J.R.² Phillips, M.³ Seneff, S.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.