메뉴 건너뛰기




Volumn 5221 LNAI, Issue , 2008, Pages 4-15

Speech processing for audio indexing

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTATIONAL LINGUISTICS; INDEXING (MATERIALS WORKING); LINGUISTICS; NATURAL LANGUAGE PROCESSING SYSTEMS; SPEECH ANALYSIS; SPEECH PROCESSING; SPEECH RECOGNITION; TRANSCRIPTION;

EID: 52149106921     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-85287-2_2     Document Type: Conference Paper
Times cited : (47)

References (33)
  • 2
    • 85013700737 scopus 로고    scopus 로고
    • Schultz, T, Kirchhoff, K, eds, Elsevier, Amsterdam
    • Schultz, T., Kirchhoff, K. (eds.): Multilingual Speech Processing. Elsevier, Amsterdam (2006)
    • (2006) Multilingual Speech Processing
  • 3
    • 52149120165 scopus 로고    scopus 로고
    • Bourlard, H., Furui, S., Morgan, N., Strik, H. (eds.): Modeling pronunciation variation for automatic speech recognition.In: Speech Communication, 29(2-4) (November 1999) (Special issue)
    • Bourlard, H., Furui, S., Morgan, N., Strik, H. (eds.): Modeling pronunciation variation for automatic speech recognition.In: Speech Communication, vol. 29(2-4) (November 1999) (Special issue)
  • 4
    • 19944378102 scopus 로고    scopus 로고
    • Fosler-Lussier, E., Byrne, W., Jurafsky, D. (eds.): Pronunciation Modeling and Lexicon Adaptation.In: Speech communication, 46(2) (June 2005) (Special issue)
    • Fosler-Lussier, E., Byrne, W., Jurafsky, D. (eds.): Pronunciation Modeling and Lexicon Adaptation.In: Speech communication, vol. 46(2) (June 2005) (Special issue)
  • 5
    • 0033354260 scopus 로고    scopus 로고
    • Pronunciation variants across system configuration, language and speaking style
    • Adda-Decker, M., Lamel, L.: Pronunciation variants across system configuration, language and speaking style. Speech Communication 29(2-4), 83-98 (1999)
    • (1999) Speech Communication , vol.29 , Issue.2-4 , pp. 83-98
    • Adda-Decker, M.1    Lamel, L.2
  • 6
    • 0036460898 scopus 로고    scopus 로고
    • An overview of decoding techniques for large vocabulary continuous speech recognition
    • Aubert, X.L.: An overview of decoding techniques for large vocabulary continuous speech recognition. Computer Speech & Language 16(1), 89-114 (2002)
    • (2002) Computer Speech & Language , vol.16 , Issue.1 , pp. 89-114
    • Aubert, X.L.1
  • 7
    • 52149092625 scopus 로고    scopus 로고
    • Bahl, L.R., Baker, J.K., Cohen, P.S., Dixon, N.R., Jelinek, F., Mercer, R.L., Silverman, H.F.: Preliminary results on the performance of a system for the automatic recognition of continuous speech. In: IEEE ICASSP-1976, Philadelphia (April 1976)
    • Bahl, L.R., Baker, J.K., Cohen, P.S., Dixon, N.R., Jelinek, F., Mercer, R.L., Silverman, H.F.: Preliminary results on the performance of a system for the automatic recognition of continuous speech. In: IEEE ICASSP-1976, Philadelphia (April 1976)
  • 9
    • 52149106986 scopus 로고    scopus 로고
    • Bulyko, I., Ostendorf, M., Stolcke, A.: Gtting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures. In: Hearst, M., Ostendorf, M. (eds.) HLT-NAACL 2003, Edmonton, March 2003, 2, pp. 7-9 (2003)
    • Bulyko, I., Ostendorf, M., Stolcke, A.: Gtting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures. In: Hearst, M., Ostendorf, M. (eds.) HLT-NAACL 2003, Edmonton, March 2003, vol. 2, pp. 7-9 (2003)
  • 10
    • 0031233424 scopus 로고    scopus 로고
    • Speaker Recognition: A Tutorial
    • September
    • Campbell, J.: Speaker Recognition: A Tutorial. Proc. of the IEEE 85(9) (September 1997)
    • (1997) Proc. of the IEEE , vol.85 , Issue.9
    • Campbell, J.1
  • 13
    • 85009254284 scopus 로고    scopus 로고
    • TRAPs - classifiers of TempoRAl Patterns
    • Sydney November
    • Hermansky, H., Sharma, S.: TRAPs - classifiers of TempoRAl Patterns. In: ICSLP 1998, Sydney (November 1998)
    • (1998) ICSLP
    • Hermansky, H.1    Sharma, S.2
  • 14
    • 0016939124 scopus 로고
    • Continuous Speech Recognition by Statistical Methods
    • Jelinek, F.: Continuous Speech Recognition by Statistical Methods. Proc. of the IEEE 64(4), 532-556 (1976)
    • (1976) Proc. of the IEEE , vol.64 , Issue.4 , pp. 532-556
    • Jelinek, F.1
  • 15
    • 0023312404 scopus 로고
    • Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer
    • Katz, S.M.: Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer. IEEE Trans. Acoustics, Speech & Signal Processing ASSP-35(3), 400-401 (1987)
    • (1987) IEEE Trans. Acoustics, Speech & Signal Processing , vol.ASSP-35 , Issue.3 , pp. 400-401
    • Katz, S.M.1
  • 16
    • 85135261720 scopus 로고    scopus 로고
    • Unsupervised Training of a Speech Recognizer: Recent Experiments
    • Budapest, Hungary, September
    • Kemp, T., Waibel, A.: Unsupervised Training of a Speech Recognizer: Recent Experiments. In: ESCA Eurospeech 1999, Budapest, Hungary, September 1999, vol. 6, pp. 2725-2728 (1999)
    • (1999) ESCA Eurospeech 1999 , vol.6 , pp. 2725-2728
    • Kemp, T.1    Waibel, A.2
  • 17
    • 84893372402 scopus 로고    scopus 로고
    • Using Quick Transcriptions to Improve Conversational Speech Models
    • Jeju, October
    • Kimball, O., Kao, C.L., Iyer, R., Arvizo, T., Makhoul, J.: Using Quick Transcriptions to Improve Conversational Speech Models. In: ICSLP 2004, Jeju, (October 2004)
    • (2004) ICSLP
    • Kimball, O.1    Kao, C.L.2    Iyer, R.3    Arvizo, T.4    Makhoul, J.5
  • 19
    • 52149118752 scopus 로고    scopus 로고
    • Speech Recognition
    • Mitkov, R, ed, Oxford University Press, Oxford
    • Lamel, L., Gauvain, J.L.: Speech Recognition. In: Mitkov, R. (ed.) Chapter 16 in OUP Handbook on Computational Linguistics, pp. 305-322. Oxford University Press, Oxford (2003)
    • (2003) in OUP Handbook on Computational Linguistics , pp. 305-322
    • Lamel, L.1    Gauvain, J.L.2
  • 20
    • 0036460908 scopus 로고    scopus 로고
    • Lightly Supervised and Unsupervised Acoustic Model Training
    • Lamel, L., Gauvain, J.L., Adda, G.: Lightly Supervised and Unsupervised Acoustic Model Training. Computer, Speech & Language 16(1), 115-229 (2002)
    • (2002) Computer, Speech & Language , vol.16 , Issue.1 , pp. 115-229
    • Lamel, L.1    Gauvain, J.L.2    Adda, G.3
  • 21
    • 52149123032 scopus 로고    scopus 로고
    • Lamel, L., Gauvain, J.L., Adda, G., Adda-Decker, M., Canseco, L., Chen, L., Galibert, O., Messaoudi, A., Schwenk, H.: Speech Transcription in Multiple Languages. In: IEEE ICASSP 2004, Montreal (April 2004)
    • Lamel, L., Gauvain, J.L., Adda, G., Adda-Decker, M., Canseco, L., Chen, L., Galibert, O., Messaoudi, A., Schwenk, H.: Speech Transcription in Multiple Languages. In: IEEE ICASSP 2004, Montreal (April 2004)
  • 22
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • Lippmann, R.P.: Speech recognition by machines and humans. Speech Communication 22(1), 1-16
    • Speech Communication , vol.22 , Issue.1 , pp. 1-16
    • Lippmann, R.P.1
  • 23
    • 44949137618 scopus 로고    scopus 로고
    • Experimental detection of vowel pronunciation variants in Amharic
    • Genoa
    • Pellegrini, T., Lamel, L.: Experimental detection of vowel pronunciation variants in Amharic. In: LREC 2006, Genoa (2006)
    • (2006) LREC
    • Pellegrini, T.1    Lamel, L.2
  • 24
    • 52149100845 scopus 로고    scopus 로고
    • Technology Advancements have Required NIST Evaluations to Change Data and Tasks - and now Metrics
    • Presented at the, Marrakesh
    • Przybocki, M.: Technology Advancements have Required NIST Evaluations to Change Data and Tasks - and now Metrics. In: Presented at the ELRA Workshop on Evaluation, LREC 2008, Marrakesh (2008)
    • (2008) ELRA Workshop on Evaluation, LREC
    • Przybocki, M.1
  • 26
    • 0040262071 scopus 로고
    • Human Benchmarks for Speaker Independent Large Vocabulary Recognition Performance
    • Madrid, pp, September
    • van Leeuwen, D.A., van den Berg, L.G., Steeneken, H.J.M.: Human Benchmarks for Speaker Independent Large Vocabulary Recognition Performance. In: ESCA Eurospeech 1995, Madrid, pp. 1461-1464 (September 1995)
    • (1995) ESCA Eurospeech 1995 , pp. 1461-1464
    • van Leeuwen, D.A.1    van den Berg, L.G.2    Steeneken, H.J.M.3
  • 27
    • 33646907991 scopus 로고    scopus 로고
    • Two decades of statistical language modeling: Where do we go from here?
    • Rosenfeld, R.: Two decades of statistical language modeling: where do we go from here? Proc. IEEE 88(8), 1270-1278 (1999)
    • (1999) Proc. IEEE , vol.88 , Issue.8 , pp. 1270-1278
    • Rosenfeld, R.1
  • 28
    • 33847610331 scopus 로고    scopus 로고
    • Continuous space language models
    • Schwenk, H.: Continuous space language models. Computer Speech and Language 21, 492- 518 (2007)
    • (2007) Computer Speech and Language , vol.21 , pp. 492-518
    • Schwenk, H.1
  • 29
    • 0001476520 scopus 로고    scopus 로고
    • SpeechBot: A speech recognition based audio indexing system for the web
    • Content-Based Multimedia Information Access, Paris, pp, April
    • Van Thong, J.M., Goddeau, D., Litvinova, A., Logan, B., Moreno, P., Swain, M.: SpeechBot: a speech recognition based audio indexing system for the web. In: RIAO 2000 Content-Based Multimedia Information Access, Paris, pp. 106-115 (April 2000)
    • (2000) RIAO , pp. 106-115
    • Van Thong, J.M.1    Goddeau, D.2    Litvinova, A.3    Logan, B.4    Moreno, P.5    Swain, M.6
  • 31
    • 70350349017 scopus 로고    scopus 로고
    • Zhu, X., Barras, C., Lamel, L., Gauvain, J.L.: Speaker Diarization: from Broadcast News to Lectures. In: Renals, S., Bengio, S., Fiscus, J. (eds.) MLMI 2006. LNCS, 4299, pp. 396-406. Springer, Heidelberg (2006)
    • Zhu, X., Barras, C., Lamel, L., Gauvain, J.L.: Speaker Diarization: from Broadcast News to Lectures. In: Renals, S., Bengio, S., Fiscus, J. (eds.) MLMI 2006. LNCS, vol. 4299, pp. 396-406. Springer, Heidelberg (2006)
  • 32
    • 33745185321 scopus 로고    scopus 로고
    • Using MLP features in SRI's conversational speech recognition system
    • Lisbon
    • Zhu, Q., Stolcke, A., Chen, B.Y., Morgan, N.: Using MLP features in SRI's conversational speech recognition system. Interspeech 2005, 2141-2144, Lisbon (2005)
    • (2005) Interspeech 2005 , pp. 2141-2144
    • Zhu, Q.1    Stolcke, A.2    Chen, B.Y.3    Morgan, N.4
  • 33
    • 0029733178 scopus 로고    scopus 로고
    • Comparison of Four Approaches to Automatic Language Identification of Telephone Speech
    • Zissman, M.A.: Comparison of Four Approaches to Automatic Language Identification of Telephone Speech. IEEE Trans. Speech and Audio Proc. 4(1), 31-44 (1996)
    • (1996) IEEE Trans. Speech and Audio Proc , vol.4 , Issue.1 , pp. 31-44
    • Zissman, M.A.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.