메뉴 건너뛰기




Volumn 56, Issue 1, 2014, Pages 85-100

Automatic speech recognition for under-resourced languages: A survey

Author keywords

Automatic pronunciation generation; Automatic speech recognition (ASR); Crosslingual acoustic modeling and adaptation; Language portability; Lexical modeling; Speech and language resources acquisition; Statistical language modeling; Under resourced languages

Indexed keywords

DEEP NEURAL NETWORKS; MODELING LANGUAGES; NATURAL LANGUAGE PROCESSING SYSTEMS; SPEECH; SPEECH PROCESSING; SPEECH SYNTHESIS; SURVEYS;

EID: 84893667016     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2013.07.008     Document Type: Article
Times cited : (427)

References (138)
  • 1
    • 44949086643 scopus 로고    scopus 로고
    • Automatic transcription of somali language
    • Pittsburgh, PA, USA
    • Abdillahi, N., Nocera, P., Bonastre, J.-F., 2006. Automatic transcription of Somali language. In: ICSLP'06, Pittsburgh, PA, USA, pp. 289-292.
    • (2006) ICSLP'06 , pp. 289-292
    • Abdillahi, N.1    Nocera, P.2    Bonastre, J.-F.3
  • 3
    • 85009202964 scopus 로고    scopus 로고
    • A corpus-based decompounding algorithm for german lexical modeling in LVCSR
    • Geneva, Switzerland
    • Adda-Decker, M., 2003. A corpus-based decompounding algorithm for German lexical modeling in LVCSR. In: Proc. Eurospeech-2003, Geneva, Switzerland, pp. 257-260.
    • (2003) Proc. Eurospeech-2003 , pp. 257-260
    • Adda-Decker, M.1
  • 4
    • 33745683825 scopus 로고    scopus 로고
    • A unified language model for large vocabulary continuous speech recognition of turkish
    • Arisoy, E., Dutagaci, H., Arslan, L., 2006. A unified language model for large vocabulary continuous speech recognition of Turkish. Signal Processing 86 (10), 2844-2862.
    • (2006) Signal Processing , vol.86 , Issue.10 , pp. 2844-2862
    • Arisoy, E.1    Dutagaci, H.2    Arslan, L.3
  • 6
    • 70450219527 scopus 로고    scopus 로고
    • ASR corpus design for resource-scarce languages
    • Barnard, E., Davel, M., van Heerden, C., 2009. ASR corpus design for resource-scarce languages. In: Proc. Interspeech, pp. 2847-2850.
    • (2009) Proc. Interspeech , pp. 2847-2850
    • Barnard, E.1    Davel, M.2    Van Heerden, C.3
  • 10
    • 48749106056 scopus 로고    scopus 로고
    • Towards speech translation of non written languages
    • Aruba, December 2006
    • Besacier, L., Zhou, B., Gao, Y., 2006. Towards speech translation of non written languages. In: IEEE/ACL SLT 2006. Aruba, December 2006.
    • (2006) IEEE/ACL SLT 2006
    • Besacier, L.1    Zhou, B.2    Gao, Y.3
  • 13
    • 84893666435 scopus 로고    scopus 로고
    • Transcribing southern min speech corpora with a web-based language learning system
    • Hanoi, Vietnam
    • Cai, J., 2008. Transcribing southern min speech corpora with a web-based language learning system. In: SLTU'08, Hanoi, Vietnam.
    • (2008) SLTU'08
    • Cai, J.1
  • 14
    • 85030413663 scopus 로고    scopus 로고
    • Turkish LVCSR: Towards better speech recognition for agglutinative languages
    • Carki, K., Geutner, P., Schultz, T., 2000. Turkish LVCSR: towards better speech recognition for agglutinative languages. In: IEEE ICASSP.
    • (2000) IEEE ICASSP
    • Carki, K.1    Geutner, P.2    Schultz, T.3
  • 15
    • 69249125883 scopus 로고    scopus 로고
    • Unsupervised adaptive speech technology for limited resource languages: A case study for tamil
    • Hanoi, Vietnam
    • Cetin, O., 2008. Unsupervised adaptive speech technology for limited resource languages: A case study for Tamil. In: SLTU'08, Hanoi, Vietnam.
    • (2008) SLTU'08
    • Cetin, O.1
  • 17
    • 25844455735 scopus 로고    scopus 로고
    • Syntax-based language models for machine translation
    • New Orleans, USA
    • Charniak, E., Knight, K., Yamada, K., 2003. Syntax-based language models for machine translation. In: Proc. IX MT Summit, New Orleans, USA, pp. 40-46.
    • (2003) Proc. IX MT Summit , pp. 40-46
    • Charniak, E.1    Knight, K.2    Yamada, K.3
  • 22
    • 33750359664 scopus 로고    scopus 로고
    • Unsupervised morpheme segmentation and morphology induction from text corpora using morfessor 1.0
    • Helsinki University of Technology, Finland
    • Creutz, M., Lagus, K., 2005. Unsupervised morpheme segmentation and morphology induction from text corpora using Morfessor 1.0. Computer and Information Science, Report A81, Helsinki University of Technology, Finland.
    • (2005) Computer and Information Science, Report A81
    • Creutz, M.1    Lagus, K.2
  • 25
    • 84858978033 scopus 로고    scopus 로고
    • Investigating the role of machine translated text in ASR domain adaptation: Unsuper-vised and semi-supervised methods
    • Hawaii, USA
    • Cucu, H., Besacier, L., Burileanu, C., Buzo, A., 2011. Investigating the role of machine translated text in ASR domain adaptation: unsuper-vised and semi-supervised methods. In: Proc. ASRU 2011, Hawaii, USA.
    • (2011) Proc. ASRU 2011
    • Cucu, H.1    Besacier, L.2    Burileanu, C.3    Buzo, A.4
  • 26
    • 84893681586 scopus 로고    scopus 로고
    • ASR domain adaptation methods for low-resourced languages: Application to Romanian language
    • Bucarest, Romania
    • Cucu, H., Besacier, L., Burileanu, C., Buzo, A., 2012. ASR domain adaptation methods for low-resourced languages: application to Romanian language. In: EUSIPCO'2012, Bucarest, Romania.
    • (2012) EUSIPCO'2012
    • Cucu, H.1    Besacier, L.2    Burileanu, C.3    Buzo, A.4
  • 27
    • 84893703276 scopus 로고    scopus 로고
    • SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian
    • http://dx.doi.org/10.1016/j.specom.2013.05.003
    • Cucu, H., Buzo, A., Besacier, L., Burileanu, C., 2013. SMT-based ASR domain adaptation methods for under-resourced languages: application to Romanian. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.05.003.
    • (2013) Speech Communication
    • Cucu, H.1    Buzo, A.2    Besacier, L.3    Burileanu, C.4
  • 28
  • 29
    • 84865706565 scopus 로고    scopus 로고
    • Woefzela - An open-source platform for ASR data collection in the developing world
    • De Vries, N.J., Badenhorst, J., Davel, M.H., Barnard, E., De Waal, A., 2011. Woefzela - an open-source platform for ASR data collection in the developing world. In: Proc. Interspeech, pp. 3177-3180.
    • (2011) Proc. Interspeech , pp. 3177-3180
    • De Vries, N.J.1    Badenhorst, J.2    Davel, M.H.3    Barnard, E.4    De Waal, A.5
  • 31
    • 51449096957 scopus 로고    scopus 로고
    • The character as an appropriate unit of processing for non-segmenting languages
    • Tokyo, Japan
    • Denoual, E., Lepage, Y., 2006. The character as an appropriate unit of processing for non-segmenting languages. In: NLP Annual Meeting, Tokyo, Japan, pp. 731-734.
    • (2006) NLP Annual Meeting , pp. 731-734
    • Denoual, E.1    Lepage, Y.2
  • 33
    • 84885550611 scopus 로고
    • The philips large-vocabulary recognition system for American english, french, and german
    • Madrid
    • Dugast, C., Aubert, X., Kneser, R., 1995. The Philips large-vocabulary recognition system for American English, French, and German. In: Proc. Eurospeech, Madrid, pp. 197-200.
    • (1995) Proc. Eurospeech , pp. 197-200
    • Dugast, C.1    Aubert, X.2    Kneser, R.3
  • 34
    • 84893670051 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis for ibibio
    • http://dx.doi.org/10.1016/j.specom.2013.02.003
    • Ekpenyong, M., Urua, E.-A., Watts, O., King, S., Yamagishi, J., 2013. Statistical parametric speech synthesis for Ibibio, Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.02.003.
    • (2013) Speech Communication
    • Ekpenyong, M.1    Urua, E.-A.2    Watts, O.3    King, S.4    Yamagishi, J.5
  • 37
    • 84893680491 scopus 로고    scopus 로고
    • Using automatic speech recognition for phonological purposes: Study of vowel length in punu (Bantu B40)
    • New Mexico (US), July 2010
    • Gelas, H., Besacier, L., Rossato, S., Pellegrino, F., 2010. Using automatic speech recognition for phonological purposes: study of vowel length in Punu (Bantu B40). In: Laphon 12, New Mexico (US), July 2010.
    • (2010) Laphon 12
    • Gelas, H.1    Besacier, L.2    Rossato, S.3    Pellegrino, F.4
  • 38
    • 84865722538 scopus 로고    scopus 로고
    • Quality assessment of crowdsourcing transcriptions for african languages
    • Italy, 28-31 August 2011
    • Gelas, H., Teferra Abate, S., Besacier, L., Pellegrino, F., 2011. Quality assessment of crowdsourcing transcriptions for African languages. In: Interspeech 2011 Florence, Italy, 28-31 August 2011.
    • (2011) Interspeech 2011 Florence
    • Gelas, H.1    Teferra Abate, S.2    Besacier, L.3    Pellegrino, F.4
  • 39
    • 85030406898 scopus 로고    scopus 로고
    • A hierarchical exemplar-based sparse model of speech with an application to ASR
    • HI, USA
    • Gemmeke, J.F., Van hamme, H., 2011. A hierarchical exemplar-based sparse model of speech with an application to ASR. IEEE ASRU 2011, HI, USA.
    • (2011) IEEE ASRU 2011
    • Gemmeke, J.F.1    Van Hamme, H.2
  • 41
    • 84893699024 scopus 로고    scopus 로고
    • Multiple pronunciation model for amharic speech recognition system
    • Hanoi, Vietnam
    • Gizaw, S., 2008. Multiple pronunciation model for Amharic speech recognition system. In: SLTU 2008, Hanoi, Vietnam.
    • (2008) SLTU 2008
    • Gizaw, S.1
  • 44
    • 0030640721 scopus 로고    scopus 로고
    • A multilingual phoneme and model set: Towards a universal base for automatic speech recognition
    • St. Barbara CA
    • Gokcen, S., Gokcen, J., 1997. A multilingual phoneme and model set: towards a universal base for automatic speech recognition. In: Proc. Automatic Speech Recognition and Understanding (ASRU), St. Barbara CA, pp. 599-603.
    • (1997) Proc. Automatic Speech Recognition and Understanding (ASRU) , pp. 599-603
    • Gokcen, S.1    Gokcen, J.2
  • 45
    • 34547548235 scopus 로고    scopus 로고
    • Probabilistic and bottle-neck features for LVCSR of meetings
    • USA
    • Grezl, F., et al., 2007. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. ICASSP, USA.
    • (2007) Proc. ICASSP
    • Grezl, F.1
  • 46
    • 0033709098 scopus 로고    scopus 로고
    • Tandem connectionist feature extraction for conventional HMM systems
    • Turkey
    • Hermansky, H., Wellis, D., Sharma, S., 2000. Tandem connectionist feature extraction for conventional HMM systems. In: Proc. ICASSP, Turkey.
    • (2000) Proc. ICASSP
    • Hermansky, H.1    Wellis, D.2    Sharma, S.3
  • 47
    • 0141476926 scopus 로고    scopus 로고
    • Accent modeling based on pronunciation dictionary adaptation for large vocabulary mandarin speech recognition
    • Beijing, China
    • Huang, C., Chang, E., Zhou, J., Lee K.-F., 2000. Accent modeling based on pronunciation dictionary adaptation for large vocabulary Mandarin speech recognition. In: Proc. INTERSPEECH-2000, Beijing, China, pp. 818-821.
    • (2000) Proc. INTERSPEECH-2000 , pp. 818-821
    • Huang, C.1    Chang, E.2    Zhou, J.3    Lee, K.-F.4
  • 48
    • 77950944695 scopus 로고    scopus 로고
    • Morpho-syntactic postprocessing of N-best lists for improved french automatic speech recognition
    • Huet, S., Gravier, G., Sebillot, P., 2010. Morpho-syntactic postprocessing of N-best lists for improved French automatic speech recognition. Computer Speech and Language 24 (4), 663-684.
    • (2010) Computer Speech and Language , vol.24 , Issue.4 , pp. 663-684
    • Huet, S.1    Gravier, G.2    Sebillot, P.3
  • 49
    • 79959826604 scopus 로고    scopus 로고
    • Building transcribed speech corpora quickly and cheaply for many languages
    • Makuhari, Japan
    • Hughes, T., Nakajima, K., Ha, L., Moreno, P., LeBeau, M., 2010. Building transcribed speech corpora quickly and cheaply for many languages. In: Proc. Interspeech, Makuhari, Japan, pp. 1914-1917.
    • (2010) Proc. Interspeech , pp. 1914-1917
    • Hughes, T.1    Nakajima, K.2    Ha, L.3    Moreno, P.4    LeBeau, M.5
  • 51
    • 69249138826 scopus 로고    scopus 로고
    • Development of a speech recognition system for Icelandic using machine translated text
    • Hanoi, Vietnam
    • Jensson, A., 2008. Development of a speech recognition system for Icelandic using machine translated text. In: SLTU'08, Hanoi, Vietnam.
    • (2008) SLTU'08
    • Jensson, A.1
  • 54
    • 84893657385 scopus 로고    scopus 로고
    • Multilingual acoustic modeling using graphemes
    • Geneva, Switzerland
    • Kanthak, S., Ney, H., 2003. Multilingual acoustic modeling using graphemes. In: Eurospeech-2003, Geneva, Switzerland, pp. 1145-1148.
    • (2003) Eurospeech-2003 , pp. 1145-1148
    • Kanthak, S.1    Ney, H.2
  • 55
    • 77956597632 scopus 로고    scopus 로고
    • Comparing SMT methods for automatic generation of pronunciation variants
    • Reykjavik, Iceland
    • Karanasou, P., Lamel, L., 2010. Comparing SMT methods for automatic generation of pronunciation variants. In: IceTAL 2010, Reykjavik, Iceland, p. 167.
    • (2010) IceTAL 2010 , pp. 167
    • Karanasou, P.1    Lamel, L.2
  • 56
    • 84859039566 scopus 로고    scopus 로고
    • Very large vocabulary ASR for spoken Russian with syntactic and morphemic analysis
    • Florence, Italy
    • Karpov, A., Kipyatkova, I., Ronzhin, A., 2011. Very large vocabulary ASR for spoken Russian with syntactic and morphemic analysis. In: Proc. Interspeech'2011, Florence, Italy, pp. 3161-3164.
    • (2011) Proc. Interspeech'2011 , pp. 3161-3164
    • Karpov, A.1    Kipyatkova, I.2    Ronzhin, A.3
  • 57
    • 84893652340 scopus 로고    scopus 로고
    • Large vocabulary Russian speech recognition using syntactico-statistical language modeling
    • http://dx.doi.org/10.1016/j.specom.2013.07.004
    • Karpov, A., Markov, K., Kipyatkova, I., Vazhenina, D., Ronzhin, A., 2013. Large vocabulary Russian speech recognition using syntactico-statistical language modeling. Speech Communication. http://dx.doi.org/10.1016/j.specom. 2013.07.004.
    • (2013) Speech Communication
    • Karpov, A.1    Markov, K.2    Kipyatkova, I.3    Vazhenina, D.4    Ronzhin, A.5
  • 60
    • 84872512624 scopus 로고    scopus 로고
    • Analysis of long-distance word dependencies and pronunciation variability at conversational Russian speech recognition
    • Wroclav, Poland
    • Kipyatkova, I., Karpov, A., Verkhodanova, V., Zelezny, M., 2012. Analysis of long-distance word dependencies and pronunciation variability at conversational Russian speech recognition. In: Proc. FedCSIS-2012, Wroclav, Poland, pp. 719-725.
    • (2012) Proc. FedCSIS-2012 , pp. 719-725
    • Kipyatkova, I.1    Karpov, A.2    Verkhodanova, V.3    Zelezny, M.4
  • 61
    • 0031619917 scopus 로고    scopus 로고
    • Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks
    • Seattle
    • Köhler, J., 1998. Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks. In: Proc. ICASSP, Seattle, pp. 417-420.
    • (1998) Proc. ICASSP , pp. 417-420
    • Köhler, J.1
  • 62
    • 84893646836 scopus 로고    scopus 로고
    • The basic language resource kit (BLARK) as the first milestone for the language resources roadmap
    • Moscow, Russia
    • Krauwer, S., 2003. The basic language resource kit (BLARK) as the first milestone for the language resources roadmap. In: Proceedings of the 2003 International Workshop Speech and Computer SPECOM-2003, Moscow, Russia, pp. 8-15.
    • (2003) Proceedings of the 2003 International Workshop Speech and Computer SPECOM-2003 , pp. 8-15
    • Krauwer, S.1
  • 65
    • 44949103508 scopus 로고    scopus 로고
    • Unsupervised segmentation of words into morphemes - Morpho challenge. Application to automatic speech recognition
    • Pittsburgh, PA, USA
    • Kurimo, M., et al., 2006. Unsupervised segmentation of words into morphemes - Morpho Challenge. Application to automatic speech recognition. In: Proc. Interspeech'06, Pittsburgh, PA, USA, pp. 1021-1024.
    • (2006) Proc. Interspeech'06 , pp. 1021-1024
    • Kurimo, M.1
  • 66
    • 0000143923 scopus 로고
    • Issues in large vocabulary multilingual speech recognition
    • Madrid
    • Lamel, L., Adda-Decker, M., Gauvain, J.L., 1995. Issues in large vocabulary multilingual speech recognition. In: Proc. Eurospeech, Madrid, pp. 185-189.
    • (1995) Proc. Eurospeech , pp. 185-189
    • Lamel, L.1    Adda-Decker, M.2    Gauvain, J.L.3
  • 67
    • 70450194704 scopus 로고    scopus 로고
    • Grapheme to phoneme conversion using an SMT system
    • Brighton, UK
    • Laurent, A., Deléglise, P., Meignier, S., 2009. Grapheme to phoneme conversion using an SMT system. In: Interspeech 2009, Brighton, UK, pp. 708-711.
    • (2009) Interspeech 2009 , pp. 708-711
    • Laurent, A.1    Deléglise, P.2    Meignier, S.3
  • 68
    • 69249139569 scopus 로고    scopus 로고
    • Automatic speech recognition for under-resourced languages: Application to vietnamese language
    • Le, V.-B., Besacier, L., 2009. Automatic speech recognition for under-resourced languages: application to Vietnamese language. IEEE Transactions on Audio, Speech and Language Processing 17(8), 1471-1482.
    • (2009) IEEE Transactions on Audio, Speech and Language Processing , vol.17 , Issue.8 , pp. 1471-1482
    • Le, V.-B.1    Besacier, L.2
  • 69
    • 33646820243 scopus 로고    scopus 로고
    • Using the web for fast language model construction in minority languages
    • Geneva, Switzerland
    • Le, V.B., Bigi, B., Besacier, L., Castelli, E., 2003. Using the Web for fast language model construction in minority languages. In: Euro-speech'03, Geneva, Switzerland, pp. 3117-3120.
    • (2003) Euro-speech'03 , pp. 3117-3120
    • Le, V.B.1    Bigi, B.2    Besacier, L.3    Castelli, E.4
  • 71
    • 70450181523 scopus 로고    scopus 로고
    • Cross-language bootstrapping for unsupervised acoustic model training: Rapid development of a polish speech recognition system
    • Brighton, UK
    • Loof, J., Gollan, C., Ney, H., 2009. Cross-language bootstrapping for unsupervised acoustic model training: rapid development of a Polish speech recognition system. In: Interspeech 2009. Brighton, UK.
    • (2009) Interspeech 2009
    • Loof, J.1    Gollan, C.2    Ney, H.3
  • 72
    • 33646070918 scopus 로고    scopus 로고
    • Modeling syntax of free word-order languages: Dependency analysis by reduction
    • Springer LNAI 3658, Karlovy Vary, Czech Republic
    • Lopatková, M., Plátek, M., Kuboň, V., 2005. Modeling syntax of free word-order languages: dependency analysis by reduction. In: Proc. TSD'2005, Springer LNAI 3658, Karlovy Vary, Czech Republic, pp. 140-147.
    • (2005) Proc. TSD'2005 , pp. 140-147
    • Lopatková, M.1    Plátek, M.2    Kuboň, V.3
  • 73
    • 56149124525 scopus 로고    scopus 로고
    • Morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - Like hungarian
    • Antwerp, Belgium
    • Mihajlik, P., Fegyó, T., Tüske, Z., Ircing, P., 2007. Morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian. In: Interspeech'07, Antwerp, Belgium.
    • (2007) Interspeech'07
    • Mihajlik, P.1    Fegyó, T.2    Tüske, Z.3    Ircing, P.4
  • 77
    • 33745195515 scopus 로고    scopus 로고
    • Language model adaptation with additional text generated by machine translation
    • Taipei, Taiwan
    • Nakajima, H., Yamamoto, H., Watanabe, T., 2002. Language model adaptation with additional text generated by machine translation. In: COLING 2002, Vol. 2, Taipei, Taiwan, pp. 716-722.
    • (2002) COLING 2002 , vol.2 , pp. 716-722
    • Nakajima, H.1    Yamamoto, H.2    Watanabe, T.3
  • 81
    • 79951777091 scopus 로고    scopus 로고
    • Toward better crowdsourced transcription: Transcription of a year of the let's go bus information system data
    • Berkeley, California, December 2010
    • Parent, G., Eskenazi, M., 2010. Toward better crowdsourced transcription: transcription of a year of the Let's Go bus information system data. In: Proceedings of IEEE Workshop on Spoken Language Technology, Berkeley, California, December 2010, pp. 312-317.
    • (2010) Proceedings of IEEE Workshop on Spoken Language Technology , pp. 312-317
    • Parent, G.1    Eskenazi, M.2
  • 83
    • 77953970245 scopus 로고    scopus 로고
    • Avaaj otalo: A field study of an interactive voice forum for small farmers in rural India
    • Patel, N., Chittamuru, D., Jain, A., Dave, P., Parikh, T.S., 2010. Avaaj Otalo: A field study of an interactive voice forum for small farmers in rural India. In: CHI. ACM, pp. 733-742.
    • (2010) CHI. ACM , pp. 733-742
    • Patel, N.1    Chittamuru, D.2    Jain, A.3    Dave, P.4    Parikh, T.S.5
  • 84
    • 84893644519 scopus 로고    scopus 로고
    • Investigating automatic decomposition for ASR in less represented languages
    • Pittsburgh
    • Pellegrini, T., Lamel, L., 2006. Investigating automatic decomposition for ASR in less represented languages. In: ICSLP'06, Pittsburgh.
    • (2006) ICSLP'06
    • Pellegrini, T.1    Lamel, L.2
  • 85
    • 69249115863 scopus 로고    scopus 로고
    • Are audio or textual training data more important for ASR in less-represented languages?
    • Hanoi, Vietnam
    • Pellegrini, T., Lamel, L., 2008. Are audio or textual training data more important for ASR in less-represented languages?. In: SLTU'08, Hanoi, Vietnam.
    • (2008) SLTU'08
    • Pellegrini, T.1    Lamel, L.2
  • 86
    • 85008009596 scopus 로고    scopus 로고
    • Automatic word decompounding for ASR in a morphologically rich language: Application to amharic
    • Pellegrini, T., Lamel, L., 2009. Automatic word decompounding for ASR in a morphologically rich language: application to Amharic. IEEE Transactions on Audio, Speech & Language Processing 17 (5), 863-873.
    • (2009) IEEE Transactions on Audio, Speech & Language Processing , vol.17 , Issue.5 , pp. 863-873
    • Pellegrini, T.1    Lamel, L.2
  • 87
    • 84858976609 scopus 로고    scopus 로고
    • Cross-lingual portability of Chinese and english neural network features for french and german LVCSR
    • USA
    • Plahl, C., Schlueter, R., Ney, H., 2011. Cross-lingual portability of Chinese and English neural network features for French and German LVCSR. In: Proc. ASRU, USA.
    • (2011) Proc. ASRU
    • Plahl, C.1    Schlueter, R.2    Ney, H.3
  • 90
    • 34250004339 scopus 로고    scopus 로고
    • Large vocabulary continuous speech recognition of an inflected language using stems and endings
    • DOI 10.1016/j.specom.2007.02.010, PII S0167639307000428
    • Rotovnik, T., Maucec, M.S., Kacix, Z., 2007. Large vocabulary continuous speech recognition of an inflected language using stems and endings. Speech Communication 49 (6), 437-452. (Pubitemid 46891622)
    • (2007) Speech Communication , vol.49 , Issue.6 , pp. 437-452
    • Rotovnik, T.1    Maucec, M.S.2    Kacic, Z.3
  • 92
    • 78049399086 scopus 로고    scopus 로고
    • Morphology-based and subword language modeling for turkish speech recognition
    • Sak, H., Saraclar, M., Güngör, T., 2010. Morphology-based and subword language modeling for Turkish speech recognition. In: ICASSP 2010, pp. 5402-5405.
    • (2010) ICASSP 2010 , pp. 5402-5405
    • Sak, H.1    Saraclar, M.2    Güngör, T.3
  • 93
    • 34547535012 scopus 로고    scopus 로고
    • Joint morphological-lexical language modeling (JMLLM) for arabic
    • Sarikaya, R., Afify, M., Gao, Y., 2007. Joint morphological-lexical language modeling (JMLLM) for Arabic. In: Proc. ICASSP'07, Vol. 4, pp. 181-184.
    • (2007) Proc. ICASSP'07 , vol.4 , pp. 181-184
    • Sarikaya, R.1    Afify, M.2    Gao, Y.3
  • 94
    • 79959851710 scopus 로고    scopus 로고
    • Wiktionary as a source for automatic pronunciation extraction
    • Makuhari, Japan, 26-30 September 2010
    • Schlippe, T., Ochs, S., Schultz, T., 2010. Wiktionary as a source for automatic pronunciation extraction. In: Interspeech 2010, Makuhari, Japan, 26-30 September 2010.
    • (2010) Interspeech 2010
    • Schlippe, T.1    Ochs, S.2    Schultz, T.3
  • 95
    • 84867605828 scopus 로고    scopus 로고
    • Grapheme-to-phoneme model generation for indo-european languages
    • Kyoto, Japan, 25-30 March 2012
    • Schlippe, T., Ochs, S., Schultz, T., 2012a. Grapheme-to-phoneme model generation for indo-European languages. In: ICASSP 2012, Kyoto, Japan, 25-30 March 2012.
    • (2012) ICASSP 2012
    • Schlippe, T.1    Ochs, S.2    Schultz, T.3
  • 96
    • 84878526331 scopus 로고    scopus 로고
    • Automatic error recovery for pronunciation dictionaries
    • Portland, Oregon, 9-13 September 2012
    • Schlippe, T., Ochs, S., Vu, N.T., Schultz, T., 2012b. Automatic error recovery for pronunciation dictionaries. In: Interspeech 2012, Portland, Oregon, 9-13 September 2012.
    • (2012) Interspeech 2012
    • Schlippe, T.1    Ochs, S.2    Vu, N.T.3    Schultz, T.4
  • 97
    • 84893706991 scopus 로고    scopus 로고
    • Web-based tools and methods for rapid pronunciation dictionary creation
    • http://dx.doi.org/10.1016/j.specom.2013.06.015
    • Schlippe, T., Ochs, S., Schultz, T., 2013. Web-based tools and methods for rapid pronunciation dictionary creation. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.06.015.
    • (2013) Speech Communication
    • Schlippe, T.1    Ochs, S.2    Schultz, T.3
  • 98
    • 85009274666 scopus 로고    scopus 로고
    • GlobalPhone: A multilingual speech and text database developed at karlsruhe university
    • Schultz, T., 2002. GlobalPhone: A multilingual speech and text database developed at Karlsruhe University. In: ICSLP, pp. 345-348.
    • (2002) ICSLP , pp. 345-348
    • Schultz, T.1
  • 99
    • 85013700737 scopus 로고    scopus 로고
    • Tanja Schultz, Katrin Kirchhoff (Eds.), Elsevier, Academic Press, ISBN 13: 978-0-12-088501-5, 2006
    • Schultz, T., 2006. Multilingual speech processing. In: Tanja Schultz, Katrin Kirchhoff (Eds.), Elsevier, Academic Press, ISBN 13: 978-0-12-088501-5, 2006.
    • (2006) Multilingual Speech Processing
    • Schultz, T.1
  • 100
    • 43849102326 scopus 로고    scopus 로고
    • SPICE: Web-based tools for rapid language adaptation in speech processing systems
    • Antwerp, Belgium
    • Schultz, T., Black, A.W., Badaskar, S., Hornyak, M., Kominek, J., 2007. SPICE: web-based tools for rapid language adaptation in speech processing systems. In: Interspeech 2007, Antwerp, Belgium.
    • (2007) Interspeech 2007
    • Schultz, T.1    Black, A.W.2    Badaskar, S.3    Hornyak, M.4    Kominek, J.5
  • 101
    • 84890463379 scopus 로고    scopus 로고
    • GlobalPhone: A multilingual text & speech database in 20 languages
    • Vancouver, Canada
    • Schultz, T., Vu, N.T., Schlippe, T., 2013. GlobalPhone: A multilingual text & speech database in 20 languages. In: ICASSP 2013, Vancouver, Canada.
    • (2013) ICASSP 2013
    • Schultz, T.1    Vu, N.T.2    Schlippe, T.3
  • 102
    • 0001216191 scopus 로고    scopus 로고
    • Language independent and language adaptive LVCSR
    • Sydney
    • Schultz, T., Waibel, A., 1998. Language independent and language adaptive LVCSR. In: Proc. ICSLP, Sydney, pp. 1819-1822.
    • (1998) Proc. ICSLP , pp. 1819-1822
    • Schultz, T.1    Waibel, A.2
  • 103
    • 0035426931 scopus 로고    scopus 로고
    • Language-independent and language-adaptive acoustic modeling for speech recognition
    • DOI 10.1016/S0167-6393(00)00094-7, PII S0167639300000947
    • Schultz, T., Waibel, A., 2001. Language independent and language adaptive acoustic modeling for speech recognition. Speech Communication 35, 31-51. (Pubitemid 32599645)
    • (2001) Speech Communication , vol.35 , Issue.1-2 , pp. 31-51
    • Schultz, T.1    Waibel, A.2
  • 104
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-dependent deep neural networks for conversational speech transcription
    • HI, USA
    • Seide, F., Li, G., Chen, X., Yu, D., 2011. Feature engineering in context-dependent deep neural networks for conversational speech transcription. In: Proc. ASRU-2011 International Workshop, HI, USA, pp. 24-29.
    • (2011) Proc. ASRU-2011 International Workshop , pp. 24-29
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 105
    • 84867333761 scopus 로고    scopus 로고
    • Universal attribute characterization of spoken languages for automatic spoken language recognition
    • Siniscalchi, S.M., Reed, J., Svendsen, T., Lee, C.-H., 2013. Universal attribute characterization of spoken languages for automatic spoken language recognition. Computer Speech & Language 27 (1), 209-227.
    • (2013) Computer Speech & Language , vol.27 , Issue.1 , pp. 209-227
    • Siniscalchi, S.M.1    Reed, J.2    Svendsen, T.3    Lee, C.-H.4
  • 110
    • 33947619591 scopus 로고    scopus 로고
    • Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons
    • Stolcke, A., Grezl, F., Hwang, M.-Y., Lei, X., Morgan, N., Vergyri, D., 2006. Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons. In: Proc. ICASSP 2006.
    • (2006) Proc. ICASSP 2006
    • Stolcke, A.1    Grezl, F.2    Hwang, M.-Y.3    Lei, X.4    Morgan, N.5    Vergyri, D.6
  • 111
    • 85133311295 scopus 로고    scopus 로고
    • Integrating thai grapheme based acoustic models into the ML-mix framework - For language independent and cross-language ASR
    • Hanoi, Vietnam
    • Stüker, S., 2008. Integrating Thai grapheme based acoustic models into the ML-mix framework - for language independent and cross-language ASR. In: SLTU'08, Hanoi, Vietnam.
    • (2008) SLTU'08
    • Stüker, S.1
  • 114
    • 70450198124 scopus 로고    scopus 로고
    • Human translations guided language discovery for ASR systems
    • Brighton, UK
    • Stüker, S., Besacier, L., Waibel, A., 2009. Human translations guided language discovery for ASR systems. In: InterSpeech-2009, Brighton, UK.
    • (2009) InterSpeech-2009
    • Stüker, S.1    Besacier, L.2    Waibel, A.3
  • 115
    • 70450172181 scopus 로고    scopus 로고
    • Localization of speech recognition in spoken dialog systems: How machine translation can make our lives
    • Brighton, UK
    • Suenderman, K., Liscombe, J., 2009. Localization of speech recognition in spoken dialog systems: how machine translation can make our lives. In: Interspeech 2009, Brighton, UK, pp. 1475-1478.
    • (2009) Interspeech 2009 , pp. 1475-1478
    • Suenderman, K.1    Liscombe, J.2
  • 116
    • 0141703236 scopus 로고    scopus 로고
    • Finite-state transducer based modeling of morphosyntax with applications to hungarian LVCSR
    • HongKong, China
    • Szarvas, M., Furui, S., 2003. Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR. In: Proc. ICASSP, HongKong, China, pp. 368-371.
    • (2003) Proc. ICASSP , pp. 368-371
    • Szarvas, M.1    Furui, S.2
  • 118
    • 84893678734 scopus 로고    scopus 로고
    • Using different acoustic, lexical and language modeling units for ASR of an under-resourced language - Amharic
    • http://dx.doi.org/10.1016/j.specom.2013.01.008
    • Tachbelie, M., Abate, S.T., Besacier, L., 2013. Using different acoustic, lexical and language modeling units for ASR of an under-resourced language - Amharic. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.01.008.
    • (2013) Speech Communication
    • Tachbelie, M.1    Abate, S.T.2    Besacier, L.3
  • 120
    • 84867606552 scopus 로고    scopus 로고
    • Multilingual MLP features for low-resource LVCSR systems
    • Japan
    • Thomas, S., Ganapathy, S., Hermansky, H., 2012a. Multilingual MLP features for low-resource LVCSR systems. In: Proc. ICASSP, Japan.
    • (2012) Proc. ICASSP
    • Thomas, S.1    Ganapathy, S.2    Hermansky, H.3
  • 121
    • 84878392008 scopus 로고    scopus 로고
    • Data-driven posterior features for low resource speech recognition applications
    • USA
    • Thomas, S., Ganapathy, S., Jansen, A., Hermansky, H., 2012b. Data-driven posterior features for low resource speech recognition applications. In: Proc. Interspeech, USA.
    • (2012) Proc. Interspeech
    • Thomas, S.1    Ganapathy, S.2    Jansen, A.3    Hermansky, H.4
  • 122
    • 84858985238 scopus 로고    scopus 로고
    • Cross-lingual portability of MLP-based tandem features - A case study for english and hungarian
    • Toth, L., Frankel, J., Gosztolya, G., King, S., 2008. Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian. In: Proc. Interspeech.
    • (2008) Proc. Interspeech
    • Toth, L.1    Frankel, J.2    Gosztolya, G.3    King, S.4
  • 123
    • 0035101535 scopus 로고    scopus 로고
    • A survey of hybrid ANN/HMM models for automatic speech recognition
    • Trentin, E., Gori, M., 2001. A survey of hybrid ANN/HMM models for automatic speech recognition. Neurocomputing 37 (1), 91-126.
    • (2001) Neurocomputing , vol.37 , Issue.1 , pp. 91-126
    • Trentin, E.1    Gori, M.2
  • 125
    • 84893679840 scopus 로고    scopus 로고
    • Predicting utterance pitch targets in yoruba for tone realisation in speech synthesis
    • http://dx.doi.org/10.1016/j.specom.2013.01.009
    • van Niekerk, D.R., Barnard, E., 2013. Predicting utterance pitch targets in Yoruba for tone realisation in speech synthesis, Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.01.009.
    • (2013) Speech Communication
    • Van Niekerk, D.R.1    Barnard, E.2
  • 126
    • 85009110467 scopus 로고    scopus 로고
    • Morphology-based language modeling for arabic speech recognition
    • Vergyri, D., Kirchhoff, K., Duh, K., Stolcke, A., 2004. Morphology-based language modeling for Arabic speech recognition. In: Proc. ICSLP'04, pp. 2245-2248.
    • (2004) Proc. ICSLP'04 , pp. 2245-2248
    • Vergyri, D.1    Kirchhoff, K.2    Duh, K.3    Stolcke, A.4
  • 128
    • 79951796711 scopus 로고    scopus 로고
    • Multilingual A-stabil: A new confidence score for multilingual unsupervised training
    • USA
    • Vu, N.T., Kraus, F., Schultz, T., 2010. Multilingual A-stabil: A new confidence score for multilingual unsupervised training. In: Proc. SLT, USA.
    • (2010) Proc. SLT
    • Vu, N.T.1    Kraus, F.2    Schultz, T.3
  • 129
    • 84865764419 scopus 로고    scopus 로고
    • Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training
    • Italy
    • Vu, N.T., Kraus, F., Schultz, T., 2011. Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training. In: Proc. Interspeech, Italy.
    • (2011) Proc. Interspeech
    • Vu, N.T.1    Kraus, F.2    Schultz, T.3
  • 130
    • 84890464069 scopus 로고    scopus 로고
    • Multilingual bottle-neck feature for under resourced languages
    • South Africa
    • Vu, N.T., Metze, F., Schultz, T., 2012a. Multilingual bottle-neck feature for under resourced languages. In: Proc. SLTU, South Africa.
    • (2012) Proc. SLTU
    • Vu, N.T.1    Metze, F.2    Schultz, T.3
  • 131
    • 84878559540 scopus 로고    scopus 로고
    • An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance
    • USA
    • Vu, N.T., Breiter, W., Metze, F., Schultz, T., 2012b. An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance. In: Proc. Interspeech, USA.
    • (2012) Proc. Interspeech
    • Vu, N.T.1    Breiter, W.2    Metze, F.3    Schultz, T.4
  • 132
    • 84856280064 scopus 로고
    • An evaluation of cross-language adaptation for rapid HMM development in a new language
    • Adelaide
    • Wheatley, B., Kondo, K., Anderson, W., Muthusamy, Y., 1994. An evaluation of cross-language adaptation for rapid HMM development in a new language. In: Proc. ICASSP, Adelaide, pp. 237-240.
    • (1994) Proc. ICASSP , pp. 237-240
    • Wheatley, B.1    Kondo, K.2    Anderson, W.3    Muthusamy, Y.4
  • 137
    • 85075927145 scopus 로고    scopus 로고
    • HMMs and related speech recognition technologies
    • Springer-Verlag, Berlin Heidelberg
    • Young, S., 2008. HMMs and related speech recognition technologies. In: Springer Handbook of Speech Processing. Springer-Verlag, Berlin Heidelberg, pp. 539-557.
    • (2008) Springer Handbook of Speech Processing , pp. 539-557
    • Young, S.1
  • 138
    • 84867329143 scopus 로고    scopus 로고
    • Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition
    • Yu, D., Siniscalchi, S.M., Deng, L., Lee, C.-H., 2012. Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition. In: Proc. ICASSP-2012, pp. 4169-4172.
    • (2012) Proc. ICASSP-2012 , pp. 4169-4172
    • Yu, D.1    Siniscalchi, S.M.2    Deng, L.3    Lee, C.-H.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.