SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 56, Issue 1, 2014, Pages 85-100

Automatic speech recognition for under-resourced languages: A survey

(4) Besacier, Laurent a Barnard, Etienne b Karpov, Alexey c Schultz, Tanja d

a UNIV GRENOBLE ALPES (France)

b NORTH WEST UNIVERSITY (South Africa)

c ST PETERSBURG INSTITUTE FOR INFORMATICS AND AUTOMATION (Russian Federation)

d KARLSRUHE INSTITUTE OF TECHNOLOGY (Germany)

Author keywords

Automatic pronunciation generation; Automatic speech recognition (ASR); Crosslingual acoustic modeling and adaptation; Language portability; Lexical modeling; Speech and language resources acquisition; Statistical language modeling; Under resourced languages

Indexed keywords

DEEP NEURAL NETWORKS; MODELING LANGUAGES; NATURAL LANGUAGE PROCESSING SYSTEMS; SPEECH; SPEECH PROCESSING; SPEECH SYNTHESIS; SURVEYS;

ACOUSTIC MODEL; AUTOMATIC PRONUNCIATION GENERATION; AUTOMATIC SPEECH RECOGNITION; LANGUAGE PORTABILITIES; LANGUAGE RESOURCES; STATISTICAL LANGUAGE MODELING; UNDER-RESOURCED LANGUAGES;

SPEECH RECOGNITION;

EID: 84893667016 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2013.07.008 Document Type: Article

Times cited : (427)

References (138)

1
- 44949086643
- Automatic transcription of somali language
- Pittsburgh, PA, USA
- Abdillahi, N., Nocera, P., Bonastre, J.-F., 2006. Automatic transcription of Somali language. In: ICSLP'06, Pittsburgh, PA, USA, pp. 289-292.
- (2006) ICSLP'06 , pp. 289-292
- Abdillahi, N.¹ Nocera, P.² Bonastre, J.-F.³

2
- 78651107637
- Uyghur morpheme-based language models and ASR
- Beijing, China
- Ablimit, M., Neubig, G., Mimura, M., Mori, S., Kawahara, T., Hamdulla, A., 2010. Uyghur Morpheme-based language models and ASR. In: Proc. IEEE 10th International Conference on Signal Processing (ICSP), Beijing, China, pp. 581-584.
- (2010) Proc. IEEE 10th International Conference on Signal Processing (ICSP) , pp. 581-584
- Ablimit, M.¹ Neubig, G.² Mimura, M.³ Mori, S.⁴ Kawahara, T.⁵ Hamdulla, A.⁶

3
- 85009202964
- A corpus-based decompounding algorithm for german lexical modeling in LVCSR
- Geneva, Switzerland
- Adda-Decker, M., 2003. A corpus-based decompounding algorithm for German lexical modeling in LVCSR. In: Proc. Eurospeech-2003, Geneva, Switzerland, pp. 257-260.
- (2003) Proc. Eurospeech-2003 , pp. 257-260
- Adda-Decker, M.¹

4
- 33745683825
- A unified language model for large vocabulary continuous speech recognition of turkish
- Arisoy, E., Dutagaci, H., Arslan, L., 2006. A unified language model for large vocabulary continuous speech recognition of Turkish. Signal Processing 86 (10), 2844-2862.
- (2006) Signal Processing , vol.86 , Issue.10 , pp. 2844-2862
- Arisoy, E.¹ Dutagaci, H.² Arslan, L.³

5
- 85075436378
- Deep neural network language models
- Montreal, Canada
- Arisoy, E., Sainath, T.N., Kingsbury, B., Ramabhadran, B., 2012. Deep neural network language models. In: Proc. NAACL-HLT 2012 Workshop, Montreal, Canada, pp. 20-28.
- (2012) Proc. NAACL-HLT 2012 Workshop , pp. 20-28
- Arisoy, E.¹ Sainath, T.N.² Kingsbury, B.³ Ramabhadran, B.⁴

6
- 70450219527
- ASR corpus design for resource-scarce languages
- Barnard, E., Davel, M., van Heerden, C., 2009. ASR corpus design for resource-scarce languages. In: Proc. Interspeech, pp. 2847-2850.
- (2009) Proc. Interspeech , pp. 2847-2850
- Barnard, E.¹ Davel, M.² Van Heerden, C.³

7
- 77957968025
- Speech technology for information access: A South African case study
- Palo Alto, California, March 2010
- Barnard, E., Davel, M., van Huyssteen, G.B., 2010. Speech technology for information access: A South African case study. In: Proceedings of the AAAI Spring Symposium on Artificial Intelligence for Development (AI-D), Palo Alto, California, March 2010, pp. 8-13.
- (2010) Proceedings of the AAAI Spring Symposium on Artificial Intelligence for Development (AI-D) , pp. 8-13
- Barnard, E.¹ Davel, M.² Van Huyssteen, G.B.³

8
- 0030355995
- Multilingual speech recognition at dragon systems
- Philadelphia
- Barnett, J., Corrada, A., Gao, G., Gillik, L., Ito, Y., Lowe, S., Manganaro, L., Peskin, B., 1996. Multilingual speech recognition at Dragon systems. In: Proc. ICSLP, Philadelphia, pp. 2191-2194.
- (1996) Proc. ICSLP , pp. 2191-2194
- Barnett, J.¹ Corrada, A.² Gao, G.³ Gillik, L.⁴ Ito, Y.⁵ Lowe, S.⁶ Manganaro, L.⁷ Peskin, B.⁸

9
- 33947661751
- Ph.D. Thesis, J. Fourier University - Grenoble I, May 2004
- Berment, V., 2004. Méthodes pour informatiser des langues et des groupes de langues peu dotées. Ph.D. Thesis, J. Fourier University - Grenoble I, May 2004.
- (2004) Méthodes pour Informatiser des Langues et des Groupes de Langues Peu Dotées
- Berment, V.¹

10
- 48749106056
- Towards speech translation of non written languages
- Aruba, December 2006
- Besacier, L., Zhou, B., Gao, Y., 2006. Towards speech translation of non written languages. In: IEEE/ACL SLT 2006. Aruba, December 2006.
- (2006) IEEE/ACL SLT 2006
- Besacier, L.¹ Zhou, B.² Gao, Y.³

11
- 84865750037
- Errgrams - A way to improving ASR for highly inflective dravidian languages
- India
- Bhanuprasad, K., Svenson, M., 2008. Errgrams - a way to improving ASR for highly inflective Dravidian languages. In: Proc. 3rd International Joint Conf. on Natural Language Processing IJCNLP'08, India, pp. 805-810.
- (2008) Proc. 3rd International Joint Conf. on Natural Language Processing IJCNLP'08 , pp. 805-810
- Bhanuprasad, K.¹ Svenson, M.²

12
- 0003132144
- Multilingual speech recognition: The 1996 byblos callhome system
- Rhodes, Greece
- Billa, J., Ma, K., McDonough, J., Zavaliagkos, G., Miller, D.R., Ross, K.N., El-Jaroudi, A., 1997. Multilingual speech recognition: the 1996 Byblos Callhome system. In: Proc. Eurospeech-1997, Rhodes, Greece, pp. 363-366.
- (1997) Proc. Eurospeech-1997 , pp. 363-366
- Billa, J.¹ Ma, K.² McDonough, J.³ Zavaliagkos, G.⁴ Miller, D.R.⁵ Ross, K.N.⁶ El-Jaroudi, A.⁷

13
- 84893666435
- Transcribing southern min speech corpora with a web-based language learning system
- Hanoi, Vietnam
- Cai, J., 2008. Transcribing southern min speech corpora with a web-based language learning system. In: SLTU'08, Hanoi, Vietnam.
- (2008) SLTU'08
- Cai, J.¹

14
- 85030413663
- Turkish LVCSR: Towards better speech recognition for agglutinative languages
- Carki, K., Geutner, P., Schultz, T., 2000. Turkish LVCSR: towards better speech recognition for agglutinative languages. In: IEEE ICASSP.
- (2000) IEEE ICASSP
- Carki, K.¹ Geutner, P.² Schultz, T.³

15
- 69249125883
- Unsupervised adaptive speech technology for limited resource languages: A case study for tamil
- Hanoi, Vietnam
- Cetin, O., 2008. Unsupervised adaptive speech technology for limited resource languages: A case study for Tamil. In: SLTU'08, Hanoi, Vietnam.
- (2008) SLTU'08
- Cetin, O.¹

16
- 84859918743
- Discriminative pronunciation learning for speech recognition for resource scarce languages
- Article No. 12
- Chan, H.Y., Rosenfeld, R. 2012. Discriminative pronunciation learning for speech recognition for resource scarce languages. In: Proceedings of the 2nd ACM Symposium on Computing for Development. Article No. 12.
- (2012) Proceedings of the 2nd ACM Symposium on Computing for Development
- Chan, H.Y.¹ Rosenfeld, R.²

17
- 25844455735
- Syntax-based language models for machine translation
- New Orleans, USA
- Charniak, E., Knight, K., Yamada, K., 2003. Syntax-based language models for machine translation. In: Proc. IX MT Summit, New Orleans, USA, pp. 40-46.
- (2003) Proc. IX MT Summit , pp. 40-46
- Charniak, E.¹ Knight, K.² Yamada, K.³

18
- 67650412177
- Thai grapheme-based speech recognition
- Charoenpornsawat, P., Hewavitharana, S., Schultz, T., 2006. Thai grapheme-based speech recognition. In: Human Language Technology Conference (HLT).
- (2006) Human Language Technology Conference (HLT)
- Charoenpornsawat, P.¹ Hewavitharana, S.² Schultz, T.³

19
- 0034295822
- Structured language model
- Chelba, C., Jelinek, F., 2000. Structured language model. Computer Speech and Language 10, 283-332.
- (2000) Computer Speech and Language , vol.10 , pp. 283-332
- Chelba, C.¹ Jelinek, F.²

20
- 0030638088
- Towards a universal speech recognizer for multiple languages
- St. Barbara CA
- Cohen, P., Dharanipragada, S., Gros, J., Monkowski, M., Neti, C., Roukos, S., Ward, T., 1997. Towards a universal speech recognizer for multiple languages. In: Proc. Automatic Speech Recognition and Understanding (ASRU), St. Barbara CA, pp. 591-598.
- (1997) Proc. Automatic Speech Recognition and Understanding (ASRU) , pp. 591-598
- Cohen, P.¹ Dharanipragada, S.² Gros, J.³ Monkowski, M.⁴ Neti, C.⁵ Roukos, S.⁶ Ward, T.⁷

21
- 0030638067
- On cross-language experiments and data-driven units for ALISP
- St. Barbara CA
- Constantinescu, A., Chollet, G., 1997. On cross-language experiments and data-driven units for ALISP. In: Proc. Automatic Speech Recognition and Understanding (ASRU), St. Barbara CA, pp. 606-613.
- (1997) Proc. Automatic Speech Recognition and Understanding (ASRU) , pp. 606-613
- Constantinescu, A.¹ Chollet, G.²

22
- 33750359664
- Unsupervised morpheme segmentation and morphology induction from text corpora using morfessor 1.0
- Helsinki University of Technology, Finland
- Creutz, M., Lagus, K., 2005. Unsupervised morpheme segmentation and morphology induction from text corpora using Morfessor 1.0. Computer and Information Science, Report A81, Helsinki University of Technology, Finland.
- (2005) Computer and Information Science, Report A81
- Creutz, M.¹ Lagus, K.²

23
- 37849048345
- Morph-based speech recognition and modeling of out-of-vocabulary words across languages
- Article No. 3
- Creutz, M., Hirsimaki, T., Kurimo, M., Puurula, A., Pylkkonen, J., Siivola, V., Varjokallio, M., Arisoy, E., Saraclar, M., Stolcke, A., 2007. Morph-based speech recognition and modeling of out-of-vocabulary words across languages. ACM Transactions on Speech and Language Processing 5 (1). Article No. 3.
- (2007) ACM Transactions on Speech and Language Processing , vol.5 , Issue.1
- Creutz, M.¹ Hirsimaki, T.² Kurimo, M.³ Puurula, A.⁴ Pylkkonen, J.⁵ Siivola, V.⁶ Varjokallio, M.⁷ Arisoy, E.⁸ Saraclar, M.⁹ Stolcke, A.¹⁰

24
- 78349265614
- Cambridge CUP
- Crystal, D., 2000. Language Death. Cambridge CUP.
- (2000) Language Death
- Crystal, D.¹

25
- 84858978033
- Investigating the role of machine translated text in ASR domain adaptation: Unsuper-vised and semi-supervised methods
- Hawaii, USA
- Cucu, H., Besacier, L., Burileanu, C., Buzo, A., 2011. Investigating the role of machine translated text in ASR domain adaptation: unsuper-vised and semi-supervised methods. In: Proc. ASRU 2011, Hawaii, USA.
- (2011) Proc. ASRU 2011
- Cucu, H.¹ Besacier, L.² Burileanu, C.³ Buzo, A.⁴

26
- 84893681586
- ASR domain adaptation methods for low-resourced languages: Application to Romanian language
- Bucarest, Romania
- Cucu, H., Besacier, L., Burileanu, C., Buzo, A., 2012. ASR domain adaptation methods for low-resourced languages: application to Romanian language. In: EUSIPCO'2012, Bucarest, Romania.
- (2012) EUSIPCO'2012
- Cucu, H.¹ Besacier, L.² Burileanu, C.³ Buzo, A.⁴

27
- 84893703276
- SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian
- http://dx.doi.org/10.1016/j.specom.2013.05.003
- Cucu, H., Buzo, A., Besacier, L., Burileanu, C., 2013. SMT-based ASR domain adaptation methods for under-resourced languages: application to Romanian. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.05.003.
- (2013) Speech Communication
- Cucu, H.¹ Buzo, A.² Besacier, L.³ Burileanu, C.⁴

28
- 84865744412
- Efficient harvesting of internet audio for resource-scarce ASR
- Davel, M.H., van Heerden, C., Kleynhans, N., Barnard, E., 2011. Efficient harvesting of Internet audio for resource-scarce ASR. In: Proc. Interspeech, pp. 3153-3156.
- (2011) Proc. Interspeech , pp. 3153-3156
- Davel, M.H.¹ Van Heerden, C.² Kleynhans, N.³ Barnard, E.⁴

29
- 84865706565
- Woefzela - An open-source platform for ASR data collection in the developing world
- De Vries, N.J., Badenhorst, J., Davel, M.H., Barnard, E., De Waal, A., 2011. Woefzela - an open-source platform for ASR data collection in the developing world. In: Proc. Interspeech, pp. 3177-3180.
- (2011) Proc. Interspeech , pp. 3177-3180
- De Vries, N.J.¹ Badenhorst, J.² Davel, M.H.³ Barnard, E.⁴ De Waal, A.⁵

30
- 84893666907
- A smartphone-based ASR data collection tool for under-resourced languages
- http://dx.doi.org/10.1016/j.specom.2013.07.001
- De Vries, N.J., Davel, M.H., Badenhorst, J., Basson, W.D., de Wet, F., Barnard, E., De Waal, A., 2013. A smartphone-based ASR data collection tool for under-resourced languages, Speech Communication. http://dx.doi.org/10.1016/j. specom.2013.07.001.
- (2013) Speech Communication
- De Vries, N.J.¹ Davel, M.H.² Badenhorst, J.³ Basson, W.D.⁴ De Wet, F.⁵ Barnard, E.⁶ De Waal, A.⁷

31
- 51449096957
- The character as an appropriate unit of processing for non-segmenting languages
- Tokyo, Japan
- Denoual, E., Lepage, Y., 2006. The character as an appropriate unit of processing for non-segmenting languages. In: NLP Annual Meeting, Tokyo, Japan, pp. 731-734.
- (2006) NLP Annual Meeting , pp. 731-734
- Denoual, E.¹ Lepage, Y.²

32
- 84894590340
- Unsupervised SMT for a low-resourced language pair
- Penang, Malaysia
- Do, T., Besacier, L., Castelli, E., 2010. Unsupervised SMT for a low-resourced language pair. In: Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU), Penang, Malaysia.
- (2010) Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU)
- Do, T.¹ Besacier, L.² Castelli, E.³

33
- 84885550611
- The philips large-vocabulary recognition system for American english, french, and german
- Madrid
- Dugast, C., Aubert, X., Kneser, R., 1995. The Philips large-vocabulary recognition system for American English, French, and German. In: Proc. Eurospeech, Madrid, pp. 197-200.
- (1995) Proc. Eurospeech , pp. 197-200
- Dugast, C.¹ Aubert, X.² Kneser, R.³

34
- 84893670051
- Statistical parametric speech synthesis for ibibio
- http://dx.doi.org/10.1016/j.specom.2013.02.003
- Ekpenyong, M., Urua, E.-A., Watts, O., King, S., Yamagishi, J., 2013. Statistical parametric speech synthesis for Ibibio, Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.02.003.
- (2013) Speech Communication
- Ekpenyong, M.¹ Urua, E.-A.² Watts, O.³ King, S.⁴ Yamagishi, J.⁵

35
- 1942448479
- Hybrid SVM/HMM architectures for speech recognition
- Ganapathiraju, A., Hamaker, J., Picone, J., 2000. Hybrid SVM/HMM architectures for speech recognition. In: Proceedings of Speech Transcription Workshop, pp. 504-507.
- (2000) Proceedings of Speech Transcription Workshop , pp. 504-507
- Ganapathiraju, A.¹ Hamaker, J.² Picone, J.³

36
- 84893683459
- English-amharic statistical machine translation
- Cape-Town, South Africa
- Gebreegziabher, M., Besacier, L., 2012. English-Amharic statistical machine translation. In: SLTU - Workshop on Spoken Language Technologies for Under-Resourced Languages, Cape-Town, South Africa.
- (2012) SLTU - Workshop on Spoken Language Technologies for Under-Resourced Languages
- Gebreegziabher, M.¹ Besacier, L.²

37
- 84893680491
- Using automatic speech recognition for phonological purposes: Study of vowel length in punu (Bantu B40)
- New Mexico (US), July 2010
- Gelas, H., Besacier, L., Rossato, S., Pellegrino, F., 2010. Using automatic speech recognition for phonological purposes: study of vowel length in Punu (Bantu B40). In: Laphon 12, New Mexico (US), July 2010.
- (2010) Laphon 12
- Gelas, H.¹ Besacier, L.² Rossato, S.³ Pellegrino, F.⁴

38
- 84865722538
- Quality assessment of crowdsourcing transcriptions for african languages
- Italy, 28-31 August 2011
- Gelas, H., Teferra Abate, S., Besacier, L., Pellegrino, F., 2011. Quality assessment of crowdsourcing transcriptions for African languages. In: Interspeech 2011 Florence, Italy, 28-31 August 2011.
- (2011) Interspeech 2011 Florence
- Gelas, H.¹ Teferra Abate, S.² Besacier, L.³ Pellegrino, F.⁴

39
- 85030406898
- A hierarchical exemplar-based sparse model of speech with an application to ASR
- HI, USA
- Gemmeke, J.F., Van hamme, H., 2011. A hierarchical exemplar-based sparse model of speech with an application to ASR. IEEE ASRU 2011, HI, USA.
- (2011) IEEE ASRU 2011
- Gemmeke, J.F.¹ Van Hamme, H.²

40
- 70349227690
- Web-derived pronunciations
- Ghoshal, A., Jansche, M., Khudanpur, S., Riley, M., Ulinski, M., 2009. Web-derived pronunciations. In: IEEE ICASSP.
- (2009) IEEE ICASSP
- Ghoshal, A.¹ Jansche, M.² Khudanpur, S.³ Riley, M.⁴ Ulinski, M.⁵

41
- 84893699024
- Multiple pronunciation model for amharic speech recognition system
- Hanoi, Vietnam
- Gizaw, S., 2008. Multiple pronunciation model for Amharic speech recognition system. In: SLTU 2008, Hanoi, Vietnam.
- (2008) SLTU 2008
- Gizaw, S.¹

42
- 0029354754
- Multi-lingual spoken language understanding in the MIT voyager system
- Glass, J., Flammia, G., Goodine, D., Phillips, M., Polifroni, J., Sakai, S., Seneff, S., Zue, V., 1995. Multi-lingual spoken language understanding in the MIT voyager system. Speech Communication 17, 1-18.
- (1995) Speech Communication , vol.17 , pp. 1-18
- Glass, J.¹ Flammia, G.² Goodine, D.³ Phillips, M.⁴ Polifroni, J.⁵ Sakai, S.⁶ Seneff, S.⁷ Zue, V.⁸

43
- 85016587886
- SWITCHBOARD: Telephone speech corpus for research and development
- Godfrey, J.J., Holliman, E.C., McDaniel, J., 1992. SWITCHBOARD: telephone speech corpus for research and development. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 517-520.
- (1992) IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.1 , pp. 517-520
- Godfrey, J.J.¹ Holliman, E.C.² McDaniel, J.³

44
- 0030640721
- A multilingual phoneme and model set: Towards a universal base for automatic speech recognition
- St. Barbara CA
- Gokcen, S., Gokcen, J., 1997. A multilingual phoneme and model set: towards a universal base for automatic speech recognition. In: Proc. Automatic Speech Recognition and Understanding (ASRU), St. Barbara CA, pp. 599-603.
- (1997) Proc. Automatic Speech Recognition and Understanding (ASRU) , pp. 599-603
- Gokcen, S.¹ Gokcen, J.²

45
- 34547548235
- Probabilistic and bottle-neck features for LVCSR of meetings
- USA
- Grezl, F., et al., 2007. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. ICASSP, USA.
- (2007) Proc. ICASSP
- Grezl, F.¹

46
- 0033709098
- Tandem connectionist feature extraction for conventional HMM systems
- Turkey
- Hermansky, H., Wellis, D., Sharma, S., 2000. Tandem connectionist feature extraction for conventional HMM systems. In: Proc. ICASSP, Turkey.
- (2000) Proc. ICASSP
- Hermansky, H.¹ Wellis, D.² Sharma, S.³

47
- 0141476926
- Accent modeling based on pronunciation dictionary adaptation for large vocabulary mandarin speech recognition
- Beijing, China
- Huang, C., Chang, E., Zhou, J., Lee K.-F., 2000. Accent modeling based on pronunciation dictionary adaptation for large vocabulary Mandarin speech recognition. In: Proc. INTERSPEECH-2000, Beijing, China, pp. 818-821.
- (2000) Proc. INTERSPEECH-2000 , pp. 818-821
- Huang, C.¹ Chang, E.² Zhou, J.³ Lee, K.-F.⁴

48
- 77950944695
- Morpho-syntactic postprocessing of N-best lists for improved french automatic speech recognition
- Huet, S., Gravier, G., Sebillot, P., 2010. Morpho-syntactic postprocessing of N-best lists for improved French automatic speech recognition. Computer Speech and Language 24 (4), 663-684.
- (2010) Computer Speech and Language , vol.24 , Issue.4 , pp. 663-684
- Huet, S.¹ Gravier, G.² Sebillot, P.³

49
- 79959826604
- Building transcribed speech corpora quickly and cheaply for many languages
- Makuhari, Japan
- Hughes, T., Nakajima, K., Ha, L., Moreno, P., LeBeau, M., 2010. Building transcribed speech corpora quickly and cheaply for many languages. In: Proc. Interspeech, Makuhari, Japan, pp. 1914-1917.
- (2010) Proc. Interspeech , pp. 1914-1917
- Hughes, T.¹ Nakajima, K.² Ha, L.³ Moreno, P.⁴ LeBeau, M.⁵

50
- 0003417482
- IPA Cambridge University Press
- IPA, 1999. Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet. Cambridge University Press.
- (1999) Handbook of the International Phonetic Association: A Guide to the use of the International Phonetic Alphabet

51
- 69249138826
- Development of a speech recognition system for Icelandic using machine translated text
- Hanoi, Vietnam
- Jensson, A., 2008. Development of a speech recognition system for Icelandic using machine translated text. In: SLTU'08, Hanoi, Vietnam.
- (2008) SLTU'08
- Jensson, A.¹

52
- 78649995680
- Speech recognition system based improved DTW algorithm
- Jing, Z., Min, Z., 2010. Speech recognition system based improved DTW algorithm. In: Proc. Int. Conf. on Computer, Mechatronics, Control and, Electronic Engineering CMCE-2010, Vol. 5, pp. 320-323.
- (2010) Proc. Int. Conf. on Computer, Mechatronics, Control And, Electronic Engineering CMCE-2010 , vol.5 , pp. 320-323
- Jing, Z.¹ Min, Z.²

53
- 85030410304
- Statistical language modeling using syntactically enhanced LSA
- Mumbai, India
- Kanejiya, D.P., Kumar, A., Prasad, S., 2003. Statistical language modeling using syntactically enhanced LSA. In: Proc. TIFR Workshop on Spoken Language Processing, Mumbai, India, pp. 93-100.
- (2003) Proc. TIFR Workshop on Spoken Language Processing , pp. 93-100
- Kanejiya, D.P.¹ Kumar, A.² Prasad, S.³

54
- 84893657385
- Multilingual acoustic modeling using graphemes
- Geneva, Switzerland
- Kanthak, S., Ney, H., 2003. Multilingual acoustic modeling using graphemes. In: Eurospeech-2003, Geneva, Switzerland, pp. 1145-1148.
- (2003) Eurospeech-2003 , pp. 1145-1148
- Kanthak, S.¹ Ney, H.²

55
- 77956597632
- Comparing SMT methods for automatic generation of pronunciation variants
- Reykjavik, Iceland
- Karanasou, P., Lamel, L., 2010. Comparing SMT methods for automatic generation of pronunciation variants. In: IceTAL 2010, Reykjavik, Iceland, p. 167.
- (2010) IceTAL 2010 , pp. 167
- Karanasou, P.¹ Lamel, L.²

56
- 84859039566
- Very large vocabulary ASR for spoken Russian with syntactic and morphemic analysis
- Florence, Italy
- Karpov, A., Kipyatkova, I., Ronzhin, A., 2011. Very large vocabulary ASR for spoken Russian with syntactic and morphemic analysis. In: Proc. Interspeech'2011, Florence, Italy, pp. 3161-3164.
- (2011) Proc. Interspeech'2011 , pp. 3161-3164
- Karpov, A.¹ Kipyatkova, I.² Ronzhin, A.³

57
- 84893652340
- Large vocabulary Russian speech recognition using syntactico-statistical language modeling
- http://dx.doi.org/10.1016/j.specom.2013.07.004
- Karpov, A., Markov, K., Kipyatkova, I., Vazhenina, D., Ronzhin, A., 2013. Large vocabulary Russian speech recognition using syntactico-statistical language modeling. Speech Communication. http://dx.doi.org/10.1016/j.specom. 2013.07.004.
- (2013) Speech Communication
- Karpov, A.¹ Markov, K.² Kipyatkova, I.³ Vazhenina, D.⁴ Ronzhin, A.⁵

58
- 0004654418
- Data-driven determination of appropriate dictionary units for Korean LVCSR
- Kiecza, D., Schultz, T., Waibel, A., 1999. Data-driven determination of appropriate dictionary units for Korean LVCSR. In: Proceedings of the International Conference on Speech Processing, pp. 323-327.
- (1999) Proceedings of the International Conference on Speech Processing , pp. 323-327
- Kiecza, D.¹ Schultz, T.² Waibel, A.³

59
- 84893639064
- Grapheme based speech recognition
- Killer, M., Stüker, S., Schultz, T., 2003. Grapheme based speech recognition. In: Interspeech.
- (2003) Interspeech
- Killer, M.¹ Stüker, S.² Schultz, T.³

60
- 84872512624
- Analysis of long-distance word dependencies and pronunciation variability at conversational Russian speech recognition
- Wroclav, Poland
- Kipyatkova, I., Karpov, A., Verkhodanova, V., Zelezny, M., 2012. Analysis of long-distance word dependencies and pronunciation variability at conversational Russian speech recognition. In: Proc. FedCSIS-2012, Wroclav, Poland, pp. 719-725.
- (2012) Proc. FedCSIS-2012 , pp. 719-725
- Kipyatkova, I.¹ Karpov, A.² Verkhodanova, V.³ Zelezny, M.⁴

61
- 0031619917
- Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks
- Seattle
- Köhler, J., 1998. Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks. In: Proc. ICASSP, Seattle, pp. 417-420.
- (1998) Proc. ICASSP , pp. 417-420
- Köhler, J.¹

62
- 84893646836
- The basic language resource kit (BLARK) as the first milestone for the language resources roadmap
- Moscow, Russia
- Krauwer, S., 2003. The basic language resource kit (BLARK) as the first milestone for the language resources roadmap. In: Proceedings of the 2003 International Workshop Speech and Computer SPECOM-2003, Moscow, Russia, pp. 8-15.
- (2003) Proceedings of the 2003 International Workshop Speech and Computer SPECOM-2003 , pp. 8-15
- Krauwer, S.¹

63
- 77949369404
- Syntactic features for arabic speech recognition
- Merano, Italy
- Kuo, H.-K.J., Mangu, L., Emami, A., Zitouni, I., Lee, Y.-S., 2009. Syntactic features for Arabic speech recognition. In: Proc. International Workshop ASRU'2009, Merano, Italy, pp. 327-332.
- (2009) Proc. International Workshop ASRU'2009 , pp. 327-332
- Kuo, H.-K.J.¹ Mangu, L.² Emami, A.³ Zitouni, I.⁴ Lee, Y.-S.⁵

64
- 84858385446
- Unlimited vocabulary speech recognition for agglutinative languages
- NY, USA
- Kurimo, M., Puurula, A., Arisoy, E., Siivola, V., Hirsimaki, T., Pylkkonen, J., Alumae, T., Saraclar, M., 2006. Unlimited vocabulary speech recognition for agglutinative languages. In: Proc. HLT-NAACL, NY, USA.
- (2006) Proc. HLT-NAACL
- Kurimo, M.¹ Puurula, A.² Arisoy, E.³ Siivola, V.⁴ Hirsimaki, T.⁵ Pylkkonen, J.⁶ Alumae, T.⁷ Saraclar, M.⁸

65
- 44949103508
- Unsupervised segmentation of words into morphemes - Morpho challenge. Application to automatic speech recognition
- Pittsburgh, PA, USA
- Kurimo, M., et al., 2006. Unsupervised segmentation of words into morphemes - Morpho Challenge. Application to automatic speech recognition. In: Proc. Interspeech'06, Pittsburgh, PA, USA, pp. 1021-1024.
- (2006) Proc. Interspeech'06 , pp. 1021-1024
- Kurimo, M.¹

66
- 0000143923
- Issues in large vocabulary multilingual speech recognition
- Madrid
- Lamel, L., Adda-Decker, M., Gauvain, J.L., 1995. Issues in large vocabulary multilingual speech recognition. In: Proc. Eurospeech, Madrid, pp. 185-189.
- (1995) Proc. Eurospeech , pp. 185-189
- Lamel, L.¹ Adda-Decker, M.² Gauvain, J.L.³

67
- 70450194704
- Grapheme to phoneme conversion using an SMT system
- Brighton, UK
- Laurent, A., Deléglise, P., Meignier, S., 2009. Grapheme to phoneme conversion using an SMT system. In: Interspeech 2009, Brighton, UK, pp. 708-711.
- (2009) Interspeech 2009 , pp. 708-711
- Laurent, A.¹ Deléglise, P.² Meignier, S.³

68
- 69249139569
- Automatic speech recognition for under-resourced languages: Application to vietnamese language
- Le, V.-B., Besacier, L., 2009. Automatic speech recognition for under-resourced languages: application to Vietnamese language. IEEE Transactions on Audio, Speech and Language Processing 17(8), 1471-1482.
- (2009) IEEE Transactions on Audio, Speech and Language Processing , vol.17 , Issue.8 , pp. 1471-1482
- Le, V.-B.¹ Besacier, L.²

69
- 33646820243
- Using the web for fast language model construction in minority languages
- Geneva, Switzerland
- Le, V.B., Bigi, B., Besacier, L., Castelli, E., 2003. Using the Web for fast language model construction in minority languages. In: Euro-speech'03, Geneva, Switzerland, pp. 3117-3120.
- (2003) Euro-speech'03 , pp. 3117-3120
- Le, V.B.¹ Bigi, B.² Besacier, L.³ Castelli, E.⁴

70
- 85008025475
- Probabilistic modeling of Korean morphology
- Lee, D.-G., Rim, H.-C., 2009. Probabilistic modeling of Korean morphology. IEEE Transactions on Audio, Speech & Language Processing 17 (5), 945-955.
- (2009) IEEE Transactions on Audio, Speech & Language Processing , vol.17 , Issue.5 , pp. 945-955
- Lee, D.-G.¹ Rim, H.-C.²

71
- 70450181523
- Cross-language bootstrapping for unsupervised acoustic model training: Rapid development of a polish speech recognition system
- Brighton, UK
- Loof, J., Gollan, C., Ney, H., 2009. Cross-language bootstrapping for unsupervised acoustic model training: rapid development of a Polish speech recognition system. In: Interspeech 2009. Brighton, UK.
- (2009) Interspeech 2009
- Loof, J.¹ Gollan, C.² Ney, H.³

72
- 33646070918
- Modeling syntax of free word-order languages: Dependency analysis by reduction
- Springer LNAI 3658, Karlovy Vary, Czech Republic
- Lopatková, M., Plátek, M., Kuboň, V., 2005. Modeling syntax of free word-order languages: dependency analysis by reduction. In: Proc. TSD'2005, Springer LNAI 3658, Karlovy Vary, Czech Republic, pp. 140-147.
- (2005) Proc. TSD'2005 , pp. 140-147
- Lopatková, M.¹ Plátek, M.² Kuboň, V.³

73
- 56149124525
- Morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - Like hungarian
- Antwerp, Belgium
- Mihajlik, P., Fegyó, T., Tüske, Z., Ircing, P., 2007. Morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian. In: Interspeech'07, Antwerp, Belgium.
- (2007) Interspeech'07
- Mihajlik, P.¹ Fegyó, T.² Tüske, Z.³ Ircing, P.⁴

74
- 79959829092
- Recurrent neural network based language model
- Makuhari, Japan
- Mikolov, T., Karafiat, M., Burget, L., Cernocky, J., Khudanpur, S., 2010. Recurrent neural network based language model. In: Proc. INTER-SPEECH-2010, Makuhari, Japan, pp. 1045-1048.
- (2010) Proc. INTER-SPEECH-2010 , pp. 1045-1048
- Mikolov, T.¹ Karafiat, M.² Burget, L.³ Cernocky, J.⁴ Khudanpur, S.⁵

75
- 84055211743
- Acoustic modeling using deep belief networks
- Mohamed, A., Dahl, G.E., Hinton, G., 2012. Acoustic modeling using deep belief networks. IEEE Transactions on Audio, Speech, and Language Processing 20 (1), 14-22.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.E.² Hinton, G.³

76
- 85079278177
- Automatic segmentation and identification of ten languages using telephone speech
- Muthusamy, Y.K., Cole, R.A., 1992. Automatic segmentation and identification of ten languages using telephone speech. In: Second International Conference on Spoken Language Processing.
- (1992) Second International Conference on Spoken Language Processing
- Muthusamy, Y.K.¹ Cole, R.A.²

77
- 33745195515
- Language model adaptation with additional text generated by machine translation
- Taipei, Taiwan
- Nakajima, H., Yamamoto, H., Watanabe, T., 2002. Language model adaptation with additional text generated by machine translation. In: COLING 2002, Vol. 2, Taipei, Taiwan, pp. 716-722.
- (2002) COLING 2002 , vol.2 , pp. 716-722
- Nakajima, H.¹ Yamamoto, H.² Watanabe, T.³

78
- 33646767826
- A new ASR evaluation measure and minimum bayes-risk decoding for open-domain speech understanding
- PA, USA
- Nanjo, H., Kawahara, T., 2005. A new ASR evaluation measure and minimum Bayes-risk decoding for open-domain speech understanding. In: Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP-2005, PA, USA, pp. 1053-1056.
- (2005) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP-2005 , pp. 1053-1056
- Nanjo, H.¹ Kawahara, T.²

79
- 78649842703
- The US NIST 2009 (RT-09)
- The US NIST 2009 (RT-09) Rich Transcription Meeting Recognition Evaluation Plan, 2009.
- (2009) Rich Transcription Meeting Recognition Evaluation Plan

80
- 67649515539
- Morphological random forests for language modeling of inflectional languages
- Goa, India
- Oparin, I., Glembek, O., Burget, L., Černocký, J., 2008. Morphological random forests for language modeling of inflectional languages. In: Proc. IEEE Workshop on Spoken Language Technology SLT'08, Goa, India.
- (2008) Proc. IEEE Workshop on Spoken Language Technology SLT'08
- Oparin, I.¹ Glembek, O.² Burget, L.³ Černocký, J.⁴

81
- 79951777091
- Toward better crowdsourced transcription: Transcription of a year of the let's go bus information system data
- Berkeley, California, December 2010
- Parent, G., Eskenazi, M., 2010. Toward better crowdsourced transcription: transcription of a year of the Let's Go bus information system data. In: Proceedings of IEEE Workshop on Spoken Language Technology, Berkeley, California, December 2010, pp. 312-317.
- (2010) Proceedings of IEEE Workshop on Spoken Language Technology , pp. 312-317
- Parent, G.¹ Eskenazi, M.²

82
- 84892473615
- A comparative study of speech and dialed input voice interfaces in rural India
- ACM, New York, NY, USA
- Patel, N., Agarwal, S., Rajput, N., Nanavati, A., Dave, P., Parikh, T.S., 2009. A comparative study of speech and dialed input voice interfaces in rural India. In: CHI'09: Proceedings of the 27th international conference on Human factors in computing systems. ACM, New York, NY, USA, pp. 51-54.
- (2009) CHI'09: Proceedings of the 27th International Conference on Human Factors in Computing Systems , pp. 51-54
- Patel, N.¹ Agarwal, S.² Rajput, N.³ Nanavati, A.⁴ Dave, P.⁵ Parikh, T.S.⁶

83
- 77953970245
- Avaaj otalo: A field study of an interactive voice forum for small farmers in rural India
- Patel, N., Chittamuru, D., Jain, A., Dave, P., Parikh, T.S., 2010. Avaaj Otalo: A field study of an interactive voice forum for small farmers in rural India. In: CHI. ACM, pp. 733-742.
- (2010) CHI. ACM , pp. 733-742
- Patel, N.¹ Chittamuru, D.² Jain, A.³ Dave, P.⁴ Parikh, T.S.⁵

84
- 84893644519
- Investigating automatic decomposition for ASR in less represented languages
- Pittsburgh
- Pellegrini, T., Lamel, L., 2006. Investigating automatic decomposition for ASR in less represented languages. In: ICSLP'06, Pittsburgh.
- (2006) ICSLP'06
- Pellegrini, T.¹ Lamel, L.²

85
- 69249115863
- Are audio or textual training data more important for ASR in less-represented languages?
- Hanoi, Vietnam
- Pellegrini, T., Lamel, L., 2008. Are audio or textual training data more important for ASR in less-represented languages?. In: SLTU'08, Hanoi, Vietnam.
- (2008) SLTU'08
- Pellegrini, T.¹ Lamel, L.²

86
- 85008009596
- Automatic word decompounding for ASR in a morphologically rich language: Application to amharic
- Pellegrini, T., Lamel, L., 2009. Automatic word decompounding for ASR in a morphologically rich language: application to Amharic. IEEE Transactions on Audio, Speech & Language Processing 17 (5), 863-873.
- (2009) IEEE Transactions on Audio, Speech & Language Processing , vol.17 , Issue.5 , pp. 863-873
- Pellegrini, T.¹ Lamel, L.²

87
- 84858976609
- Cross-lingual portability of Chinese and english neural network features for french and german LVCSR
- USA
- Plahl, C., Schlueter, R., Ney, H., 2011. Cross-lingual portability of Chinese and English neural network features for French and German LVCSR. In: Proc. ASRU, USA.
- (2011) Proc. ASRU
- Plahl, C.¹ Schlueter, R.² Ney, H.³

88
- 84876792117
- Fast syntactic analysis for statistical language modeling via substructure sharing and uptraining
- Jeju, Korea
- Rastrow, A., Dredze, M., Khudanpur, S., 2012. Fast syntactic analysis for statistical language modeling via substructure sharing and uptraining. In: Proc. 50th Annual Meeting of Association for Computational Linguistics ACL'2012, Jeju, Korea, pp. 175-183.
- (2012) Proc. 50th Annual Meeting of Association for Computational Linguistics ACL'2012 , pp. 175-183
- Rastrow, A.¹ Dredze, M.² Khudanpur, S.³

89
- 34250720573
- Russian voice interface
- DOI 10.1134/S1054661807020216
- (Pubitemid 46964448)
- (2007) Pattern Recognition and Image Analysis , vol.17 , Issue.2 , pp. 321-336
- Ronzhin, A.L.¹ Karpov, A.A.²

90
- 34250004339
- Large vocabulary continuous speech recognition of an inflected language using stems and endings
- DOI 10.1016/j.specom.2007.02.010, PII S0167639307000428
- Rotovnik, T., Maucec, M.S., Kacix, Z., 2007. Large vocabulary continuous speech recognition of an inflected language using stems and endings. Speech Communication 49 (6), 437-452. (Pubitemid 46891622)
- (2007) Speech Communication , vol.49 , Issue.6 , pp. 437-452
- Rotovnik, T.¹ Maucec, M.S.² Kacic, Z.³

91
- 84886596580
- Developing a multilingual telephone based information retrieval system in african languages
- Roux, J.C., Botha, E.C., du Preez, J.A., 2000. Developing a multilingual telephone based information retrieval system in African languages. In: Proceedings of the Second International Conference on Language Resources and Evaluation, pp. 975-980.
- (2000) Proceedings of the Second International Conference on Language Resources and Evaluation , pp. 975-980
- Roux, J.C.¹ Botha, E.C.² Du Preez, J.A.³

92
- 78049399086
- Morphology-based and subword language modeling for turkish speech recognition
- Sak, H., Saraclar, M., Güngör, T., 2010. Morphology-based and subword language modeling for Turkish speech recognition. In: ICASSP 2010, pp. 5402-5405.
- (2010) ICASSP 2010 , pp. 5402-5405
- Sak, H.¹ Saraclar, M.² Güngör, T.³

93
- 34547535012
- Joint morphological-lexical language modeling (JMLLM) for arabic
- Sarikaya, R., Afify, M., Gao, Y., 2007. Joint morphological-lexical language modeling (JMLLM) for Arabic. In: Proc. ICASSP'07, Vol. 4, pp. 181-184.
- (2007) Proc. ICASSP'07 , vol.4 , pp. 181-184
- Sarikaya, R.¹ Afify, M.² Gao, Y.³

94
- 79959851710
- Wiktionary as a source for automatic pronunciation extraction
- Makuhari, Japan, 26-30 September 2010
- Schlippe, T., Ochs, S., Schultz, T., 2010. Wiktionary as a source for automatic pronunciation extraction. In: Interspeech 2010, Makuhari, Japan, 26-30 September 2010.
- (2010) Interspeech 2010
- Schlippe, T.¹ Ochs, S.² Schultz, T.³

95
- 84867605828
- Grapheme-to-phoneme model generation for indo-european languages
- Kyoto, Japan, 25-30 March 2012
- Schlippe, T., Ochs, S., Schultz, T., 2012a. Grapheme-to-phoneme model generation for indo-European languages. In: ICASSP 2012, Kyoto, Japan, 25-30 March 2012.
- (2012) ICASSP 2012
- Schlippe, T.¹ Ochs, S.² Schultz, T.³

96
- 84878526331
- Automatic error recovery for pronunciation dictionaries
- Portland, Oregon, 9-13 September 2012
- Schlippe, T., Ochs, S., Vu, N.T., Schultz, T., 2012b. Automatic error recovery for pronunciation dictionaries. In: Interspeech 2012, Portland, Oregon, 9-13 September 2012.
- (2012) Interspeech 2012
- Schlippe, T.¹ Ochs, S.² Vu, N.T.³ Schultz, T.⁴

97
- 84893706991
- Web-based tools and methods for rapid pronunciation dictionary creation
- http://dx.doi.org/10.1016/j.specom.2013.06.015
- Schlippe, T., Ochs, S., Schultz, T., 2013. Web-based tools and methods for rapid pronunciation dictionary creation. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.06.015.
- (2013) Speech Communication
- Schlippe, T.¹ Ochs, S.² Schultz, T.³

98
- 85009274666
- GlobalPhone: A multilingual speech and text database developed at karlsruhe university
- Schultz, T., 2002. GlobalPhone: A multilingual speech and text database developed at Karlsruhe University. In: ICSLP, pp. 345-348.
- (2002) ICSLP , pp. 345-348
- Schultz, T.¹

99
- 85013700737
- Tanja Schultz, Katrin Kirchhoff (Eds.), Elsevier, Academic Press, ISBN 13: 978-0-12-088501-5, 2006
- Schultz, T., 2006. Multilingual speech processing. In: Tanja Schultz, Katrin Kirchhoff (Eds.), Elsevier, Academic Press, ISBN 13: 978-0-12-088501-5, 2006.
- (2006) Multilingual Speech Processing
- Schultz, T.¹

100
- 43849102326
- SPICE: Web-based tools for rapid language adaptation in speech processing systems
- Antwerp, Belgium
- Schultz, T., Black, A.W., Badaskar, S., Hornyak, M., Kominek, J., 2007. SPICE: web-based tools for rapid language adaptation in speech processing systems. In: Interspeech 2007, Antwerp, Belgium.
- (2007) Interspeech 2007
- Schultz, T.¹ Black, A.W.² Badaskar, S.³ Hornyak, M.⁴ Kominek, J.⁵

101
- 84890463379
- GlobalPhone: A multilingual text & speech database in 20 languages
- Vancouver, Canada
- Schultz, T., Vu, N.T., Schlippe, T., 2013. GlobalPhone: A multilingual text & speech database in 20 languages. In: ICASSP 2013, Vancouver, Canada.
- (2013) ICASSP 2013
- Schultz, T.¹ Vu, N.T.² Schlippe, T.³

102
- 0001216191
- Language independent and language adaptive LVCSR
- Sydney
- Schultz, T., Waibel, A., 1998. Language independent and language adaptive LVCSR. In: Proc. ICSLP, Sydney, pp. 1819-1822.
- (1998) Proc. ICSLP , pp. 1819-1822
- Schultz, T.¹ Waibel, A.²

103
- 0035426931
- Language-independent and language-adaptive acoustic modeling for speech recognition
- DOI 10.1016/S0167-6393(00)00094-7, PII S0167639300000947
- Schultz, T., Waibel, A., 2001. Language independent and language adaptive acoustic modeling for speech recognition. Speech Communication 35, 31-51. (Pubitemid 32599645)
- (2001) Speech Communication , vol.35 , Issue.1-2 , pp. 31-51
- Schultz, T.¹ Waibel, A.²

104
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- HI, USA
- Seide, F., Li, G., Chen, X., Yu, D., 2011. Feature engineering in context-dependent deep neural networks for conversational speech transcription. In: Proc. ASRU-2011 International Workshop, HI, USA, pp. 24-29.
- (2011) Proc. ASRU-2011 International Workshop , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

105
- 84867333761
- Universal attribute characterization of spoken languages for automatic spoken language recognition
- Siniscalchi, S.M., Reed, J., Svendsen, T., Lee, C.-H., 2013. Universal attribute characterization of spoken languages for automatic spoken language recognition. Computer Speech & Language 27 (1), 209-227.
- (2013) Computer Speech & Language , vol.27 , Issue.1 , pp. 209-227
- Siniscalchi, S.M.¹ Reed, J.² Svendsen, T.³ Lee, C.-H.⁴

106
- 34047146422
- Robust ASR using Support Vector Machines
- DOI 10.1016/j.specom.2007.01.013, PII S0167639307000246
- Solera-Urena, R., Martin-Iglesias, D., Gallardo-Antolin, A., Pelaez-Moreno, C., Diaz-de-Maria, F., 2007. Robust ASR using support vector machines. Speech Communication 49 (4), 253-267. (Pubitemid 46517709)
- (2007) Speech Communication , vol.49 , Issue.4 , pp. 253-267
- Solera-Urena, R.¹ Martin-Iglesias, D.² Gallardo-Antolin, A.³ Pelaez-Moreno, C.⁴ Diaz-de-Maria, F.⁵

107
- 84874250689
- Word segmentation through cross-lingual word-to-phoneme alignment
- Miami, Florida, 2-5 December 2012
- Stahlberg, F., Schlippe, T., Vogel, S., Schultz, T., 2012. Word segmentation through cross-lingual word-to-phoneme alignment. In: Proceedings of The Fourth IEEE Workshop on Spoken Language Technology (SLT 2012), Miami, Florida, 2-5 December 2012.
- (2012) Proceedings of the Fourth IEEE Workshop on Spoken Language Technology (SLT 2012)
- Stahlberg, F.¹ Schlippe, T.² Vogel, S.³ Schultz, T.⁴

108
- 84883149008
- Pronunciation extraction from phoneme sequences through cross-lingual word-to-phoneme alignment
- Tarragona, Spain, 29-31 July 2013
- Stahlberg, F., Schlippe, T., Vogel, S., Schultz, T., 2013. Pronunciation extraction from phoneme sequences through cross-lingual word-to-phoneme alignment. In: Proceedings of the 1st international conference on statistical language and speech processing (SLSP 2013), Tarragona, Spain, 29-31 July 2013.
- (2013) Proceedings of the 1st International Conference on Statistical Language and Speech Processing (SLSP 2013)
- Stahlberg, F.¹ Schlippe, T.² Vogel, S.³ Schultz, T.⁴

109
- 85030406878
- Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables
- Stephenson, T.A., Escofet, J., Magimai-Doss, M., Bourlard, H., 2002. Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables, Technical Report Idiap-RR-24-2002, p. 10.
- (2002) Technical Report Idiap-RR-24-2002 , pp. 10
- Stephenson, T.A.¹ Escofet, J.² Magimai-Doss, M.³ Bourlard, H.⁴

110
- 33947619591
- Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons
- Stolcke, A., Grezl, F., Hwang, M.-Y., Lei, X., Morgan, N., Vergyri, D., 2006. Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons. In: Proc. ICASSP 2006.
- (2006) Proc. ICASSP 2006
- Stolcke, A.¹ Grezl, F.² Hwang, M.-Y.³ Lei, X.⁴ Morgan, N.⁵ Vergyri, D.⁶

111
- 85133311295
- Integrating thai grapheme based acoustic models into the ML-mix framework - For language independent and cross-language ASR
- Hanoi, Vietnam
- Stüker, S., 2008. Integrating Thai grapheme based acoustic models into the ML-mix framework - for language independent and cross-language ASR. In: SLTU'08, Hanoi, Vietnam.
- (2008) SLTU'08
- Stüker, S.¹

112
- 0141591550
- Multilingual articulatory features
- Stüker, S., Schultz, T., Metze, F., Waibel, A., 2003. Multilingual articulatory features, In: ICASSP 2003.
- (2003) ICASSP 2003
- Stüker, S.¹ Schultz, T.² Metze, F.³ Waibel, A.⁴

113
- 0141591550
- Multilingual articulatory features
- Stuker, S., Schultz, T., Metze, F., Waibel, A., 2003. Multilingual articulatory features. In: Proceedings. ICASSP'03 IEEE International Conference on Acoustics, Speech, and, Signal Processing.
- (2003) Proceedings. ICASSP'03 IEEE International Conference on Acoustics, Speech, And, Signal Processing
- Stuker, S.¹ Schultz, T.² Metze, F.³ Waibel, A.⁴

114
- 70450198124
- Human translations guided language discovery for ASR systems
- Brighton, UK
- Stüker, S., Besacier, L., Waibel, A., 2009. Human translations guided language discovery for ASR systems. In: InterSpeech-2009, Brighton, UK.
- (2009) InterSpeech-2009
- Stüker, S.¹ Besacier, L.² Waibel, A.³

115
- 70450172181
- Localization of speech recognition in spoken dialog systems: How machine translation can make our lives
- Brighton, UK
- Suenderman, K., Liscombe, J., 2009. Localization of speech recognition in spoken dialog systems: how machine translation can make our lives. In: Interspeech 2009, Brighton, UK, pp. 1475-1478.
- (2009) Interspeech 2009 , pp. 1475-1478
- Suenderman, K.¹ Liscombe, J.²

116
- 0141703236
- Finite-state transducer based modeling of morphosyntax with applications to hungarian LVCSR
- HongKong, China
- Szarvas, M., Furui, S., 2003. Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR. In: Proc. ICASSP, HongKong, China, pp. 368-371.
- (2003) Proc. ICASSP , pp. 368-371
- Szarvas, M.¹ Furui, S.²

117
- 85071192719
- Syllable-based and hybrid acoustic models for amharic speech recognition
- Cape-Town, South Africa
- Tachbelie, M., Abate, S.T., Besacier, L., Rossato, S., 2012. Syllable-based and hybrid acoustic models for Amharic speech recognition. In: SLTU - Workshop on Spoken Language Technologies for Under-Resourced Languages, Cape-Town, South Africa.
- (2012) SLTU - Workshop on Spoken Language Technologies for Under-Resourced Languages
- Tachbelie, M.¹ Abate, S.T.² Besacier, L.³ Rossato, S.⁴

118
- 84893678734
- Using different acoustic, lexical and language modeling units for ASR of an under-resourced language - Amharic
- http://dx.doi.org/10.1016/j.specom.2013.01.008
- Tachbelie, M., Abate, S.T., Besacier, L., 2013. Using different acoustic, lexical and language modeling units for ASR of an under-resourced language - Amharic. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.01.008.
- (2013) Speech Communication
- Tachbelie, M.¹ Abate, S.T.² Besacier, L.³

119
- 78649856491
- On morph-based LVCSR improvements
- Malaysia
- Tarjan, B., Mihajlik, P., 2010. On morph-based LVCSR improvements. In: Proc. 2nd Int. Workshop on Spoken Languages Technologies for Under-resourced Languages SLTU-2010, Malaysia, pp. 10-16.
- (2010) Proc. 2nd Int. Workshop on Spoken Languages Technologies for Under-resourced Languages SLTU-2010 , pp. 10-16
- Tarjan, B.¹ Mihajlik, P.²

120
- 84867606552
- Multilingual MLP features for low-resource LVCSR systems
- Japan
- Thomas, S., Ganapathy, S., Hermansky, H., 2012a. Multilingual MLP features for low-resource LVCSR systems. In: Proc. ICASSP, Japan.
- (2012) Proc. ICASSP
- Thomas, S.¹ Ganapathy, S.² Hermansky, H.³

121
- 84878392008
- Data-driven posterior features for low resource speech recognition applications
- USA
- Thomas, S., Ganapathy, S., Jansen, A., Hermansky, H., 2012b. Data-driven posterior features for low resource speech recognition applications. In: Proc. Interspeech, USA.
- (2012) Proc. Interspeech
- Thomas, S.¹ Ganapathy, S.² Jansen, A.³ Hermansky, H.⁴

122
- 84858985238
- Cross-lingual portability of MLP-based tandem features - A case study for english and hungarian
- Toth, L., Frankel, J., Gosztolya, G., King, S., 2008. Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian. In: Proc. Interspeech.
- (2008) Proc. Interspeech
- Toth, L.¹ Frankel, J.² Gosztolya, G.³ King, S.⁴

123
- 0035101535
- A survey of hybrid ANN/HMM models for automatic speech recognition
- Trentin, E., Gori, M., 2001. A survey of hybrid ANN/HMM models for automatic speech recognition. Neurocomputing 37 (1), 91-126.
- (2001) Neurocomputing , vol.37 , Issue.1 , pp. 91-126
- Trentin, E.¹ Gori, M.²

124
- 85133315126
- Pooling ASR data for closely related languages
- Penang, Malaysia, May 2010
- van Heerden, C., Kleynhans, N., Barnard, E., Davel, M., 2010. Pooling ASR data for closely related languages. In: Proceedings of the Workshop on Spoken Languages Technologies for Under-Resourced Languages (SLTU 2010), Penang, Malaysia, May 2010, pp. 17-23.
- (2010) Proceedings of the Workshop on Spoken Languages Technologies for Under-Resourced Languages (SLTU 2010) , pp. 17-23
- Van Heerden, C.¹ Kleynhans, N.² Barnard, E.³ Davel, M.⁴

125
- 84893679840
- Predicting utterance pitch targets in yoruba for tone realisation in speech synthesis
- http://dx.doi.org/10.1016/j.specom.2013.01.009
- van Niekerk, D.R., Barnard, E., 2013. Predicting utterance pitch targets in Yoruba for tone realisation in speech synthesis, Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.01.009.
- (2013) Speech Communication
- Van Niekerk, D.R.¹ Barnard, E.²

126
- 85009110467
- Morphology-based language modeling for arabic speech recognition
- Vergyri, D., Kirchhoff, K., Duh, K., Stolcke, A., 2004. Morphology-based language modeling for Arabic speech recognition. In: Proc. ICSLP'04, pp. 2245-2248.
- (2004) Proc. ICSLP'04 , pp. 2245-2248
- Vergyri, D.¹ Kirchhoff, K.² Duh, K.³ Stolcke, A.⁴

127
- 84874226274
- The language-independent bottleneck features
- USA
- Vesely, K., Karafiat, M., Grezl, F., Janda, M., Egorova, E., 2012. The language-independent bottleneck features. In: Proc. SLT, USA.
- (2012) Proc. SLT
- Vesely, K.¹ Karafiat, M.² Grezl, F.³ Janda, M.⁴ Egorova, E.⁵

128
- 79951796711
- Multilingual A-stabil: A new confidence score for multilingual unsupervised training
- USA
- Vu, N.T., Kraus, F., Schultz, T., 2010. Multilingual A-stabil: A new confidence score for multilingual unsupervised training. In: Proc. SLT, USA.
- (2010) Proc. SLT
- Vu, N.T.¹ Kraus, F.² Schultz, T.³

129
- 84865764419
- Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training
- Italy
- Vu, N.T., Kraus, F., Schultz, T., 2011. Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training. In: Proc. Interspeech, Italy.
- (2011) Proc. Interspeech
- Vu, N.T.¹ Kraus, F.² Schultz, T.³

130
- 84890464069
- Multilingual bottle-neck feature for under resourced languages
- South Africa
- Vu, N.T., Metze, F., Schultz, T., 2012a. Multilingual bottle-neck feature for under resourced languages. In: Proc. SLTU, South Africa.
- (2012) Proc. SLTU
- Vu, N.T.¹ Metze, F.² Schultz, T.³

131
- 84878559540
- An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance
- USA
- Vu, N.T., Breiter, W., Metze, F., Schultz, T., 2012b. An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance. In: Proc. Interspeech, USA.
- (2012) Proc. Interspeech
- Vu, N.T.¹ Breiter, W.² Metze, F.³ Schultz, T.⁴

132
- 84856280064
- An evaluation of cross-language adaptation for rapid HMM development in a new language
- Adelaide
- Wheatley, B., Kondo, K., Anderson, W., Muthusamy, Y., 1994. An evaluation of cross-language adaptation for rapid HMM development in a new language. In: Proc. ICASSP, Adelaide, pp. 237-240.
- (1994) Proc. ICASSP , pp. 237-240
- Wheatley, B.¹ Kondo, K.² Anderson, W.³ Muthusamy, Y.⁴

133
- 0003903706
- Ph.D. thesis, Cambridge Univ.
- Whittaker, E.W.D., 2000. Statistical language modelling for automatic speech recognition of Russian and English. Ph.D. thesis, Cambridge Univ., p. 140.
- (2000) Statistical Language Modelling for Automatic Speech Recognition of Russian and English , pp. 140
- Whittaker, E.W.D.¹

134
- 0034848043
- Efficient class-based language modelling for very large vocabularies
- Whittaker, E.W.D., Woodland, P.C., 2001. Efficient class-based language modelling for very large vocabularies. In: ICASSP-2001, Salt Lake City, USA, pp. 545-548. (Pubitemid 32839308)
- (2001) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1 , pp. 545-548
- Whittaker, E.W.D.¹ Woodland, P.C.²

135
- 55849099996
- Vowel variations in southern sotho: An acoustical investigation
- Wissing, D., Barnard, E., 2008. Vowel variations in Southern Sotho: an acoustical investigation. Southern African Linguistics and Applied Language Studies 26 (2), 255-265.
- (2008) Southern African Linguistics and Applied Language Studies , vol.26 , Issue.2 , pp. 255-265
- Wissing, D.¹ Barnard, E.²

136
- 0030718943
- Multilingual large vocabulary speech recognition: The European SQALE project
- Young, S.J., Adda-Decker, M., Aubert, X., Dugast, C., Gauvain, J.L., Kershaw, D.J., Lamel, L., Leeuwen, D.A., Pye, D., Robinson, A.J., Steeneken, H.J.M., Woodland, P.C., 1997. Multilingual large vocabulary speech recognition: the European SQALE project. Computer Speech & Language 11, 73-89. (Pubitemid 127375894)
- (1997) Computer Speech and Language , vol.11 , Issue.1 , pp. 73-89
- Young, S.J.¹ Adda-Dekker, M.² Aubert, X.³ Dugast, C.⁴ Gauvain, J.-L.⁵ Kershaw, D.J.⁶ Lamel, L.⁷ Leeuwen, D.A.⁸ Pye, D.⁹ Robinson, A.J.¹⁰ Steeneken, H.J.M.¹¹ Woodland, P.C.¹²

137
- 85075927145
- HMMs and related speech recognition technologies
- Springer-Verlag, Berlin Heidelberg
- Young, S., 2008. HMMs and related speech recognition technologies. In: Springer Handbook of Speech Processing. Springer-Verlag, Berlin Heidelberg, pp. 539-557.
- (2008) Springer Handbook of Speech Processing , pp. 539-557
- Young, S.¹

138
- 84867329143
- Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition
- Yu, D., Siniscalchi, S.M., Deng, L., Lee, C.-H., 2012. Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition. In: Proc. ICASSP-2012, pp. 4169-4172.
- (2012) Proc. ICASSP-2012 , pp. 4169-4172
- Yu, D.¹ Siniscalchi, S.M.² Deng, L.³ Lee, C.-H.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.