-
1
-
-
44949086643
-
Automatic transcription of Somali language
-
Pittsburgh, PA, USA
-
Abdillahi, N., Nocera, P., Bonastre, J.-F., 2006. Automatic transcription of Somali language. In: ICSLP'06, Pittsburgh, PA, USA, pp. 289-292.
-
(2006)
ICSLP'06
, pp. 289-292
-
-
Abdillahi, N.1
Nocera, P.2
Bonastre, J.-F.3
-
2
-
-
78651107637
-
Uyghur morpheme-based language models and ASR
-
Beijing, China
-
Ablimit, M., Neubig, G., Mimura, M., Mori, S., Kawahara, T., Hamdulla, A., 2010. Uyghur Morpheme-based language models and ASR. In: Proc. IEEE 10th International Conference on Signal Processing (ICSP), Beijing, China, pp. 581-584.
-
(2010)
Proc. IEEE 10th International Conference on Signal Processing (ICSP)
, pp. 581-584
-
-
Ablimit, M.1
Neubig, G.2
Mimura, M.3
Mori, S.4
Kawahara, T.5
Hamdulla, A.6
-
3
-
-
85009202964
-
A corpus-based decompounding algorithm for German lexical modeling in LVCSR
-
Geneva, Switzerland
-
Adda-Decker, M., 2003. A corpus-based decompounding algorithm for German lexical modeling in LVCSR. In: Proc. Eurospeech-2003, Geneva, Switzerland, pp. 257-260.
-
(2003)
Proc. Eurospeech-2003
, pp. 257-260
-
-
Adda-Decker, M.1
-
4
-
-
33745683825
-
A unified language model for large vocabulary continuous speech recognition of Turkish
-
Arisoy, E., Dutagaci, H., Arslan, L., 2006. A unified language model for large vocabulary continuous speech recognition of Turkish. Signal Processing 86 (10), 2844-2862.
-
(2006)
Signal Processing
, vol.86
, Issue.10
, pp. 2844-2862
-
-
Arisoy, E.1
Dutagaci, H.2
Arslan, L.3
-
5
-
-
85075436378
-
Deep neural network language models
-
Montreal, Canada
-
Arisoy, E., Sainath, T.N., Kingsbury, B., Ramabhadran, B., 2012. Deep neural network language models. In: Proc. NAACL-HLT 2012 Workshop, Montreal, Canada, pp. 20-28.
-
(2012)
Proc. NAACL-HLT 2012 Workshop
, pp. 20-28
-
-
Arisoy, E.1
Sainath, T.N.2
Kingsbury, B.3
Ramabhadran, B.4
-
6
-
-
70450219527
-
ASR corpus design for resource-scarce languages
-
Barnard, E., Davel, M., van Heerden, C., 2009. ASR corpus design for resource-scarce languages. In: Proc. Interspeech, pp. 2847-2850.
-
(2009)
Proc. Interspeech
, pp. 2847-2850
-
-
Barnard, E.1
Davel, M.2
Van Heerden, C.3
-
7
-
-
77957968025
-
Speech technology for information access: A South African case study
-
Palo Alto, California, March 2010
-
Barnard, E., Davel, M., van Huyssteen, G.B., 2010. Speech technology for information access: A South African case study. In: Proceedings of the AAAI Spring Symposium on Artificial Intelligence for Development (AI-D), Palo Alto, California, March 2010, pp. 8-13.
-
(2010)
Proceedings of the AAAI Spring Symposium on Artificial Intelligence for Development (AI-D)
, pp. 8-13
-
-
Barnard, E.1
Davel, M.2
Van Huyssteen, G.B.3
-
8
-
-
0030355995
-
Multilingual speech recognition at Dragon Systems
-
Philadelphia
-
Barnett, J., Corrada, A., Gao, G., Gillik, L., Ito, Y., Lowe, S., Manganaro, L., Peskin, B., 1996. Multilingual speech recognition at Dragon Systems. In: Proc. ICSLP, Philadelphia, pp. 2191-2194.
-
(1996)
Proc. ICSLP
, pp. 2191-2194
-
-
Barnett, J.1
Corrada, A.2
Gao, G.3
Gillik, L.4
Ito, Y.5
Lowe, S.6
Manganaro, L.7
Peskin, B.8
-
10
-
-
48749106056
-
Towards speech translation of non written languages
-
Aruba, December 2006
-
Besacier, L., Zhou, B., Gao, Y., 2006. Towards speech translation of non written languages. In: IEEE/ACL SLT 2006. Aruba, December 2006.
-
(2006)
IEEE/ACL SLT 2006
-
-
Besacier, L.1
Zhou, B.2
Gao, Y.3
-
12
-
-
0003132144
-
Multilingual speech recognition: The 1996 Byblos Callhome system
-
Rhodes, Greece
-
Billa, J., Ma, K., McDonough, J., Zavaliagkos, G., Miller, D.R., Ross, K.N., El-Jaroudi, A., 1997. Multilingual speech recognition: the 1996 Byblos Callhome system. In: Proc. Eurospeech-1997, Rhodes, Greece, pp. 363-366.
-
(1997)
Proc. Eurospeech-1997
, pp. 363-366
-
-
Billa, J.1
Ma, K.2
McDonough, J.3
Zavaliagkos, G.4
Miller, D.R.5
Ross, K.N.6
El-Jaroudi, A.7
-
13
-
-
84893666435
-
Transcribing Southern Min speech corpora with a web-based language learning system
-
Hanoi, Vietnam
-
Cai, J., 2008. Transcribing Southern Min speech corpora with a web-based language learning system. In: SLTU'08, Hanoi, Vietnam.
-
(2008)
SLTU'08
-
-
Cai, J.1
-
14
-
-
85030413663
-
Turkish LVCSR: Towards better speech recognition for agglutinative languages
-
Carki, K., Geutner, P., Schultz, T., 2000. Turkish LVCSR: towards better speech recognition for agglutinative languages. In: IEEE ICASSP.
-
(2000)
IEEE ICASSP
-
-
Carki, K.1
Geutner, P.2
Schultz, T.3
-
15
-
-
69249125883
-
Unsupervised adaptive speech technology for limited resource languages: A case study for Tamil
-
Hanoi, Vietnam
-
Cetin, O., 2008. Unsupervised adaptive speech technology for limited resource languages: A case study for Tamil. In: SLTU'08, Hanoi, Vietnam.
-
(2008)
SLTU'08
-
-
Cetin, O.1
-
17
-
-
25844455735
-
Syntax-based language models for machine translation
-
New Orleans, USA
-
Charniak, E., Knight, K., Yamada, K., 2003. Syntax-based language models for machine translation. In: Proc. IX MT Summit, New Orleans, USA, pp. 40-46.
-
(2003)
Proc. IX MT Summit
, pp. 40-46
-
-
Charniak, E.1
Knight, K.2
Yamada, K.3
-
20
-
-
0030638088
-
Towards a universal speech recognizer for multiple languages
-
Santa Barbara, CA
-
Cohen, P., Dharanipragada, S., Gros, J., Monkowski, M., Neti, C., Roukos, S., Ward, T., 1997. Towards a universal speech recognizer for multiple languages. In: Proc. Automatic Speech Recognition and Understanding (ASRU), Santa Barbara, CA, pp. 591-598.
-
(1997)
Proc. Automatic Speech Recognition and Understanding (ASRU)
, pp. 591-598
-
-
Cohen, P.1
Dharanipragada, S.2
Gros, J.3
Monkowski, M.4
Neti, C.5
Roukos, S.6
Ward, T.7
-
22
-
-
33750359664
-
Unsupervised morpheme segmentation and morphology induction from text corpora using Morfessor 1.0
-
Helsinki University of Technology, Finland
-
Creutz, M., Lagus, K., 2005. Unsupervised morpheme segmentation and morphology induction from text corpora using Morfessor 1.0. Computer and Information Science, Report A81, Helsinki University of Technology, Finland.
-
(2005)
Computer and Information Science, Report A81
-
-
Creutz, M.1
Lagus, K.2
-
23
-
-
37849048345
-
Morph-based speech recognition and modeling of out-of-vocabulary words across languages
-
Article No. 3
-
Creutz, M., Hirsimaki, T., Kurimo, M., Puurula, A., Pylkkonen, J., Siivola, V., Varjokallio, M., Arisoy, E., Saraclar, M., Stolcke, A., 2007. Morph-based speech recognition and modeling of out-of-vocabulary words across languages. ACM Transactions on Speech and Language Processing 5 (1). Article No. 3.
-
(2007)
ACM Transactions on Speech and Language Processing
, vol.5
, Issue.1
-
-
Creutz, M.1
Hirsimaki, T.2
Kurimo, M.3
Puurula, A.4
Pylkkonen, J.5
Siivola, V.6
Varjokallio, M.7
Arisoy, E.8
Saraclar, M.9
Stolcke, A.10
-
25
-
-
84858978033
-
Investigating the role of machine translated text in ASR domain adaptation: Unsupervised and semi-supervised methods
-
Hawaii, USA
-
Cucu, H., Besacier, L., Burileanu, C., Buzo, A., 2011. Investigating the role of machine translated text in ASR domain adaptation: unsupervised and semi-supervised methods. In: Proc. ASRU 2011, Hawaii, USA.
-
(2011)
Proc. ASRU 2011
-
-
Cucu, H.1
Besacier, L.2
Burileanu, C.3
Buzo, A.4
-
26
-
-
84893681586
-
ASR domain adaptation methods for low-resourced languages: Application to Romanian language
-
Bucharest, Romania
-
Cucu, H., Besacier, L., Burileanu, C., Buzo, A., 2012. ASR domain adaptation methods for low-resourced languages: application to Romanian language. In: EUSIPCO'2012, Bucharest, Romania.
-
(2012)
EUSIPCO'2012
-
-
Cucu, H.1
Besacier, L.2
Burileanu, C.3
Buzo, A.4
-
27
-
-
84893703276
-
SMT-based ASR domain adaptation methods for under-resourced languages: Application to Romanian
-
http://dx.doi.org/10.1016/j.specom.2013.05.003
-
Cucu, H., Buzo, A., Besacier, L., Burileanu, C., 2013. SMT-based ASR domain adaptation methods for under-resourced languages: application to Romanian. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.05.003.
-
(2013)
Speech Communication
-
-
Cucu, H.1
Buzo, A.2
Besacier, L.3
Burileanu, C.4
-
28
-
-
84865744412
-
Efficient harvesting of internet audio for resource-scarce ASR
-
Davel, M.H., van Heerden, C., Kleynhans, N., Barnard, E., 2011. Efficient harvesting of Internet audio for resource-scarce ASR. In: Proc. Interspeech, pp. 3153-3156.
-
(2011)
Proc. Interspeech
, pp. 3153-3156
-
-
Davel, M.H.1
Van Heerden, C.2
Kleynhans, N.3
Barnard, E.4
-
29
-
-
84865706565
-
Woefzela - An open-source platform for ASR data collection in the developing world
-
De Vries, N.J., Badenhorst, J., Davel, M.H., Barnard, E., De Waal, A., 2011. Woefzela - an open-source platform for ASR data collection in the developing world. In: Proc. Interspeech, pp. 3177-3180.
-
(2011)
Proc. Interspeech
, pp. 3177-3180
-
-
De Vries, N.J.1
Badenhorst, J.2
Davel, M.H.3
Barnard, E.4
De Waal, A.5
-
30
-
-
84893666907
-
A smartphone-based ASR data collection tool for under-resourced languages
-
http://dx.doi.org/10.1016/j.specom.2013.07.001
-
De Vries, N.J., Davel, M.H., Badenhorst, J., Basson, W.D., de Wet, F., Barnard, E., De Waal, A., 2013. A smartphone-based ASR data collection tool for under-resourced languages, Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.07.001.
-
(2013)
Speech Communication
-
-
De Vries, N.J.1
Davel, M.H.2
Badenhorst, J.3
Basson, W.D.4
De Wet, F.5
Barnard, E.6
De Waal, A.7
-
31
-
-
51449096957
-
The character as an appropriate unit of processing for non-segmenting languages
-
Tokyo, Japan
-
Denoual, E., Lepage, Y., 2006. The character as an appropriate unit of processing for non-segmenting languages. In: NLP Annual Meeting, Tokyo, Japan, pp. 731-734.
-
(2006)
NLP Annual Meeting
, pp. 731-734
-
-
Denoual, E.1
Lepage, Y.2
-
32
-
-
84894590340
-
Unsupervised SMT for a low-resourced language pair
-
Penang, Malaysia
-
Do, T., Besacier, L., Castelli, E., 2010. Unsupervised SMT for a low-resourced language pair. In: Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU), Penang, Malaysia.
-
(2010)
Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU)
-
-
Do, T.1
Besacier, L.2
Castelli, E.3
-
33
-
-
84885550611
-
The Philips large-vocabulary recognition system for American English, French, and German
-
Madrid
-
Dugast, C., Aubert, X., Kneser, R., 1995. The Philips large-vocabulary recognition system for American English, French, and German. In: Proc. Eurospeech, Madrid, pp. 197-200.
-
(1995)
Proc. Eurospeech
, pp. 197-200
-
-
Dugast, C.1
Aubert, X.2
Kneser, R.3
-
34
-
-
84893670051
-
Statistical parametric speech synthesis for Ibibio
-
http://dx.doi.org/10.1016/j.specom.2013.02.003
-
Ekpenyong, M., Urua, E.-A., Watts, O., King, S., Yamagishi, J., 2013. Statistical parametric speech synthesis for Ibibio, Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.02.003.
-
(2013)
Speech Communication
-
-
Ekpenyong, M.1
Urua, E.-A.2
Watts, O.3
King, S.4
Yamagishi, J.5
-
35
-
-
1942448479
-
Hybrid SVM/HMM architectures for speech recognition
-
Ganapathiraju, A., Hamaker, J., Picone, J., 2000. Hybrid SVM/HMM architectures for speech recognition. In: Proceedings of Speech Transcription Workshop, pp. 504-507.
-
(2000)
Proceedings of Speech Transcription Workshop
, pp. 504-507
-
-
Ganapathiraju, A.1
Hamaker, J.2
Picone, J.3
-
37
-
-
84893680491
-
Using automatic speech recognition for phonological purposes: Study of vowel length in Punu (Bantu B40)
-
New Mexico (US), July 2010
-
Gelas, H., Besacier, L., Rossato, S., Pellegrino, F., 2010. Using automatic speech recognition for phonological purposes: study of vowel length in Punu (Bantu B40). In: Laphon 12, New Mexico (US), July 2010.
-
(2010)
Laphon 12
-
-
Gelas, H.1
Besacier, L.2
Rossato, S.3
Pellegrino, F.4
-
38
-
-
84865722538
-
Quality assessment of crowdsourcing transcriptions for African languages
-
Italy, 28-31 August 2011
-
Gelas, H., Teferra Abate, S., Besacier, L., Pellegrino, F., 2011. Quality assessment of crowdsourcing transcriptions for African languages. In: Interspeech 2011 Florence, Italy, 28-31 August 2011.
-
(2011)
Interspeech 2011 Florence
-
-
Gelas, H.1
Teferra Abate, S.2
Besacier, L.3
Pellegrino, F.4
-
39
-
-
85030406898
-
A hierarchical exemplar-based sparse model of speech with an application to ASR
-
HI, USA
-
Gemmeke, J.F., Van hamme, H., 2011. A hierarchical exemplar-based sparse model of speech with an application to ASR. IEEE ASRU 2011, HI, USA.
-
(2011)
IEEE ASRU 2011
-
-
Gemmeke, J.F.1
Van Hamme, H.2
-
40
-
-
70349227690
-
Web-derived pronunciations
-
Ghoshal, A., Jansche, M., Khudanpur, S., Riley, M., Ulinski, M., 2009. Web-derived pronunciations. In: IEEE ICASSP.
-
(2009)
IEEE ICASSP
-
-
Ghoshal, A.1
Jansche, M.2
Khudanpur, S.3
Riley, M.4
Ulinski, M.5
-
41
-
-
84893699024
-
Multiple pronunciation model for Amharic speech recognition system
-
Hanoi, Vietnam
-
Gizaw, S., 2008. Multiple pronunciation model for Amharic speech recognition system. In: SLTU 2008, Hanoi, Vietnam.
-
(2008)
SLTU 2008
-
-
Gizaw, S.1
-
42
-
-
0029354754
-
Multi-lingual spoken language understanding in the MIT Voyager system
-
Glass, J., Flammia, G., Goodine, D., Phillips, M., Polifroni, J., Sakai, S., Seneff, S., Zue, V., 1995. Multi-lingual spoken language understanding in the MIT Voyager system. Speech Communication 17, 1-18.
-
(1995)
Speech Communication
, vol.17
, pp. 1-18
-
-
Glass, J.1
Flammia, G.2
Goodine, D.3
Phillips, M.4
Polifroni, J.5
Sakai, S.6
Seneff, S.7
Zue, V.8
-
43
-
-
85016587886
-
SWITCHBOARD: Telephone speech corpus for research and development
-
Godfrey, J.J., Holliman, E.C., McDaniel, J., 1992. SWITCHBOARD: telephone speech corpus for research and development. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, pp. 517-520.
-
(1992)
IEEE International Conference on Acoustics, Speech, and Signal Processing
, vol.1
, pp. 517-520
-
-
Godfrey, J.J.1
Holliman, E.C.2
McDaniel, J.3
-
44
-
-
0030640721
-
A multilingual phoneme and model set: Towards a universal base for automatic speech recognition
-
Santa Barbara, CA
-
Gokcen, S., Gokcen, J., 1997. A multilingual phoneme and model set: towards a universal base for automatic speech recognition. In: Proc. Automatic Speech Recognition and Understanding (ASRU), Santa Barbara, CA, pp. 599-603.
-
(1997)
Proc. Automatic Speech Recognition and Understanding (ASRU)
, pp. 599-603
-
-
Gokcen, S.1
Gokcen, J.2
-
45
-
-
34547548235
-
Probabilistic and bottle-neck features for LVCSR of meetings
-
USA
-
Grezl, F., et al., 2007. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. ICASSP, USA.
-
(2007)
Proc. ICASSP
-
-
Grezl, F.1
-
46
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
Turkey
-
Hermansky, H., Ellis, D.P.W., Sharma, S., 2000. Tandem connectionist feature extraction for conventional HMM systems. In: Proc. ICASSP, Turkey.
-
(2000)
Proc. ICASSP
-
-
Hermansky, H.1
Ellis, D.P.W.2
Sharma, S.3
-
47
-
-
0141476926
-
Accent modeling based on pronunciation dictionary adaptation for large vocabulary Mandarin speech recognition
-
Beijing, China
-
Huang, C., Chang, E., Zhou, J., Lee, K.-F., 2000. Accent modeling based on pronunciation dictionary adaptation for large vocabulary Mandarin speech recognition. In: Proc. INTERSPEECH-2000, Beijing, China, pp. 818-821.
-
(2000)
Proc. INTERSPEECH-2000
, pp. 818-821
-
-
Huang, C.1
Chang, E.2
Zhou, J.3
Lee, K.-F.4
-
48
-
-
77950944695
-
Morpho-syntactic postprocessing of N-best lists for improved French automatic speech recognition
-
Huet, S., Gravier, G., Sebillot, P., 2010. Morpho-syntactic postprocessing of N-best lists for improved French automatic speech recognition. Computer Speech and Language 24 (4), 663-684.
-
(2010)
Computer Speech and Language
, vol.24
, Issue.4
, pp. 663-684
-
-
Huet, S.1
Gravier, G.2
Sebillot, P.3
-
49
-
-
79959826604
-
Building transcribed speech corpora quickly and cheaply for many languages
-
Makuhari, Japan
-
Hughes, T., Nakajima, K., Ha, L., Moreno, P., LeBeau, M., 2010. Building transcribed speech corpora quickly and cheaply for many languages. In: Proc. Interspeech, Makuhari, Japan, pp. 1914-1917.
-
(2010)
Proc. Interspeech
, pp. 1914-1917
-
-
Hughes, T.1
Nakajima, K.2
Ha, L.3
Moreno, P.4
LeBeau, M.5
-
51
-
-
69249138826
-
Development of a speech recognition system for Icelandic using machine translated text
-
Hanoi, Vietnam
-
Jensson, A., 2008. Development of a speech recognition system for Icelandic using machine translated text. In: SLTU'08, Hanoi, Vietnam.
-
(2008)
SLTU'08
-
-
Jensson, A.1
-
52
-
-
78649995680
-
Speech recognition system based improved DTW algorithm
-
Jing, Z., Min, Z., 2010. Speech recognition system based improved DTW algorithm. In: Proc. Int. Conf. on Computer, Mechatronics, Control and Electronic Engineering CMCE-2010, Vol. 5, pp. 320-323.
-
(2010)
Proc. Int. Conf. on Computer, Mechatronics, Control and Electronic Engineering CMCE-2010
, vol.5
, pp. 320-323
-
-
Jing, Z.1
Min, Z.2
-
53
-
-
85030410304
-
Statistical language modeling using syntactically enhanced LSA
-
Mumbai, India
-
Kanejiya, D.P., Kumar, A., Prasad, S., 2003. Statistical language modeling using syntactically enhanced LSA. In: Proc. TIFR Workshop on Spoken Language Processing, Mumbai, India, pp. 93-100.
-
(2003)
Proc. TIFR Workshop on Spoken Language Processing
, pp. 93-100
-
-
Kanejiya, D.P.1
Kumar, A.2
Prasad, S.3
-
54
-
-
84893657385
-
Multilingual acoustic modeling using graphemes
-
Geneva, Switzerland
-
Kanthak, S., Ney, H., 2003. Multilingual acoustic modeling using graphemes. In: Eurospeech-2003, Geneva, Switzerland, pp. 1145-1148.
-
(2003)
Eurospeech-2003
, pp. 1145-1148
-
-
Kanthak, S.1
Ney, H.2
-
55
-
-
77956597632
-
Comparing SMT methods for automatic generation of pronunciation variants
-
Reykjavik, Iceland
-
Karanasou, P., Lamel, L., 2010. Comparing SMT methods for automatic generation of pronunciation variants. In: IceTAL 2010, Reykjavik, Iceland, p. 167.
-
(2010)
IceTAL 2010
, pp. 167
-
-
Karanasou, P.1
Lamel, L.2
-
56
-
-
84859039566
-
Very large vocabulary ASR for spoken Russian with syntactic and morphemic analysis
-
Florence, Italy
-
Karpov, A., Kipyatkova, I., Ronzhin, A., 2011. Very large vocabulary ASR for spoken Russian with syntactic and morphemic analysis. In: Proc. Interspeech'2011, Florence, Italy, pp. 3161-3164.
-
(2011)
Proc. Interspeech'2011
, pp. 3161-3164
-
-
Karpov, A.1
Kipyatkova, I.2
Ronzhin, A.3
-
57
-
-
84893652340
-
Large vocabulary Russian speech recognition using syntactico-statistical language modeling
-
http://dx.doi.org/10.1016/j.specom.2013.07.004
-
Karpov, A., Markov, K., Kipyatkova, I., Vazhenina, D., Ronzhin, A., 2013. Large vocabulary Russian speech recognition using syntactico-statistical language modeling. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.07.004.
-
(2013)
Speech Communication
-
-
Karpov, A.1
Markov, K.2
Kipyatkova, I.3
Vazhenina, D.4
Ronzhin, A.5
-
58
-
-
0004654418
-
Data-driven determination of appropriate dictionary units for Korean LVCSR
-
Kiecza, D., Schultz, T., Waibel, A., 1999. Data-driven determination of appropriate dictionary units for Korean LVCSR. In: Proceedings of the International Conference on Speech Processing, pp. 323-327.
-
(1999)
Proceedings of the International Conference on Speech Processing
, pp. 323-327
-
-
Kiecza, D.1
Schultz, T.2
Waibel, A.3
-
60
-
-
84872512624
-
Analysis of long-distance word dependencies and pronunciation variability at conversational Russian speech recognition
-
Wroclaw, Poland
-
Kipyatkova, I., Karpov, A., Verkhodanova, V., Zelezny, M., 2012. Analysis of long-distance word dependencies and pronunciation variability at conversational Russian speech recognition. In: Proc. FedCSIS-2012, Wroclaw, Poland, pp. 719-725.
-
(2012)
Proc. FedCSIS-2012
, pp. 719-725
-
-
Kipyatkova, I.1
Karpov, A.2
Verkhodanova, V.3
Zelezny, M.4
-
61
-
-
0031619917
-
Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks
-
Seattle
-
Köhler, J., 1998. Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks. In: Proc. ICASSP, Seattle, pp. 417-420.
-
(1998)
Proc. ICASSP
, pp. 417-420
-
-
Köhler, J.1
-
62
-
-
84893646836
-
The basic language resource kit (BLARK) as the first milestone for the language resources roadmap
-
Moscow, Russia
-
Krauwer, S., 2003. The basic language resource kit (BLARK) as the first milestone for the language resources roadmap. In: Proceedings of the 2003 International Workshop Speech and Computer SPECOM-2003, Moscow, Russia, pp. 8-15.
-
(2003)
Proceedings of the 2003 International Workshop Speech and Computer SPECOM-2003
, pp. 8-15
-
-
Krauwer, S.1
-
63
-
-
77949369404
-
Syntactic features for Arabic speech recognition
-
Merano, Italy
-
Kuo, H.-K.J., Mangu, L., Emami, A., Zitouni, I., Lee, Y.-S., 2009. Syntactic features for Arabic speech recognition. In: Proc. International Workshop ASRU'2009, Merano, Italy, pp. 327-332.
-
(2009)
Proc. International Workshop ASRU'2009
, pp. 327-332
-
-
Kuo, H.-K.J.1
Mangu, L.2
Emami, A.3
Zitouni, I.4
Lee, Y.-S.5
-
64
-
-
84858385446
-
Unlimited vocabulary speech recognition for agglutinative languages
-
NY, USA
-
Kurimo, M., Puurula, A., Arisoy, E., Siivola, V., Hirsimaki, T., Pylkkonen, J., Alumae, T., Saraclar, M., 2006. Unlimited vocabulary speech recognition for agglutinative languages. In: Proc. HLT-NAACL, NY, USA.
-
(2006)
Proc. HLT-NAACL
-
-
Kurimo, M.1
Puurula, A.2
Arisoy, E.3
Siivola, V.4
Hirsimaki, T.5
Pylkkonen, J.6
Alumae, T.7
Saraclar, M.8
-
65
-
-
44949103508
-
Unsupervised segmentation of words into morphemes - Morpho Challenge. Application to automatic speech recognition
-
Pittsburgh, PA, USA
-
Kurimo, M., et al., 2006. Unsupervised segmentation of words into morphemes - Morpho Challenge. Application to automatic speech recognition. In: Proc. Interspeech'06, Pittsburgh, PA, USA, pp. 1021-1024.
-
(2006)
Proc. Interspeech'06
, pp. 1021-1024
-
-
Kurimo, M.1
-
66
-
-
0000143923
-
Issues in large vocabulary multilingual speech recognition
-
Madrid
-
Lamel, L., Adda-Decker, M., Gauvain, J.L., 1995. Issues in large vocabulary multilingual speech recognition. In: Proc. Eurospeech, Madrid, pp. 185-189.
-
(1995)
Proc. Eurospeech
, pp. 185-189
-
-
Lamel, L.1
Adda-Decker, M.2
Gauvain, J.L.3
-
67
-
-
70450194704
-
Grapheme to phoneme conversion using an SMT system
-
Brighton, UK
-
Laurent, A., Deléglise, P., Meignier, S., 2009. Grapheme to phoneme conversion using an SMT system. In: Interspeech 2009, Brighton, UK, pp. 708-711.
-
(2009)
Interspeech 2009
, pp. 708-711
-
-
Laurent, A.1
Deléglise, P.2
Meignier, S.3
-
68
-
-
69249139569
-
Automatic speech recognition for under-resourced languages: Application to Vietnamese language
-
Le, V.-B., Besacier, L., 2009. Automatic speech recognition for under-resourced languages: application to Vietnamese language. IEEE Transactions on Audio, Speech and Language Processing 17 (8), 1471-1482.
-
(2009)
IEEE Transactions on Audio, Speech and Language Processing
, vol.17
, Issue.8
, pp. 1471-1482
-
-
Le, V.-B.1
Besacier, L.2
-
69
-
-
33646820243
-
Using the web for fast language model construction in minority languages
-
Geneva, Switzerland
-
Le, V.B., Bigi, B., Besacier, L., Castelli, E., 2003. Using the Web for fast language model construction in minority languages. In: Eurospeech'03, Geneva, Switzerland, pp. 3117-3120.
-
(2003)
Eurospeech'03
, pp. 3117-3120
-
-
Le, V.B.1
Bigi, B.2
Besacier, L.3
Castelli, E.4
-
70
-
-
85008025475
-
Probabilistic modeling of Korean morphology
-
Lee, D.-G., Rim, H.-C., 2009. Probabilistic modeling of Korean morphology. IEEE Transactions on Audio, Speech & Language Processing 17 (5), 945-955.
-
(2009)
IEEE Transactions on Audio, Speech & Language Processing
, vol.17
, Issue.5
, pp. 945-955
-
-
Lee, D.-G.1
Rim, H.-C.2
-
71
-
-
70450181523
-
Cross-language bootstrapping for unsupervised acoustic model training: Rapid development of a Polish speech recognition system
-
Brighton, UK
-
Loof, J., Gollan, C., Ney, H., 2009. Cross-language bootstrapping for unsupervised acoustic model training: rapid development of a Polish speech recognition system. In: Interspeech 2009. Brighton, UK.
-
(2009)
Interspeech 2009
-
-
Loof, J.1
Gollan, C.2
Ney, H.3
-
72
-
-
33646070918
-
Modeling syntax of free word-order languages: Dependency analysis by reduction
-
Springer LNAI 3658, Karlovy Vary, Czech Republic
-
Lopatková, M., Plátek, M., Kuboň, V., 2005. Modeling syntax of free word-order languages: dependency analysis by reduction. In: Proc. TSD'2005, Springer LNAI 3658, Karlovy Vary, Czech Republic, pp. 140-147.
-
(2005)
Proc. TSD'2005
, pp. 140-147
-
-
Lopatková, M.1
Plátek, M.2
Kuboň, V.3
-
73
-
-
56149124525
-
Morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - Like Hungarian
-
Antwerp, Belgium
-
Mihajlik, P., Fegyó, T., Tüske, Z., Ircing, P., 2007. Morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian. In: Interspeech'07, Antwerp, Belgium.
-
(2007)
Interspeech'07
-
-
Mihajlik, P.1
Fegyó, T.2
Tüske, Z.3
Ircing, P.4
-
74
-
-
79959829092
-
Recurrent neural network based language model
-
Makuhari, Japan
-
Mikolov, T., Karafiat, M., Burget, L., Cernocky, J., Khudanpur, S., 2010. Recurrent neural network based language model. In: Proc. INTERSPEECH-2010, Makuhari, Japan, pp. 1045-1048.
-
(2010)
Proc. INTERSPEECH-2010
, pp. 1045-1048
-
-
Mikolov, T.1
Karafiat, M.2
Burget, L.3
Cernocky, J.4
Khudanpur, S.5
-
75
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
Mohamed, A., Dahl, G.E., Hinton, G., 2012. Acoustic modeling using deep belief networks. IEEE Transactions on Audio, Speech, and Language Processing 20 (1), 14-22.
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.E.2
Hinton, G.3
-
77
-
-
33745195515
-
Language model adaptation with additional text generated by machine translation
-
Taipei, Taiwan
-
Nakajima, H., Yamamoto, H., Watanabe, T., 2002. Language model adaptation with additional text generated by machine translation. In: COLING 2002, Vol. 2, Taipei, Taiwan, pp. 716-722.
-
(2002)
COLING 2002
, vol.2
, pp. 716-722
-
-
Nakajima, H.1
Yamamoto, H.2
Watanabe, T.3
-
78
-
-
33646767826
-
A new ASR evaluation measure and minimum Bayes-risk decoding for open-domain speech understanding
-
PA, USA
-
Nanjo, H., Kawahara, T., 2005. A new ASR evaluation measure and minimum Bayes-risk decoding for open-domain speech understanding. In: Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP-2005, PA, USA, pp. 1053-1056.
-
(2005)
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing ICASSP-2005
, pp. 1053-1056
-
-
Nanjo, H.1
Kawahara, T.2
-
80
-
-
67649515539
-
Morphological random forests for language modeling of inflectional languages
-
Goa, India
-
Oparin, I., Glembek, O., Burget, L., Černocký, J., 2008. Morphological random forests for language modeling of inflectional languages. In: Proc. IEEE Workshop on Spoken Language Technology SLT'08, Goa, India.
-
(2008)
Proc. IEEE Workshop on Spoken Language Technology SLT'08
-
-
Oparin, I.1
Glembek, O.2
Burget, L.3
Černocký, J.4
-
81
-
-
79951777091
-
Toward better crowdsourced transcription: Transcription of a year of the Let's Go bus information system data
-
Berkeley, California, December 2010
-
Parent, G., Eskenazi, M., 2010. Toward better crowdsourced transcription: transcription of a year of the Let's Go bus information system data. In: Proceedings of IEEE Workshop on Spoken Language Technology, Berkeley, California, December 2010, pp. 312-317.
-
(2010)
Proceedings of IEEE Workshop on Spoken Language Technology
, pp. 312-317
-
-
Parent, G.1
Eskenazi, M.2
-
82
-
-
84892473615
-
A comparative study of speech and dialed input voice interfaces in rural India
-
ACM, New York, NY, USA
-
Patel, N., Agarwal, S., Rajput, N., Nanavati, A., Dave, P., Parikh, T.S., 2009. A comparative study of speech and dialed input voice interfaces in rural India. In: CHI'09: Proceedings of the 27th international conference on Human factors in computing systems. ACM, New York, NY, USA, pp. 51-54.
-
(2009)
CHI'09: Proceedings of the 27th International Conference on Human Factors in Computing Systems
, pp. 51-54
-
-
Patel, N.1
Agarwal, S.2
Rajput, N.3
Nanavati, A.4
Dave, P.5
Parikh, T.S.6
-
83
-
-
77953970245
-
Avaaj Otalo: A field study of an interactive voice forum for small farmers in rural India
-
Patel, N., Chittamuru, D., Jain, A., Dave, P., Parikh, T.S., 2010. Avaaj Otalo: A field study of an interactive voice forum for small farmers in rural India. In: CHI. ACM, pp. 733-742.
-
(2010)
CHI. ACM
, pp. 733-742
-
-
Patel, N.1
Chittamuru, D.2
Jain, A.3
Dave, P.4
Parikh, T.S.5
-
84
-
-
84893644519
-
Investigating automatic decomposition for ASR in less represented languages
-
Pittsburgh
-
Pellegrini, T., Lamel, L., 2006. Investigating automatic decomposition for ASR in less represented languages. In: ICSLP'06, Pittsburgh.
-
(2006)
ICSLP'06
-
-
Pellegrini, T.1
Lamel, L.2
-
85
-
-
69249115863
-
Are audio or textual training data more important for ASR in less-represented languages?
-
Hanoi, Vietnam
-
Pellegrini, T., Lamel, L., 2008. Are audio or textual training data more important for ASR in less-represented languages? In: SLTU'08, Hanoi, Vietnam.
-
(2008)
SLTU'08
-
-
Pellegrini, T.1
Lamel, L.2
-
86
-
-
85008009596
-
Automatic word decompounding for ASR in a morphologically rich language: Application to Amharic
-
Pellegrini, T., Lamel, L., 2009. Automatic word decompounding for ASR in a morphologically rich language: application to Amharic. IEEE Transactions on Audio, Speech & Language Processing 17 (5), 863-873.
-
(2009)
IEEE Transactions on Audio, Speech & Language Processing
, vol.17
, Issue.5
, pp. 863-873
-
-
Pellegrini, T.1
Lamel, L.2
-
87
-
-
84858976609
-
Cross-lingual portability of Chinese and English neural network features for French and German LVCSR
-
USA
-
Plahl, C., Schlueter, R., Ney, H., 2011. Cross-lingual portability of Chinese and English neural network features for French and German LVCSR. In: Proc. ASRU, USA.
-
(2011)
Proc. ASRU
-
-
Plahl, C.1
Schlueter, R.2
Ney, H.3
-
88
-
-
84876792117
-
Fast syntactic analysis for statistical language modeling via substructure sharing and uptraining
-
Jeju, Korea
-
Rastrow, A., Dredze, M., Khudanpur, S., 2012. Fast syntactic analysis for statistical language modeling via substructure sharing and uptraining. In: Proc. 50th Annual Meeting of Association for Computational Linguistics ACL'2012, Jeju, Korea, pp. 175-183.
-
(2012)
Proc. 50th Annual Meeting of Association for Computational Linguistics ACL'2012
, pp. 175-183
-
-
Rastrow, A.1
Dredze, M.2
Khudanpur, S.3
-
90
-
-
34250004339
-
Large vocabulary continuous speech recognition of an inflected language using stems and endings
-
DOI 10.1016/j.specom.2007.02.010, PII S0167639307000428
-
Rotovnik, T., Maucec, M.S., Kacic, Z., 2007. Large vocabulary continuous speech recognition of an inflected language using stems and endings. Speech Communication 49 (6), 437-452.
-
(2007)
Speech Communication
, vol.49
, Issue.6
, pp. 437-452
-
-
Rotovnik, T.1
Maucec, M.S.2
Kacic, Z.3
-
91
-
-
84886596580
-
Developing a multilingual telephone based information retrieval system in African languages
-
Roux, J.C., Botha, E.C., du Preez, J.A., 2000. Developing a multilingual telephone based information retrieval system in African languages. In: Proceedings of the Second International Conference on Language Resources and Evaluation, pp. 975-980.
-
(2000)
Proceedings of the Second International Conference on Language Resources and Evaluation
, pp. 975-980
-
-
Roux, J.C.1
Botha, E.C.2
Du Preez, J.A.3
-
92
-
-
78049399086
-
Morphology-based and subword language modeling for Turkish speech recognition
-
Sak, H., Saraclar, M., Güngör, T., 2010. Morphology-based and subword language modeling for Turkish speech recognition. In: ICASSP 2010, pp. 5402-5405.
-
(2010)
ICASSP 2010
, pp. 5402-5405
-
-
Sak, H.1
Saraclar, M.2
Güngör, T.3
-
93
-
-
34547535012
-
Joint morphological-lexical language modeling (JMLLM) for Arabic
-
Sarikaya, R., Afify, M., Gao, Y., 2007. Joint morphological-lexical language modeling (JMLLM) for Arabic. In: Proc. ICASSP'07, Vol. 4, pp. 181-184.
-
(2007)
Proc. ICASSP'07
, vol.4
, pp. 181-184
-
-
Sarikaya, R.1
Afify, M.2
Gao, Y.3
-
94
-
-
79959851710
-
Wiktionary as a source for automatic pronunciation extraction
-
Makuhari, Japan, 26-30 September 2010
-
Schlippe, T., Ochs, S., Schultz, T., 2010. Wiktionary as a source for automatic pronunciation extraction. In: Interspeech 2010, Makuhari, Japan, 26-30 September 2010.
-
(2010)
Interspeech 2010
-
-
Schlippe, T.1
Ochs, S.2
Schultz, T.3
-
95
-
-
84867605828
-
Grapheme-to-phoneme model generation for Indo-European languages
-
Kyoto, Japan, 25-30 March 2012
-
Schlippe, T., Ochs, S., Schultz, T., 2012a. Grapheme-to-phoneme model generation for Indo-European languages. In: ICASSP 2012, Kyoto, Japan, 25-30 March 2012.
-
(2012)
ICASSP 2012
-
-
Schlippe, T.1
Ochs, S.2
Schultz, T.3
-
96
-
-
84878526331
-
Automatic error recovery for pronunciation dictionaries
-
Portland, Oregon, 9-13 September 2012
-
Schlippe, T., Ochs, S., Vu, N.T., Schultz, T., 2012b. Automatic error recovery for pronunciation dictionaries. In: Interspeech 2012, Portland, Oregon, 9-13 September 2012.
-
(2012)
Interspeech 2012
-
-
Schlippe, T.1
Ochs, S.2
Vu, N.T.3
Schultz, T.4
-
97
-
-
84893706991
-
Web-based tools and methods for rapid pronunciation dictionary creation
-
http://dx.doi.org/10.1016/j.specom.2013.06.015
-
Schlippe, T., Ochs, S., Schultz, T., 2013. Web-based tools and methods for rapid pronunciation dictionary creation. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.06.015.
-
(2013)
Speech Communication
-
-
Schlippe, T.1
Ochs, S.2
Schultz, T.3
-
98
-
-
85009274666
-
GlobalPhone: A multilingual speech and text database developed at Karlsruhe University
-
Schultz, T., 2002. GlobalPhone: A multilingual speech and text database developed at Karlsruhe University. In: ICSLP, pp. 345-348.
-
(2002)
ICSLP
, pp. 345-348
-
-
Schultz, T.1
-
99
-
-
85013700737
-
-
Tanja Schultz, Katrin Kirchhoff (Eds.), Elsevier, Academic Press, ISBN 13: 978-0-12-088501-5, 2006
-
Schultz, T., 2006. Multilingual speech processing. In: Tanja Schultz, Katrin Kirchhoff (Eds.), Elsevier, Academic Press, ISBN 13: 978-0-12-088501-5, 2006.
-
(2006)
Multilingual Speech Processing
-
-
Schultz, T.1
-
100
-
-
43849102326
-
SPICE: Web-based tools for rapid language adaptation in speech processing systems
-
Antwerp, Belgium
-
Schultz, T., Black, A.W., Badaskar, S., Hornyak, M., Kominek, J., 2007. SPICE: web-based tools for rapid language adaptation in speech processing systems. In: Interspeech 2007, Antwerp, Belgium.
-
(2007)
Interspeech 2007
-
-
Schultz, T.1
Black, A.W.2
Badaskar, S.3
Hornyak, M.4
Kominek, J.5
-
101
-
-
84890463379
-
GlobalPhone: A multilingual text & speech database in 20 languages
-
Vancouver, Canada
-
Schultz, T., Vu, N.T., Schlippe, T., 2013. GlobalPhone: A multilingual text & speech database in 20 languages. In: ICASSP 2013, Vancouver, Canada.
-
(2013)
ICASSP 2013
-
-
Schultz, T.1
Vu, N.T.2
Schlippe, T.3
-
102
-
-
0001216191
-
Language independent and language adaptive LVCSR
-
Sydney
-
Schultz, T., Waibel, A., 1998. Language independent and language adaptive LVCSR. In: Proc. ICSLP, Sydney, pp. 1819-1822.
-
(1998)
Proc. ICSLP
, pp. 1819-1822
-
-
Schultz, T.1
Waibel, A.2
-
103
-
-
0035426931
-
Language-independent and language-adaptive acoustic modeling for speech recognition
-
DOI 10.1016/S0167-6393(00)00094-7, PII S0167639300000947
-
Schultz, T., Waibel, A., 2001. Language independent and language adaptive acoustic modeling for speech recognition. Speech Communication 35, 31-51. (Pubitemid 32599645)
-
(2001)
Speech Communication
, vol.35
, Issue.1-2
, pp. 31-51
-
-
Schultz, T.1
Waibel, A.2
-
104
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
HI, USA
-
Seide, F., Li, G., Chen, X., Yu, D., 2011. Feature engineering in context-dependent deep neural networks for conversational speech transcription. In: Proc. ASRU-2011 International Workshop, HI, USA, pp. 24-29.
-
(2011)
Proc. ASRU-2011 International Workshop
, pp. 24-29
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
105
-
-
84867333761
-
Universal attribute characterization of spoken languages for automatic spoken language recognition
-
Siniscalchi, S.M., Reed, J., Svendsen, T., Lee, C.-H., 2013. Universal attribute characterization of spoken languages for automatic spoken language recognition. Computer Speech & Language 27 (1), 209-227.
-
(2013)
Computer Speech & Language
, vol.27
, Issue.1
, pp. 209-227
-
-
Siniscalchi, S.M.1
Reed, J.2
Svendsen, T.3
Lee, C.-H.4
-
106
-
-
34047146422
-
Robust ASR using Support Vector Machines
-
DOI 10.1016/j.specom.2007.01.013, PII S0167639307000246
-
Solera-Urena, R., Martin-Iglesias, D., Gallardo-Antolin, A., Pelaez-Moreno, C., Diaz-de-Maria, F., 2007. Robust ASR using support vector machines. Speech Communication 49 (4), 253-267. (Pubitemid 46517709)
-
(2007)
Speech Communication
, vol.49
, Issue.4
, pp. 253-267
-
-
Solera-Urena, R.1
Martin-Iglesias, D.2
Gallardo-Antolin, A.3
Pelaez-Moreno, C.4
Diaz-de-Maria, F.5
-
107
-
-
84874250689
-
Word segmentation through cross-lingual word-to-phoneme alignment
-
Miami, Florida, 2-5 December 2012
-
Stahlberg, F., Schlippe, T., Vogel, S., Schultz, T., 2012. Word segmentation through cross-lingual word-to-phoneme alignment. In: Proceedings of the Fourth IEEE Workshop on Spoken Language Technology (SLT 2012), Miami, Florida, 2-5 December 2012.
-
(2012)
Proceedings of the Fourth IEEE Workshop on Spoken Language Technology (SLT 2012)
-
-
Stahlberg, F.1
Schlippe, T.2
Vogel, S.3
Schultz, T.4
-
108
-
-
84883149008
-
Pronunciation extraction from phoneme sequences through cross-lingual word-to-phoneme alignment
-
Tarragona, Spain, 29-31 July 2013
-
Stahlberg, F., Schlippe, T., Vogel, S., Schultz, T., 2013. Pronunciation extraction from phoneme sequences through cross-lingual word-to-phoneme alignment. In: Proceedings of the 1st International Conference on Statistical Language and Speech Processing (SLSP 2013), Tarragona, Spain, 29-31 July 2013.
-
(2013)
Proceedings of the 1st International Conference on Statistical Language and Speech Processing (SLSP 2013)
-
-
Stahlberg, F.1
Schlippe, T.2
Vogel, S.3
Schultz, T.4
-
109
-
-
85030406878
-
Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables
-
Stephenson, T.A., Escofet, J., Magimai-Doss, M., Bourlard, H., 2002. Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables, Technical Report Idiap-RR-24-2002, p. 10.
-
(2002)
Technical Report Idiap-RR-24-2002
, p. 10
-
-
Stephenson, T.A.1
Escofet, J.2
Magimai-Doss, M.3
Bourlard, H.4
-
110
-
-
33947619591
-
Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons
-
Stolcke, A., Grezl, F., Hwang, M.-Y., Lei, X., Morgan, N., Vergyri, D., 2006. Cross-domain and cross-lingual portability of acoustic features estimated by multilayer perceptrons. In: Proc. ICASSP 2006.
-
(2006)
Proc. ICASSP 2006
-
-
Stolcke, A.1
Grezl, F.2
Hwang, M.-Y.3
Lei, X.4
Morgan, N.5
Vergyri, D.6
-
111
-
-
85133311295
-
Integrating Thai grapheme based acoustic models into the ML-mix framework - For language independent and cross-language ASR
-
Hanoi, Vietnam
-
Stüker, S., 2008. Integrating Thai grapheme based acoustic models into the ML-mix framework - for language independent and cross-language ASR. In: SLTU'08, Hanoi, Vietnam.
-
(2008)
SLTU'08
-
-
Stüker, S.1
-
112
-
-
0141591550
-
Multilingual articulatory features
-
Stüker, S., Schultz, T., Metze, F., Waibel, A., 2003. Multilingual articulatory features. In: ICASSP 2003.
-
(2003)
ICASSP 2003
-
-
Stüker, S.1
Schultz, T.2
Metze, F.3
Waibel, A.4
-
113
-
-
0141591550
-
Multilingual articulatory features
-
Stüker, S., Schultz, T., Metze, F., Waibel, A., 2003. Multilingual articulatory features. In: Proceedings. ICASSP'03 IEEE International Conference on Acoustics, Speech, and Signal Processing.
-
(2003)
Proceedings. ICASSP'03 IEEE International Conference on Acoustics, Speech, and Signal Processing
-
-
Stüker, S.1
Schultz, T.2
Metze, F.3
Waibel, A.4
-
114
-
-
70450198124
-
Human translations guided language discovery for ASR systems
-
Brighton, UK
-
Stüker, S., Besacier, L., Waibel, A., 2009. Human translations guided language discovery for ASR systems. In: InterSpeech-2009, Brighton, UK.
-
(2009)
InterSpeech-2009
-
-
Stüker, S.1
Besacier, L.2
Waibel, A.3
-
115
-
-
70450172181
-
Localization of speech recognition in spoken dialog systems: How machine translation can make our lives
-
Brighton, UK
-
Suenderman, K., Liscombe, J., 2009. Localization of speech recognition in spoken dialog systems: how machine translation can make our lives. In: Interspeech 2009, Brighton, UK, pp. 1475-1478.
-
(2009)
Interspeech 2009
, pp. 1475-1478
-
-
Suenderman, K.1
Liscombe, J.2
-
116
-
-
0141703236
-
Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR
-
Hong Kong, China
-
Szarvas, M., Furui, S., 2003. Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR. In: Proc. ICASSP, Hong Kong, China, pp. 368-371.
-
(2003)
Proc. ICASSP
, pp. 368-371
-
-
Szarvas, M.1
Furui, S.2
-
117
-
-
85071192719
-
Syllable-based and hybrid acoustic models for Amharic speech recognition
-
Cape Town, South Africa
-
Tachbelie, M., Abate, S.T., Besacier, L., Rossato, S., 2012. Syllable-based and hybrid acoustic models for Amharic speech recognition. In: SLTU - Workshop on Spoken Language Technologies for Under-Resourced Languages, Cape Town, South Africa.
-
(2012)
SLTU - Workshop on Spoken Language Technologies for Under-Resourced Languages
-
-
Tachbelie, M.1
Abate, S.T.2
Besacier, L.3
Rossato, S.4
-
118
-
-
84893678734
-
Using different acoustic, lexical and language modeling units for ASR of an under-resourced language - Amharic
-
http://dx.doi.org/10.1016/j.specom.2013.01.008
-
Tachbelie, M., Abate, S.T., Besacier, L., 2013. Using different acoustic, lexical and language modeling units for ASR of an under-resourced language - Amharic. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.01.008.
-
(2013)
Speech Communication
-
-
Tachbelie, M.1
Abate, S.T.2
Besacier, L.3
-
120
-
-
84867606552
-
Multilingual MLP features for low-resource LVCSR systems
-
Japan
-
Thomas, S., Ganapathy, S., Hermansky, H., 2012a. Multilingual MLP features for low-resource LVCSR systems. In: Proc. ICASSP, Japan.
-
(2012)
Proc. ICASSP
-
-
Thomas, S.1
Ganapathy, S.2
Hermansky, H.3
-
121
-
-
84878392008
-
Data-driven posterior features for low resource speech recognition applications
-
USA
-
Thomas, S., Ganapathy, S., Jansen, A., Hermansky, H., 2012b. Data-driven posterior features for low resource speech recognition applications. In: Proc. Interspeech, USA.
-
(2012)
Proc. Interspeech
-
-
Thomas, S.1
Ganapathy, S.2
Jansen, A.3
Hermansky, H.4
-
122
-
-
84858985238
-
Cross-lingual portability of MLP-based tandem features - A case study for English and Hungarian
-
Toth, L., Frankel, J., Gosztolya, G., King, S., 2008. Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian. In: Proc. Interspeech.
-
(2008)
Proc. Interspeech
-
-
Toth, L.1
Frankel, J.2
Gosztolya, G.3
King, S.4
-
123
-
-
0035101535
-
A survey of hybrid ANN/HMM models for automatic speech recognition
-
Trentin, E., Gori, M., 2001. A survey of hybrid ANN/HMM models for automatic speech recognition. Neurocomputing 37 (1), 91-126.
-
(2001)
Neurocomputing
, vol.37
, Issue.1
, pp. 91-126
-
-
Trentin, E.1
Gori, M.2
-
124
-
-
85133315126
-
Pooling ASR data for closely related languages
-
Penang, Malaysia, May 2010
-
van Heerden, C., Kleynhans, N., Barnard, E., Davel, M., 2010. Pooling ASR data for closely related languages. In: Proceedings of the Workshop on Spoken Languages Technologies for Under-Resourced Languages (SLTU 2010), Penang, Malaysia, May 2010, pp. 17-23.
-
(2010)
Proceedings of the Workshop on Spoken Languages Technologies for Under-Resourced Languages (SLTU 2010)
, pp. 17-23
-
-
Van Heerden, C.1
Kleynhans, N.2
Barnard, E.3
Davel, M.4
-
125
-
-
84893679840
-
Predicting utterance pitch targets in Yoruba for tone realisation in speech synthesis
-
http://dx.doi.org/10.1016/j.specom.2013.01.009
-
van Niekerk, D.R., Barnard, E., 2013. Predicting utterance pitch targets in Yoruba for tone realisation in speech synthesis. Speech Communication. http://dx.doi.org/10.1016/j.specom.2013.01.009.
-
(2013)
Speech Communication
-
-
Van Niekerk, D.R.1
Barnard, E.2
-
126
-
-
85009110467
-
Morphology-based language modeling for Arabic speech recognition
-
Vergyri, D., Kirchhoff, K., Duh, K., Stolcke, A., 2004. Morphology-based language modeling for Arabic speech recognition. In: Proc. ICSLP'04, pp. 2245-2248.
-
(2004)
Proc. ICSLP'04
, pp. 2245-2248
-
-
Vergyri, D.1
Kirchhoff, K.2
Duh, K.3
Stolcke, A.4
-
127
-
-
84874226274
-
The language-independent bottleneck features
-
USA
-
Vesely, K., Karafiat, M., Grezl, F., Janda, M., Egorova, E., 2012. The language-independent bottleneck features. In: Proc. SLT, USA.
-
(2012)
Proc. SLT
-
-
Vesely, K.1
Karafiat, M.2
Grezl, F.3
Janda, M.4
Egorova, E.5
-
128
-
-
79951796711
-
Multilingual A-stabil: A new confidence score for multilingual unsupervised training
-
USA
-
Vu, N.T., Kraus, F., Schultz, T., 2010. Multilingual A-stabil: A new confidence score for multilingual unsupervised training. In: Proc. SLT, USA.
-
(2010)
Proc. SLT
-
-
Vu, N.T.1
Kraus, F.2
Schultz, T.3
-
129
-
-
84865764419
-
Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training
-
Italy
-
Vu, N.T., Kraus, F., Schultz, T., 2011. Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training. In: Proc. Interspeech, Italy.
-
(2011)
Proc. Interspeech
-
-
Vu, N.T.1
Kraus, F.2
Schultz, T.3
-
130
-
-
84890464069
-
Multilingual bottle-neck feature for under resourced languages
-
South Africa
-
Vu, N.T., Metze, F., Schultz, T., 2012a. Multilingual bottle-neck feature for under resourced languages. In: Proc. SLTU, South Africa.
-
(2012)
Proc. SLTU
-
-
Vu, N.T.1
Metze, F.2
Schultz, T.3
-
131
-
-
84878559540
-
An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance
-
USA
-
Vu, N.T., Breiter, W., Metze, F., Schultz, T., 2012b. An investigation on initialization schemes for multilayer perceptron training using multilingual data and their effect on ASR performance. In: Proc. Interspeech, USA.
-
(2012)
Proc. Interspeech
-
-
Vu, N.T.1
Breiter, W.2
Metze, F.3
Schultz, T.4
-
132
-
-
84856280064
-
An evaluation of cross-language adaptation for rapid HMM development in a new language
-
Adelaide
-
Wheatley, B., Kondo, K., Anderson, W., Muthusamy, Y., 1994. An evaluation of cross-language adaptation for rapid HMM development in a new language. In: Proc. ICASSP, Adelaide, pp. 237-240.
-
(1994)
Proc. ICASSP
, pp. 237-240
-
-
Wheatley, B.1
Kondo, K.2
Anderson, W.3
Muthusamy, Y.4
-
134
-
-
0034848043
-
Efficient class-based language modelling for very large vocabularies
-
Whittaker, E.W.D., Woodland, P.C., 2001. Efficient class-based language modelling for very large vocabularies. In: ICASSP-2001, Salt Lake City, USA, pp. 545-548. (Pubitemid 32839308)
-
(2001)
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
, vol.1
, pp. 545-548
-
-
Whittaker, E.W.D.1
Woodland, P.C.2
-
135
-
-
55849099996
-
Vowel variations in Southern Sotho: An acoustical investigation
-
Wissing, D., Barnard, E., 2008. Vowel variations in Southern Sotho: an acoustical investigation. Southern African Linguistics and Applied Language Studies 26 (2), 255-265.
-
(2008)
Southern African Linguistics and Applied Language Studies
, vol.26
, Issue.2
, pp. 255-265
-
-
Wissing, D.1
Barnard, E.2
-
136
-
-
0030718943
-
Multilingual large vocabulary speech recognition: The European SQALE project
-
Young, S.J., Adda-Decker, M., Aubert, X., Dugast, C., Gauvain, J.L., Kershaw, D.J., Lamel, L., Leeuwen, D.A., Pye, D., Robinson, A.J., Steeneken, H.J.M., Woodland, P.C., 1997. Multilingual large vocabulary speech recognition: the European SQALE project. Computer Speech & Language 11, 73-89. (Pubitemid 127375894)
-
(1997)
Computer Speech and Language
, vol.11
, Issue.1
, pp. 73-89
-
-
Young, S.J.1
Adda-Decker, M.2
Aubert, X.3
Dugast, C.4
Gauvain, J.-L.5
Kershaw, D.J.6
Lamel, L.7
Leeuwen, D.A.8
Pye, D.9
Robinson, A.J.10
Steeneken, H.J.M.11
Woodland, P.C.12
-
137
-
-
85075927145
-
HMMs and related speech recognition technologies
-
Springer-Verlag, Berlin Heidelberg
-
Young, S., 2008. HMMs and related speech recognition technologies. In: Springer Handbook of Speech Processing. Springer-Verlag, Berlin Heidelberg, pp. 539-557.
-
(2008)
Springer Handbook of Speech Processing
, pp. 539-557
-
-
Young, S.1
-
138
-
-
84867329143
-
Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition
-
Yu, D., Siniscalchi, S.M., Deng, L., Lee, C.-H., 2012. Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition. In: Proc. ICASSP-2012, pp. 4169-4172.
-
(2012)
Proc. ICASSP-2012
, pp. 4169-4172
-
-
Yu, D.1
Siniscalchi, S.M.2
Deng, L.3
Lee, C.-H.4