SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2015-August, Issue , 2015, Pages 4225-4229

Grapheme-to-phoneme conversion using Long Short-Term Memory recurrent neural networks

(4) Rao, Kanishka a Peng, Fuchun a Sak, Hasim a Beaufays, Francoise a

a GOOGLE INC (United States)

Author keywords

CTC; G2P; LSTM; pronunciation; RNN; speech recognition

Indexed keywords

AUDIO SIGNAL PROCESSING; BRAIN; CHARACTER RECOGNITION; HYBRID SYSTEMS; SPEECH; SPEECH COMMUNICATION; SPEECH RECOGNITION;

GRAPHEME TO PHONEMES; GRAPHEME-TO-PHONEME CONVERSION; LSTM; PRONUNCIATION; RECURRENT NEURAL NETWORK (RNN); TEMPORAL CLASSIFICATION; TEXT-TO-SPEECH SYSTEM; TRADITIONAL JOINTS;

LONG SHORT-TERM MEMORY;

EID: 84946032010 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178767 Document Type: Conference Paper

Times cited : (194)

References (18)

1
- 84910048048
- P ronunciation learning for named-entities through crowd-sourcing
- A. Rutherford, F. P eng, and F. Beaufays, "P ronunciation learning for named-entities through crowd-sourcing, " in Proceedings of InterSpeech, 2014
- (2014) Proceedings of InterSpeech
- Rutherford, A.¹ Eng, F.P.² Beaufays, F.³

2
- 41049105254
- Joint-sequence models for grapheme-to-phoneme conversion
- M. Bisani and H. Ney, "joint-sequence models for grapheme-to-phoneme conversion, " Speech Communications, vol. 50, no. 5, pp. 434-451, 2008
- (2008) Speech Communications , vol.50 , Issue.5 , pp. 434-451
- Bisani, M.¹ Ney, H.²

3
- 0031573117
- Long short-term memory
- S. Hochreiter and]. Schmidhuber, "Long short-term memory," Neural Computation, vol. 9(8), pp. 17351780, 1997
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 17351780
- Hochreiter, S.¹ Schmidhuber²

4
- 84910064367
- Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion
- L. Galescu and]. F. Allen, "Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion, " in Proceedings of InterSpeech,2002
- (2002) Proceedings of InterSpeech
- Galescu, L.¹ Allen, F.²

5
- 84878571769
- Improving wfst-based g2p conversion with alignment constraints and rnnlm n-best rescoring
- J. R. Novak et aI., "Improving wfst-based g2p conversion with alignment constraints and rnnlm n-best rescoring, " in Proceedings of InterSpeech, 2012
- (2012) Proceedings of InterSpeech
- Novak et aI, J.R.¹

6
- 84906272849
- Failure transitions for joint n-gram models and g2p conversion
- J. R. Novak, N. Minematu, and K. Hirose, "Failure transitions for joint n-gram models and g2p conversion, " in Proceedings of InterSpeech, 2013
- (2013) Proceedings of InterSpeech
- Novak, J.R.¹ Minematu, N.² Hirose, K.³

7
- 33745216153
- Conditional and joint models for graphemeto-phoneme conversion
- S. F. Chen, "Conditional and joint models for graphemeto-phoneme conversion, " in Proceedings ofInterSpeech, 2003
- (2003) Proceedings OfInterSpeech
- Chen, S.F.¹

8
- 78650966704
- Letter-to-sound pronunciation prediction using conditional random fields
- D. Wang and S. King, "Letter-to-sound pronunciation prediction using conditional random fields, " IEEE Signal Processing Letters, vol. 18 (2), pp. 122-125, 20 1 1
- (2011) IEEE Signal Processing Letters , vol.18 , Issue.2 , pp. 122-125
- Wang, D.¹ King, S.²

9
- 84906222525
- Structure learning in hidden conditional random fields for grapheme-to-phoneme conversion
- P. Lehnen, A. Allauzen, T. Lavergne, F. Yvon, S. Hahn, and H. Ney, "Structure learning in hidden conditional random fields for grapheme-to-phoneme conversion, " in Proceedings of InterSpeech, 2013
- (2013) Proceedings of InterSpeech
- Lehnen, P.¹ Allauzen, A.² Lavergne, T.³ Yvon, F.⁴ Hahn, S.⁵ Ney, H.⁶

10
- 79952264781
- Joint processing and discriminative training for letter-tophoneme conversion
- S. Jiampojamarn, C. Cherry, and G. Kondrak, "Joint processing and discriminative training for letter-tophoneme conversion, " in Proceedings of ACL, 2008, pp. 905-9 13
- (2008) Proceedings of ACL , pp. 905-913
- Jiampojamarn, S.¹ Cherry, C.² Kondrak, G.³

11
- 84937940425
- Ph.D. thesis, Tampere University of Technology
- E. B. Bilcu, Text-to-Phoneme Mapping Using Neural Networks, Ph.D. thesis, Tampere University of Technology, 2008
- (2008) Text-to-Phoneme Mapping Using Neural Networks
- Bilcu, E.B.¹

12
- 84910037610
- Encoding linear models as weighted finite-state transducers
- K. Wu et aI., "Encoding linear models as weighted finite-state transducers, " in Proceedings of InterSpeech, 2014
- (2014) Proceedings of InterSpeech
- Wu et aI, K.¹

13
- 84878590885
- Comparison of grapheme-to-phoneme methods on large pronunciation dictionaries and lvcsr tasks
- S. Hahn, P. Vozila, and M. Bisani, "Comparison of grapheme-to-phoneme methods on large pronunciation dictionaries and lvcsr tasks, " in Proceedings of InterSpeech, 2012
- (2012) Proceedings of InterSpeech
- Hahn, S.¹ Vozila, P.² Bisani, M.³

14
- 84890543083
- Speech recognition with deep recurrent neural networks
- A. Graves, A. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks, " in Proceedings of ICASSP, 2013, pp. 6645-6649
- (2013) Proceedings of ICASSP , pp. 6645-6649
- Graves, A.¹ Mohamed, A.² Hinton, G.³

15
- 84908677215
- Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition
- H. Sak, A. Senior, and F. Beau fays, "Long short-term memory based recurrent neural network architectures for large vocabulary speech recognition, " in Proceedings of InterSpeech, 2014
- (2014) Proceedings of InterSpeech
- Sak, H.¹ Senior, A.² Beau Fays, F.³

16
- 71249112130
- Offline handwriting recognition with multidimensional recurrent neural networks
- A. Graves and]. Schmidhuber, "Offline handwriting recognition with multidimensional recurrent neural networks, " in Proceedings of NIPS, 2008, pp. 545-552
- (2008) Proceedings of NIPS , pp. 545-552
- Graves, A.¹ Schmidhuber²

17
- 84878402147
- Lstm neural networks for language modeling
- M. Sundermeyer, R. Schluter, and H. Ney, "Lstm neural networks for language modeling, " in Proceedings of InterSpeech, 2012, pp. 194-197
- (2012) Proceedings of InterSpeech , pp. 194-197
- Sundermeyer, M.¹ Schluter, R.² Ney, H.³

18
- 33749259827
- Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks
- A. Graves, S. Fernandez, F. Gomez, and]. Schmidhuber, "Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks, " in Proceedings of ICML, 2006
- (2006) Proceedings of ICML
- Graves, A.¹ Fernandez, S.² Gomez, F.³ Schmidhuber⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.