메뉴 건너뛰기




Volumn 2015-August, Issue , 2015, Pages 4844-4848

Phonological vocoding using artificial neural networks

Author keywords

low bit rate speech coding; Parametric vocoding; phonology

Indexed keywords

DEEP NEURAL NETWORKS; NEURAL NETWORKS; SIGNAL ENCODING; SPEECH CODING; SPEECH COMMUNICATION; VOCODERS;

EID: 84946076199     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178891     Document Type: Conference Paper
Times cited : (25)

References (30)
  • 1
    • 84890528919 scopus 로고    scopus 로고
    • On the (UN)importance of the contextual factors in HMM-based speech synthesis and coding
    • May IEEE
    • M. Cernak, P. Motlicek, and P. N. Garner, "On the (UN)importance of the contextual factors in HMM-based speech synthesis and coding, " in Proc. of ICASSP. May 2013, pp. 8140-8143, IEEE
    • (2013) Proc. of ICASSP , pp. 8140-8143
    • Cernak, M.1    Motlicek, P.2    Garner, P.N.3
  • 2
    • 84906268958 scopus 로고    scopus 로고
    • SyllableBased pitch encoding for low bit rate speech coding with recognition/synthesis architecture
    • Aug.2013
    • Milos Cernak, Xingyu Na, and Philip N. Garner, "SyllableBased Pitch Encoding for Low Bit Rate Speech Coding with Recognition/Synthesis Architecture, " in Proc. of Interspeech, Aug.2013, pp. 3449-3452
    • Proc. of Interspeech , pp. 3452
    • Cernak, M.1    Na, X.2    Garner, P.N.3
  • 3
    • 84910046086 scopus 로고    scopus 로고
    • Stress and accent transmission in HMMBased syllable-context very low bit rate speech coding
    • Sept
    • Milos Cernak, Alexandros Lazaridis, Philip N. Garner, and Petr Motlicek, "Stress and Accent Transmission In HMMBased Syllable-Context Very Low Bit Rate Speech Coding, " in Proc. of Interspeech, Sept. 2014, pp. 2799-2803
    • (2014) Proc. of Interspeech , pp. 2799-2803
    • Cernak, M.1    Lazaridis, A.2    Garner, P.N.3    Motlicek, P.4
  • 4
    • 84890490547 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis using deep neural networks
    • May, IEEE
    • Heiga Ze, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks, " in Proc. of ICASSP. May 2013, pp. 7962-7966, IEEE
    • (2013) Proc. of ICASSP , pp. 7962-7966
    • Ze, H.1    Senior, A.2    Schuster, M.3
  • 5
    • 84905251808 scopus 로고    scopus 로고
    • On the training aspects of deep neural network ( DNN) for parametric ITS synthesis
    • May IEEE
    • Yao Qian, Yuchen Fan, Wenping Hu, and F. K. Soong, "On the training aspects of Deep Neural Network ( DNN) for parametric ITS synthesis, " in Proc. of ICASSP. May 2014, pp. 3829-3833, IEEE
    • (2014) Proc. of ICASSP , pp. 3829-3833
    • Qian, Y.1    Fan, Y.2    Hu, W.3    Soong, F.K.4
  • 6
    • 84929157442 scopus 로고    scopus 로고
    • Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
    • Heng Lu, Simon King, and Oliver Watts, "Combining a Vector Space Representation of Linguistic Context with a Deep Neural Network for Text-To-Speech Synthesis, " in Proc. of 8th ISCA Workshop on Speech Synthesis, 2013, pp. 281-285
    • (2013) Proc. of 8th ISCA Workshop on Speech Synthesis , pp. 281-285
    • Lu, H.1    King, S.2    Watts, O.3
  • 7
    • 84905234316 scopus 로고    scopus 로고
    • Spectral modeling using neural autoregressive distribution estimators for statistical parametric speech synthesis
    • May, IEEE
    • Xiang Yin, Zhen-Hua Ling, and Li-Rong Dai, "Spectral modeling using neural autoregressive distribution estimators for statistical parametric speech synthesis, " in Proc. of ICASSP. May 2014, pp. 3824-3828, IEEE
    • (2014) Proc. of ICASSP , pp. 3824-3828
    • Yin, X.1    Ling, Z.-H.2    Dai, L.-R.3
  • 8
    • 0024909981 scopus 로고
    • A phonetic vocoder
    • May voU, IEEE
    • J. Picone and G. R. Doddington, "A phonetic vocoder, " in Proc. of ICASSP. May 1989, pp. 580-583 voU, IEEE
    • (1989) Proc. of ICASSP , pp. 580-583
    • Picone, J.1    Doddington, G.R.2
  • 9
    • 0034297586 scopus 로고    scopus 로고
    • Detection of phonological features in continuous speech using neural networks
    • Oct
    • Simon King and Paul Taylor, " Detection of phonological features in continuous speech using neural networks, " Computer Speech &Language, vol. 14, no. 4, pp. 333-353, Oct. 2000
    • (2000) Computer Speech &Language , vol.14 , Issue.4 , pp. 333-353
    • King, S.1    Taylor, P.2
  • 10
    • 84862931515 scopus 로고    scopus 로고
    • Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data
    • Mar
    • S. M. Siniscalchi, Dau-Cheng Lyu, T. Svendsen, and Chin-Hui Lee, "Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data, " IEEE Trans. on Audio, Speech, and Language Processing, vol. 20, no. 3, pp. 875-887, Mar. 2012
    • (2012) IEEE Trans. on Audio, Speech, and Language Processing , vol.20 , Issue.3 , pp. 875-887
    • Siniscalchi, S.M.1    Lyu, D.-C.2    Svendsen, T.3    Lee, C.-H.4
  • 14
    • 84867329143 scopus 로고    scopus 로고
    • Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition
    • March, IEEE SPS
    • Dong Yu, Sabato Siniscalchi, Li Deng, and Chin-Hui Lee, "Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition, " in Proc. of ICASSP. March 2012, IEEE SPS
    • (2012) Proc. of ICASSP
    • Yu, D.1    Siniscalchi, S.2    Deng, L.3    Lee, C.-H.4
  • 17
    • 0022896067 scopus 로고
    • B D L E X A data and cognition base of spoken French
    • G. Perennou, "B.D.L.E. X.: A data and cognition base of spoken French, " in Proc. of ICASSP, 1986, vol. 11, pp. 325-328
    • (1986) Proc. of ICASSP , vol.11 , pp. 325-328
    • Perennou, G.1
  • 19
    • 85135145174 scopus 로고    scopus 로고
    • Acoustic modeling based on the M D L principle for speech recognition
    • Koichi Shinoda and Takao Watanabe, "Acoustic modeling based on the M D L principle for speech recognition, " in Proc. of Eurospeech, 1997, pp. I-99-102
    • (1997) Proc. of Eurospeech , pp. 199-102
    • Shinoda, K.1    Watanabe, T.2
  • 20
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • July
    • Geoffrey E. Hinton, Simon Osindero, and Yee W. Teh, "A Fast Learning Algorithm for Deep Belief Nets," Neural Comput., vol. 18, no. 7, pp. 1527-1554, July 2006
    • (2006) Neural Comput , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.W.3
  • 22
    • 0028996993 scopus 로고
    • Speech parameter generation from HMM using dynamic features
    • May voU, IEEE
    • K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features, " in Proc. of ICASSP. May 1995, vol. 1, pp. 660-663 voU, IEEE
    • (1995) Proc. of ICASSP , vol.1 , pp. 660-663
    • Tokuda, K.1    Kobayashi, T.2    Imai, S.3
  • 24
    • 0027247004 scopus 로고
    • Mel-cepstral distance measure for objective speech quality assessment
    • May voU, IEEE
    • R. F. Kubichek, "Mel-cepstral distance measure for objective speech quality assessment, " in Proc. of ICASSP. May 1993, vol. 1, pp. 125-128 voU, IEEE
    • (1993) Proc. of ICASSP , vol.1 , pp. 125-128
    • Kubichek, R.F.1
  • 26
    • 84928118106 scopus 로고    scopus 로고
    • Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of FO and periodicity
    • Budapest, Hungary
    • H. Kawahara, H. Katayose, A. de Cheveigne, and R. D. Patterson, "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of FO and periodicity, " in Proc. of Eurospeech, Budapest, Hungary, 1999
    • (1999) Proc. of Eurospeech
    • Kawahara, H.1    Katayose, H.2    De Cheveigne, A.3    Patterson, R.D.4
  • 28
    • 84946038494 scopus 로고    scopus 로고
    • Voice source modelling using deep neural networks for statistical parametric speech synthesis
    • Lisbon, Portugal, September
    • Tuomo Raitio, Heng Lu, John Kane, Antti Suni, Martti Vainio, Simon King, and Paavo Alku, "Voice source modelling using deep neural networks for statistical parametric speech synthesis, " in Proc. of EUSIPCO, Lisbon, Portugal, September 2014
    • (2014) Proc. of EUSIPCO
    • Raitio, T.1    Lu, H.2    Kane, J.3    Suni, A.4    Vainio, M.5    King, S.6    Alku, P.7
  • 29
    • 85032752177 scopus 로고    scopus 로고
    • Parametric representation of speech signals
    • J.L. Flanagan, "Parametric representation of speech signals, " IEEE Signal Processing Magazine, vol. 27, no. 3, pp. 141-145, 2010
    • (2010) IEEE Signal Processing Magazine , vol.27 , Issue.3 , pp. 141-145
    • Flanagan, J.L.1
  • 30
    • 84936526522 scopus 로고
    • Towards an articulatory phonology
    • May
    • Catherine P. Browman and Louis M. Goldstein, "Towards an articulatory phonology, " Phonology, vol. 3, pp. 219-252, May 1986
    • (1986) Phonology , vol.3 , pp. 219-252
    • Browman, C.P.1    Goldstein, L.M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.