메뉴 건너뛰기




Volumn 24, Issue 12, 2016, Pages 2301-2312

Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding

Author keywords

continuous F0 coding; deep neural networks; spiking neural networks; Very low bit rate speech coding

Indexed keywords

CODES (SYMBOLS); CODING ERRORS; CONTINUOUS SPEECH RECOGNITION; DEEP NEURAL NETWORKS; HIDDEN MARKOV MODELS; IMAGE CODING; MARKOV PROCESSES; NETWORK CODING; NEURAL NETWORKS; SIGNAL ENCODING; SPEECH; SPEECH CODING;

EID: 85027056450     PISSN: 23299290     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2016.2604566     Document Type: Article
Times cited : (27)

References (78)
  • 1
    • 0028550870 scopus 로고
    • Current objectives in 4-kb/s wireline-quality speech coding standardization
    • Nov.
    • S. Dimolitsas, C. Ravishankar, and G. Schroder, "Current objectives in 4-kb/s wireline-quality speech coding standardization," IEEE Signal Process. Lett., vol. 1, no. 11, pp. 157-159, Nov. 1994.
    • (1994) IEEE Signal Process. Lett. , vol.1 , Issue.11 , pp. 157-159
    • Dimolitsas, S.1    Ravishankar, C.2    Schroder, G.3
  • 2
    • 0035397411 scopus 로고    scopus 로고
    • A very low bit rate speech coder based on a recognition/synthesis paradigm
    • Jul.
    • K.-S. Lee and R. Cox, "A very low bit rate speech coder based on a recognition/synthesis paradigm," IEEE Trans. Audio, Speech, Lang. Process., vol. 9, no. 5, pp. 482-491, Jul. 2001.
    • (2001) IEEE Trans. Audio, Speech, Lang. Process. , vol.9 , Issue.5 , pp. 482-491
    • Lee, K.-S.1    Cox, R.2
  • 5
    • 84892164904 scopus 로고    scopus 로고
    • A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques
    • Piscataway, NJ, USA: IEEE May , vol. 2
    • K. Tokuda, T. Masuko, J. Hiroi, T. Kobayashi, and T. Kitamura, "A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., vol. 2. Piscataway, NJ, USA: IEEE, May 1998, vol. 2, pp. 609-612.
    • (1998) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. , vol.2 , pp. 609-612
    • Tokuda, K.1    Masuko, T.2    Hiroi, J.3    Kobayashi, T.4    Kitamura, T.5
  • 13
    • 84867329143 scopus 로고    scopus 로고
    • Boosting attribute and phone estimation accuracies with deep neural networks for detectionbased speech recognition
    • Mar.
    • D. Yu, S. Siniscalchi, L. Deng, and C.-H. Lee, "Boosting attribute and phone estimation accuracies with deep neural networks for detectionbased speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. Mar. 2012, pp. 4169-4172.
    • (2012) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. , pp. 4169-4172
    • Yu, D.1    Siniscalchi, S.2    Deng, L.3    Lee, C.-H.4
  • 14
    • 84862931515 scopus 로고    scopus 로고
    • Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data
    • Mar.
    • S. M. Siniscalchi, D.-C. Lyu, T. Svendsen, and C.-H. Lee, "Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 875-887, Mar. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.3 , pp. 875-887
    • Siniscalchi, S.M.1    Lyu, D.-C.2    Svendsen, T.3    Lee, C.-H.4
  • 15
    • 84959104818 scopus 로고    scopus 로고
    • On compressibility of neural network phonological features for low bit rate speech coding
    • Sep.
    • A. Asaei, M. Cernak, and H. Bourlard, "On compressibility of neural network phonological features for low bit rate speech coding," in Proc. Interspeech, Sep. 2015, pp. 418-422.
    • (2015) Proc. Interspeech , pp. 418-422
    • Asaei, A.1    Cernak, M.2    Bourlard, H.3
  • 16
    • 69349090197 scopus 로고    scopus 로고
    • Learning deep architectures for AI
    • Jan.
    • Y. Bengio, "Learning deep architectures for AI," Found. Trends Mach. Learn., vol. 2, no. 1, pp. 1-127, Jan. 2009.
    • (2009) Found. Trends Mach. Learn. , vol.2 , Issue.1 , pp. 1-127
    • Bengio, Y.1
  • 19
    • 85032751458 scopus 로고    scopus 로고
    • Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
    • Nov.
    • G. Hinton, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, Nov. 2012.
    • (2012) IEEE Signal Process. Mag. , vol.29 , Issue.6 , pp. 82-97
    • Hinton, G.1
  • 20
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pretrained deep neural networks for large vocabulary speech recognition
    • Jan.
    • G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 23
    • 79955538498 scopus 로고    scopus 로고
    • Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis
    • Jul.
    • K. Yu, H. Zen, F. Mairesse, and S. Young, "Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis," Speech Commun., vol. 53, no. 6, pp. 914-923, Jul. 2011.
    • (2011) Speech Commun. , vol.53 , Issue.6 , pp. 914-923
    • Yu, K.1    Zen, H.2    Mairesse, F.3    Young, S.4
  • 25
    • 84945929642 scopus 로고    scopus 로고
    • DNN-based speech synthesis: Importance of input features and training data
    • A. Lazaridis, B. Potard, and P. N. Garner, "DNN-based speech synthesis: Importance of input features and training data," in Proc. Int. Conf. Speech Comput., 2015, pp. 193-200.
    • (2015) Proc. Int. Conf. Speech Comput. , pp. 193-200
    • Lazaridis, A.1    Potard, B.2    Garner, P.N.3
  • 26
    • 84989426403 scopus 로고
    • A new model of LPC excitation for producing natural-sounding speech at low bit rates
    • Piscataway, NJ, USA: IEEE May
    • B. S. Atal and J. R. Remde, "A new model of LPC excitation for producing natural-sounding speech at low bit rates," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., vol. 7. Piscataway, NJ, USA: IEEE, May 1982, pp. 614-617. [Online]. Available: Http://dx. doi.org/10.1109/icassp.1982.1171649
    • (1982) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. , vol.7 , pp. 614-617
    • Atal, B.S.1    Remde, J.R.2
  • 27
    • 0022219187 scopus 로고
    • Code-excited linear prediction (CELP): High-quality speech at very low bit rates
    • Piscataway, NJ, USA: IEEE Apr. [Online]. Available:
    • M. Schroeder and B. Atal, "Code-excited linear prediction (CELP): High-quality speech at very low bit rates," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 10. Piscataway, NJ, USA: IEEE, Apr. 1985, pp. 937-940. [Online]. Available: Http://dx.doi.org/10.1109/icassp.1985.1168147
    • (1985) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.10 , pp. 937-940
    • Schroeder, M.1    Atal, B.2
  • 29
    • 0030386937 scopus 로고    scopus 로고
    • Prediction in speech coding: The modification of the coding of LPC parameters and nonlinear estimation technique by using ANN
    • Oct. [Online]. Available:
    • Y. Zhen, "Prediction in speech coding: The modification of the coding of LPC parameters and nonlinear estimation technique by using ANN," in Proc. 3rd Int. Conf. Signal Process., Oct. 1996, vol. 1, pp. 690-693. [Online]. Available: Http://dx.doi.org/10.1109/icsigp. 1996.567357
    • (1996) Proc. 3rd Int. Conf. Signal Process. , vol.1 , pp. 690-693
    • Zhen, Y.1
  • 30
    • 0029727465 scopus 로고    scopus 로고
    • A nonlinear adaptive predictor for speech compression
    • Piscataway, NJ, USA: IEEE Jun. [Online]. Available:
    • S. Hunt, "A nonlinear adaptive predictor for speech compression," in Proc. IEEE Int. Conf. Neural Netw., vol. 4. Piscataway, NJ, USA: IEEE, Jun. 1996, pp. 1998-2002. [Online]. Available: Http://dx.doi. org/10.1109/icnn.1996.549208
    • (1996) Proc. IEEE Int. Conf. Neural Netw. , vol.4 , pp. 1998-2002
    • Hunt, S.1
  • 31
    • 0033309597 scopus 로고    scopus 로고
    • Discriminative coding with predictive neural networks
    • Hertfordshire, U.K.: IET [Online]. Available:
    • C. Chavy, B. Gas, and J. L. Zarader, "Discriminative coding with predictive neural networks," in Proc. 9th Int. Conf. Art. Neural Netw., vol. 1. Hertfordshire, U.K.: IET, 1999, pp. 216-220. [Online]. Available: Http://dx.doi.org/10.1049/cp:19991111
    • (1999) Proc. 9th Int. Conf. Art. Neural Netw. , vol.1 , pp. 216-220
    • Chavy, C.1    Gas, B.2    Zarader, J.L.3
  • 33
    • 84962840611 scopus 로고    scopus 로고
    • Packet loss concealment based on deep neural networks for digital speech transmission
    • Feb. [Online]. Available:
    • B.-K. Lee and J.-H. Chang, "Packet loss concealment based on deep neural networks for digital speech transmission," IEEE/ACMTrans. Audio, Speech, Lang. Process., vol. 24, no. 2, pp. 378-387, Feb. 2016. [Online]. Available: Http://dx.doi.org/10.1109/taslp.2015.2509780
    • (2016) IEEE/ACMTrans. Audio, Speech, Lang. Process. , vol.24 , Issue.2 , pp. 378-387
    • Lee, B.-K.1    Chang, J.-H.2
  • 34
    • 84864942567 scopus 로고    scopus 로고
    • Complexity reduction of LDCELP speech coding in prediction of gain using neural networks
    • M. Sheikhan, V. T. Vakili, and S. Garoucy, "Complexity reduction of LDCELP speech coding in prediction of gain using neural networks," World Appl. Sci. J., vol. 7, no. 7, pp. 38-44, 2009.
    • (2009) World Appl. Sci. J. , vol.7 , Issue.7 , pp. 38-44
    • Sheikhan, M.1    Vakili, V.T.2    Garoucy, S.3
  • 35
    • 0026384943 scopus 로고
    • A CELP codebook and search technique using a Hopfield net
    • Piscataway, NJ, USA: IEEE Apr. vol. 1. [Online]. Available:
    • M. G. Easton and C. C. Goodyear, "A CELP codebook and search technique using a Hopfield net," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 1. Piscataway, NJ, USA: IEEE, Apr. 1991, pp. 685-688 vol. 1. [Online]. Available: Http://dx.doi.org/10. 1109/icassp.1991.150432
    • (1991) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.1 , pp. 685-688
    • Easton, M.G.1    Goodyear, C.C.2
  • 36
    • 0028516668 scopus 로고
    • Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding
    • Oct. [Online].Available:
    • L. Wu, M. Niranjan, and F. Fallside, "Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding," IEEE Trans. Audio, Speech, Lang. Process., vol. 2, no. 4, pp. 482-489, Oct. 1994. [Online].Available: Http://dx.doi.org/10.1109/89. 326608
    • (1994) IEEE Trans. Audio, Speech, Lang. Process. , vol.2 , Issue.4 , pp. 482-489
    • Wu, L.1    Niranjan, M.2    Fallside, F.3
  • 37
    • 0024060644 scopus 로고
    • Multiband excitation vocoder
    • Aug. [Online]. Available:
    • D.W. Griffin and J. S. Lim, "Multiband excitation vocoder," IEEE Trans. Audio, Speech, Lang. Process., vol. 36, no. 8, pp. 1223-1235, Aug. 1988. [Online]. Available: Http://dx.doi.org/10.1109/29.1651
    • (1988) IEEE Trans. Audio, Speech, Lang. Process. , vol.36 , Issue.8 , pp. 1223-1235
    • Griffin, D.W.1    Lim, J.S.2
  • 38
    • 1642601602 scopus 로고    scopus 로고
    • A robust 800 bps MBE coder with VQ and MLP
    • Piscataway, NJ, USA: IEEE, Oct. [Online]. Available:
    • H. Cui and H. Jiang, "A robust 800 bps MBE coder with VQ and MLP," in Proc. Int. Conf. Commun. Technol. Proc., vol. 2. Piscataway, NJ, USA: IEEE, Oct. 1998, pp. 4. [Online]. Available: Http://dx.doi.org/10.1109/icct.1998.741011
    • (1998) Proc. Int. Conf. Commun. Technol. Proc. , vol.2 , pp. 4
    • Cui, H.1    Jiang, H.2
  • 39
    • 0020194708 scopus 로고
    • An 800 bit/s vector quantization LPC vocoder
    • Oct. [Online]. Available:
    • D. Wong, B.-H. Juang, and A. Gray, "An 800 bit/s vector quantization LPC vocoder," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-30, no. 5, pp. 770-780, Oct. 1982. [Online]. Available: Http://dx.doi.org/10.1109/tassp.1982.1163960
    • (1982) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-30 , Issue.5 , pp. 770-780
    • Wong, D.1    Juang, B.-H.2    Gray, A.3
  • 40
    • 84871173623 scopus 로고
    • Segment quantization for very-low-rate speech coding
    • Piscataway, NJ, USA: IEEE May [Online]. Available:
    • S. Roucos, R. Schwartz, and J. Makhoul, "Segment quantization for very-low-rate speech coding," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 7. Piscataway, NJ, USA: IEEE, May 1982, pp. 1565-1568. [Online]. Available: Http://dx.doi.org/10.1109/icassp.1982.1171472
    • (1982) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.7 , pp. 1565-1568
    • Roucos, S.1    Schwartz, R.2    Makhoul, J.3
  • 41
    • 0020550073 scopus 로고
    • A segment vocoder at 150 b/s
    • Piscataway, NJ, USA: IEEE Apr. [Online]. Available:
    • S. Roucos, R. Schwartz, and J.Makhoul, "A segment vocoder at 150 b/s," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 8. Piscataway, NJ, USA: IEEE, Apr. 1983, pp. 61-64. [Online]. Available: Http://dx.doi.org/10.1109/icassp.1983.1172241
    • (1983) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.8 , pp. 61-64
    • Roucos, S.1    Schwartz, R.2    Makhoul, J.3
  • 42
    • 0020548652 scopus 로고
    • Very low data rate speech compression with LPC vector and matrix quantization
    • Piscataway, NJ, USA: IEEE Apr. [Online]. Available:
    • D. Wong, B. Juang, and D. Cheng, "Very low data rate speech compression with LPC vector and matrix quantization," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 8. Piscataway, NJ, USA: IEEE, Apr. 1983, pp. 65-68. [Online]. Available: Http://dx.doi.org/10.1109/icassp.1983.1172244
    • (1983) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.8 , pp. 65-68
    • Wong, D.1    Juang, B.2    Cheng, D.3
  • 43
    • 0022084026 scopus 로고
    • Matrix quantizer design for LPC speech using the generalized Llyod algorithm
    • Jun. [Online].Available:
    • C. Tsao and R. Gray, "Matrix quantizer design for LPC speech using the generalized Llyod algorithm," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 3, pp. 537-545, Jun. 1985. [Online].Available: Http://dx.doi.org/10.1109/tassp.1985.1164584
    • (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.3 , pp. 537-545
    • Tsao, C.1    Gray, R.2
  • 44
    • 0024075701 scopus 로고
    • LPC speech coding based on variablelength segment quantization
    • Sep. [Online]. Available:
    • Y. Shiraki and M. Honda, "LPC speech coding based on variablelength segment quantization," IEEE Trans. Audio, Speech, Lang. Process., vol. 36, no. 9, pp. 1437-1444, Sep. 1988. [Online]. Available: Http://dx.doi.org/10.1109/29.90372
    • (1988) IEEE Trans. Audio, Speech, Lang. Process. , vol.36 , Issue.9 , pp. 1437-1444
    • Shiraki, Y.1    Honda, M.2
  • 45
    • 84976552353 scopus 로고    scopus 로고
    • Speech Compression
    • Jun. [Online]. Available:
    • J. Gibson, "Speech Compression," Information, vol. 7, no. 2, p. 32, Jun. 2016. [Online]. Available: Http://dx.doi.org/10.3390/info7020032
    • (2016) Information , vol.7 , Issue.2 , pp. 32
    • Gibson, J.1
  • 47
    • 80052637232 scopus 로고    scopus 로고
    • Demodulation as probabilistic inference
    • Nov.
    • R. E. Turner and M. Sahani, "Demodulation as probabilistic inference," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2398-2411, Nov. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.8 , pp. 2398-2411
    • Turner, R.E.1    Sahani, M.2
  • 48
    • 84903976036 scopus 로고    scopus 로고
    • A role for amplitude modulation phase relationships in speech rhythm perception
    • Jul.
    • V. Leong, M. A. Stone, R. E. Turner, and U. Goswami, "A role for amplitude modulation phase relationships in speech rhythm perception." J. Acoust. Soc. Amer., vol. 136, no. 1, pp. 366-381, Jul. 2014.
    • (2014) J. Acoust. Soc. Amer. , vol.136 , Issue.1 , pp. 366-381
    • Leong, V.1    Stone, M.A.2    Turner, R.E.3    Goswami, U.4
  • 49
    • 23944484420 scopus 로고    scopus 로고
    • An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex
    • Sep.
    • P. Lakatos, A. S. Shah, K. H. Knuth, I. Ulbert, G. Karmos, and C. E. Schroeder, "An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex." J. Neurophysiol., vol. 94, no. 3, pp. 1904-1911, Sep. 2005.
    • (2005) J. Neurophysiol. , vol.94 , Issue.3 , pp. 1904-1911
    • Lakatos, P.1    Shah, A.S.2    Knuth, K.H.3    Ulbert, I.4    Karmos, G.5    Schroeder, C.E.6
  • 52
    • 85008023596 scopus 로고    scopus 로고
    • Continuous F0 modeling for HMM based statistical parametric speech synthesis
    • Jul. [Online]. Available:
    • K. Yu and S. Young, "Continuous F0 modeling for HMM based statistical parametric speech synthesis," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1071-1079, Jul. 2011. [Online]. Available: Http://dx.doi.org/10.1109/tasl.2010.2076805
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.5 , pp. 1071-1079
    • Yu, K.1    Young, S.2
  • 53
    • 84902953892 scopus 로고    scopus 로고
    • Using noisy speech to study the robustness of a continuous F0 modelling method in HMM-based speech synthesis
    • K. U. Ogbureke, J. P. Cabral, and J. Carson-Berndsen, "Using noisy speech to study the robustness of a continuous F0 modelling method in HMM-based speech synthesis," in Proc. Speech Prosody, 2012, pp. 67-70.
    • (2012) Proc. Speech Prosody , pp. 67-70
    • Ogbureke, K.U.1    Cabral, J.P.2    Carson-Berndsen, J.3
  • 54
    • 84959123110 scopus 로고    scopus 로고
    • Neuromorphic based oscillatory device for incremental syllable boundary detection
    • Sep.
    • A. Hyafil and M. Cernak, "Neuromorphic based oscillatory device for incremental syllable boundary detection," in Proc. Interspeech, Sep. 2015, pp. 1191-1195.
    • (2015) Proc. Interspeech , pp. 1191-1195
    • Hyafil, A.1    Cernak, M.2
  • 55
    • 84906268958 scopus 로고    scopus 로고
    • Syllable-based pitch encoding for low bit rate speech coding with recognition/synthesis architecture
    • Aug.
    • M. Cernak, X. Na, and P. N. Garner, "Syllable-based pitch encoding for low bit rate speech coding with recognition/synthesis architecture," in Proc. Interspeech, Aug. 2013, pp. 3449-3452.
    • (2013) Proc. Interspeech , pp. 3449-3452
    • Cernak, M.1    Na, X.2    Garner, P.N.3
  • 56
    • 84994246116 scopus 로고    scopus 로고
    • PhonVoc: A phonetic and phonological vocoding toolkit
    • M. Cernak and P. N. Garner, "PhonVoc: A phonetic and phonological vocoding toolkit," in Proc. Interspeech, 2016.
    • (2016) Proc. Interspeech
    • Cernak, M.1    Garner, P.N.2
  • 57
    • 0012330750 scopus 로고
    • The design for the wall street journal-based CSR corpus
    • D. B. Paul and J. M. Baker, "The design for the wall street journal-based CSR corpus," in Proc. Workshop Speech Nat. Lang., 1992, pp. 357-362.
    • (1992) Proc. Workshop Speech Nat. Lang. , pp. 357-362
    • Paul, D.B.1    Baker, J.M.2
  • 60
    • 70350498327 scopus 로고    scopus 로고
    • The HMM-based speech synthesis system version 2.0
    • H. Zen, "The HMM-based speech synthesis system version 2.0," in Proc. 6th ISCA Speech Synthesis Workshop, 2007, pp. 131-136.
    • (2007) Proc. 6th ISCA Speech Synthesis Workshop , pp. 131-136
    • Zen, H.1
  • 61
    • 85074721580 scopus 로고    scopus 로고
    • Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project
    • M. Wester, "Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project," in Proc. 7th ISCA Speech Synthesis Workshop, 2010, pp. 192-197.
    • (2010) Proc. 7th ISCA Speech Synthesis Workshop , pp. 192-197
    • Wester, M.1
  • 62
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • Jul.
    • G. E. Hinton, S. Osindero, and Y. W. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, no. 7, pp. 1527-1554, Jul. 2006.
    • (2006) Neural Comput. , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.W.3
  • 64
    • 84930661557 scopus 로고    scopus 로고
    • Speech encoding by coupled cortical theta and gamma oscillations
    • May [Online]. Available:
    • A. Hyafil, L. Fontolan, C. Kabdebon, B. Gutkin, A.-L. Giraud, and H. Brownell, "Speech encoding by coupled cortical theta and gamma oscillations," eLife, vol. 2015, no. 4, May 2015, Art. no. e06213. [Online]. Available: Http://dx.doi.org/10.7554/elife.06213
    • (2015) ELife , vol.2015 , Issue.4
    • Hyafil, A.1    Fontolan, L.2    Kabdebon, C.3    Gutkin, B.4    Giraud, A.-L.5    Brownell, H.6
  • 65
    • 84930614319 scopus 로고    scopus 로고
    • [Online]. Available:
    • W. M. Fisher, tsylb2. 1996. [Online]. Available: Http://www.nist. gov/speech/tools
    • (1996) Tsylb2
    • Fisher, W.M.1
  • 67
    • 0027247004 scopus 로고
    • Mel-cepstral distance measure for objective speech quality assessment
    • Piscataway, NJ, USA: IEEE May
    • R. F. Kubichek, "Mel-cepstral distance measure for objective speech quality assessment," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 1. Piscataway, NJ, USA: IEEE, May 1993, pp. 125-128.
    • (1993) Proc. IEEE Int. Conf. Acoust. Speech Signal Process , vol.1 , pp. 125-128
    • Kubichek, R.F.1
  • 69
    • 85075908665 scopus 로고    scopus 로고
    • Speech quality assessment
    • J. Benesty,M.M. Sondhi, and Y. Huang, Eds. Berlin, Germany: Springer
    • V. Grancharov andW. B. Kleijn, "Speech quality assessment," in Springer Handbook of Speech Processing, J. Benesty,M.M. Sondhi, and Y. Huang, Eds. Berlin, Germany: Springer, 2008, pp. 83-100.
    • (2008) Springer Handbook of Speech Processing , pp. 83-100
    • Grancharov, V.1    Kleijn, W.B.2
  • 70
    • 79960916745 scopus 로고    scopus 로고
    • An algorithm for intelligibility prediction of time-frequency weighted noisy speech
    • [Online]. Available:
    • C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, "An algorithm for intelligibility prediction of time-frequency weighted noisy speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2125-2136, 2011. [Online]. Available: Http://dx.doi.org/10.1109/tasl.2011.2114881
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.7 , pp. 2125-2136
    • Taal, C.H.1    Hendriks, R.C.2    Heusdens, R.3    Jensen, J.4
  • 71
    • 84942636431 scopus 로고    scopus 로고
    • Speech intelligibility evaluation for mobile phones
    • [Online]. Available:
    • S. Jørgensen, J. Cubick, and T. Dau, "Speech intelligibility evaluation for mobile phones," Acta Acust. United with Acust., vol. 105, pp. 1016-1025. [Online]. Available: Http://dx.doi.org/10.3813/aaa.918896
    • Acta Acust. United with Acust. , vol.105 , pp. 1016-1025
    • Jørgensen, S.1    Cubick, J.2    Dau, T.3
  • 72
    • 78049365405 scopus 로고    scopus 로고
    • A shorttime objective intelligibility measure for time-frequency weighted noisy speech
    • Mar. [Online]. Available:
    • C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, "A shorttime objective intelligibility measure for time-frequency weighted noisy speech," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Mar. 2010, pp. 4214-4217. [Online]. Available: Http://dx.doi.org/10.1109/icassp.2010.5495701
    • (2010) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. , pp. 4214-4217
    • Taal, C.H.1    Hendriks, R.C.2    Heusdens, R.3    Jensen, J.4
  • 75
    • 84959104369 scopus 로고    scopus 로고
    • Compressing deep neural networks using a rank-constrained topology
    • P. Nakkiran, R. Alvarez, R. Prabhavalkar, and C. Parada, "Compressing deep neural networks using a rank-constrained topology," in Proc. Interspeech, 2015, pp. 1473-1477.
    • (2015) Proc. Interspeech , pp. 1473-1477
    • Nakkiran, P.1    Alvarez, R.2    Prabhavalkar, R.3    Parada, C.4
  • 77
    • 85027574156 scopus 로고    scopus 로고
    • Small-footprint deep neural networks with highway connections for speech recognition
    • vol. abs/1512.04280 [Online]. Available:
    • L. Lu and S. Renals, "Small-footprint deep neural networks with highway connections for speech recognition," CoRR, vol. abs/1512.04280, 2015. [Online]. Available: Http://arxiv.org/abs/1512.04280
    • (2015) CoRR
    • Lu, L.1    Renals, S.2
  • 78
    • 84960944045 scopus 로고    scopus 로고
    • Deepear: Robust smartphone audio sensing in unconstrained acoustic environments using deep learning
    • [Online]. Available:
    • N. D. Lane, P. Georgiev, and L. Qendro, "Deepear: Robust smartphone audio sensing in unconstrained acoustic environments using deep learning," in Proc. ACM Int. Joint Conf. Pervasive Ubiquitous Comput., 2015, pp. 283-294. [Online]. Available: Http://doi.acm. org/10.1145/2750858.2804262
    • (2015) Proc. ACM Int. Joint Conf. Pervasive Ubiquitous Comput. , pp. 283-294
    • Lane, N.D.1    Georgiev, P.2    Qendro, L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.