-
1
-
-
0028550870
-
Current objectives in 4-kb/s wireline-quality speech coding standardization
-
Nov.
-
S. Dimolitsas, C. Ravishankar, and G. Schroder, "Current objectives in 4-kb/s wireline-quality speech coding standardization," IEEE Signal Process. Lett., vol. 1, no. 11, pp. 157-159, Nov. 1994.
-
(1994)
IEEE Signal Process. Lett.
, vol.1
, Issue.11
, pp. 157-159
-
-
Dimolitsas, S.1
Ravishankar, C.2
Schroder, G.3
-
2
-
-
0035397411
-
A very low bit rate speech coder based on a recognition/synthesis paradigm
-
Jul.
-
K.-S. Lee and R. Cox, "A very low bit rate speech coder based on a recognition/synthesis paradigm," IEEE Trans. Audio, Speech, Lang. Process., vol. 9, no. 5, pp. 482-491, Jul. 2001.
-
(2001)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.9
, Issue.5
, pp. 482-491
-
-
Lee, K.-S.1
Cox, R.2
-
3
-
-
0141702209
-
Corpus based very low bit rate speech coding
-
Piscataway, NJ, USA: IEEE Apr.
-
G. V. Baudoin and F. El Chami, "Corpus based very low bit rate speech coding," in Proc. 2003 IEEE Int. Conf. Acoust. Speech Signal Process., vol. 1. Piscataway, NJ, USA: IEEE, Apr. 2003, pp. I-792-I-795.
-
(2003)
Proc. 2003 IEEE Int. Conf. Acoust. Speech Signal Process.
, vol.1
, pp. I792-I795
-
-
Baudoin, G.V.1
El Chami, F.2
-
4
-
-
0024909981
-
A phonetic vocoder
-
Piscataway, NJ, USA: IEEE May
-
J. Picone and G. R. Doddington, "A phonetic vocoder," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., vol. 1. Piscataway, NJ, USA: IEEE, May 1989, pp. 580-583.
-
(1989)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process.
, vol.1
, pp. 580-583
-
-
Picone, J.1
Doddington, G.R.2
-
5
-
-
84892164904
-
A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques
-
Piscataway, NJ, USA: IEEE May , vol. 2
-
K. Tokuda, T. Masuko, J. Hiroi, T. Kobayashi, and T. Kitamura, "A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., vol. 2. Piscataway, NJ, USA: IEEE, May 1998, vol. 2, pp. 609-612.
-
(1998)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process.
, vol.2
, pp. 609-612
-
-
Tokuda, K.1
Masuko, T.2
Hiroi, J.3
Kobayashi, T.4
Kitamura, T.5
-
6
-
-
51449103684
-
Multisensor very lowbit rate speech coding using segment quantization
-
Mar.
-
A. McCree, K. Brady, and T. F. Quatieri, "Multisensor very lowbit rate speech coding using segment quantization," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Mar. 2008, pp. 3997-4000.
-
(2008)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process.
, pp. 3997-4000
-
-
McCree, A.1
Brady, K.2
Quatieri, T.F.3
-
7
-
-
84928152663
-
Incremental syllable-context phonetic vocoding
-
Jun.
-
M. Cernak, P. N. Garner, A. Lazaridis, P. Motlicek, and X. Na, "Incremental syllable-context phonetic vocoding," IEEE/ACMTrans. Audio, Speech, Lang. Process., vol. 23, no. 6, pp. 1019-1030, Jun. 2015.
-
(2015)
IEEE/ACMTrans. Audio, Speech, Lang. Process.
, vol.23
, Issue.6
, pp. 1019-1030
-
-
Cernak, M.1
Garner, P.N.2
Lazaridis, A.3
Motlicek, P.4
Na, X.5
-
11
-
-
0041385414
-
-
Harlow, Essex, U.K.: Longman
-
J. Harris and G. Lindsey, The Elements of Phonological Representation. Harlow, Essex, U.K.: Longman, 1995, pp. 34-79.
-
(1995)
The Elements of Phonological Representation
, pp. 34-79
-
-
Harris, J.1
Lindsey, G.2
-
13
-
-
84867329143
-
Boosting attribute and phone estimation accuracies with deep neural networks for detectionbased speech recognition
-
Mar.
-
D. Yu, S. Siniscalchi, L. Deng, and C.-H. Lee, "Boosting attribute and phone estimation accuracies with deep neural networks for detectionbased speech recognition," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. Mar. 2012, pp. 4169-4172.
-
(2012)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process.
, pp. 4169-4172
-
-
Yu, D.1
Siniscalchi, S.2
Deng, L.3
Lee, C.-H.4
-
14
-
-
84862931515
-
Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data
-
Mar.
-
S. M. Siniscalchi, D.-C. Lyu, T. Svendsen, and C.-H. Lee, "Experiments on cross-language attribute detection and phone recognition with minimal target-specific training data," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 3, pp. 875-887, Mar. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.3
, pp. 875-887
-
-
Siniscalchi, S.M.1
Lyu, D.-C.2
Svendsen, T.3
Lee, C.-H.4
-
15
-
-
84959104818
-
On compressibility of neural network phonological features for low bit rate speech coding
-
Sep.
-
A. Asaei, M. Cernak, and H. Bourlard, "On compressibility of neural network phonological features for low bit rate speech coding," in Proc. Interspeech, Sep. 2015, pp. 418-422.
-
(2015)
Proc. Interspeech
, pp. 418-422
-
-
Asaei, A.1
Cernak, M.2
Bourlard, H.3
-
16
-
-
69349090197
-
Learning deep architectures for AI
-
Jan.
-
Y. Bengio, "Learning deep architectures for AI," Found. Trends Mach. Learn., vol. 2, no. 1, pp. 1-127, Jan. 2009.
-
(2009)
Found. Trends Mach. Learn.
, vol.2
, Issue.1
, pp. 1-127
-
-
Bengio, Y.1
-
19
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
Nov.
-
G. Hinton, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 82-97, Nov. 2012.
-
(2012)
IEEE Signal Process. Mag.
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
-
20
-
-
84055222005
-
Context-dependent pretrained deep neural networks for large vocabulary speech recognition
-
Jan.
-
G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pretrained deep neural networks for large vocabulary speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 30-42, Jan. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.1
Yu, D.2
Deng, L.3
Acero, A.4
-
21
-
-
84890490547
-
Statistical parametric speech synthesis using deep neural networks
-
H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks," in Proc. IEEE Int. Conf. Acoust., Speech Signal Process., 2013, pp. 7962-7966.
-
(2013)
Proc. IEEE Int. Conf. Acoust., Speech Signal Process.
, pp. 7962-7966
-
-
Zen, H.1
Senior, A.2
Schuster, M.3
-
22
-
-
34249043508
-
Anytime learning of decision trees
-
S. Esmeir, S. Markovitch, and C. Sammut, "Anytime learning of decision trees," J. Mach. Learn. Res., vol. 8, pp. 891-933, 2007.
-
(2007)
J. Mach. Learn. Res.
, vol.8
, pp. 891-933
-
-
Esmeir, S.1
Markovitch, S.2
Sammut, C.3
-
23
-
-
79955538498
-
Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis
-
Jul.
-
K. Yu, H. Zen, F. Mairesse, and S. Young, "Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis," Speech Commun., vol. 53, no. 6, pp. 914-923, Jul. 2011.
-
(2011)
Speech Commun.
, vol.53
, Issue.6
, pp. 914-923
-
-
Yu, K.1
Zen, H.2
Mairesse, F.3
Young, S.4
-
24
-
-
84905251808
-
On the training aspects of deep neural network (DNN) for parametric TTS synthesis
-
Y. Qian, Y. Fan, W. Hu, and F. Soong, "On the training aspects of deep neural network (DNN) for parametric TTS synthesis," in Proc. IEEE Int. Conf. Acoust., Speech Signal Process., 2014, pp. 3829-3833.
-
(2014)
Proc. IEEE Int. Conf. Acoust., Speech Signal Process.
, pp. 3829-3833
-
-
Qian, Y.1
Fan, Y.2
Hu, W.3
Soong, F.4
-
25
-
-
84945929642
-
DNN-based speech synthesis: Importance of input features and training data
-
A. Lazaridis, B. Potard, and P. N. Garner, "DNN-based speech synthesis: Importance of input features and training data," in Proc. Int. Conf. Speech Comput., 2015, pp. 193-200.
-
(2015)
Proc. Int. Conf. Speech Comput.
, pp. 193-200
-
-
Lazaridis, A.1
Potard, B.2
Garner, P.N.3
-
26
-
-
84989426403
-
A new model of LPC excitation for producing natural-sounding speech at low bit rates
-
Piscataway, NJ, USA: IEEE May
-
B. S. Atal and J. R. Remde, "A new model of LPC excitation for producing natural-sounding speech at low bit rates," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., vol. 7. Piscataway, NJ, USA: IEEE, May 1982, pp. 614-617. [Online]. Available: Http://dx. doi.org/10.1109/icassp.1982.1171649
-
(1982)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process.
, vol.7
, pp. 614-617
-
-
Atal, B.S.1
Remde, J.R.2
-
27
-
-
0022219187
-
Code-excited linear prediction (CELP): High-quality speech at very low bit rates
-
Piscataway, NJ, USA: IEEE Apr. [Online]. Available:
-
M. Schroeder and B. Atal, "Code-excited linear prediction (CELP): High-quality speech at very low bit rates," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 10. Piscataway, NJ, USA: IEEE, Apr. 1985, pp. 937-940. [Online]. Available: Http://dx.doi.org/10.1109/icassp.1985.1168147
-
(1985)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.10
, pp. 937-940
-
-
Schroeder, M.1
Atal, B.2
-
28
-
-
0025694638
-
Speech coding based on a multi-layer neural network
-
Apr. [Online]. Available:
-
S. Morishima, H. Harashima, and Y. Katayama, "Speech coding based on a multi-layer neural network," in Proc. IEEE Int. Conf. Commun., Incl. Supercomm Tech. Sessions. Conf. Record., Apr. 1990, vol. 2, pp. 429-433. [Online]. Available: Http://dx.doi.org/10.1109/icc.1990. 117117
-
(1990)
Proc. IEEE Int. Conf. Commun., Incl. Supercomm Tech. Sessions. Conf. Record.
, vol.2
, pp. 429-433
-
-
Morishima, S.1
Harashima, H.2
Katayama, Y.3
-
29
-
-
0030386937
-
Prediction in speech coding: The modification of the coding of LPC parameters and nonlinear estimation technique by using ANN
-
Oct. [Online]. Available:
-
Y. Zhen, "Prediction in speech coding: The modification of the coding of LPC parameters and nonlinear estimation technique by using ANN," in Proc. 3rd Int. Conf. Signal Process., Oct. 1996, vol. 1, pp. 690-693. [Online]. Available: Http://dx.doi.org/10.1109/icsigp. 1996.567357
-
(1996)
Proc. 3rd Int. Conf. Signal Process.
, vol.1
, pp. 690-693
-
-
Zhen, Y.1
-
30
-
-
0029727465
-
A nonlinear adaptive predictor for speech compression
-
Piscataway, NJ, USA: IEEE Jun. [Online]. Available:
-
S. Hunt, "A nonlinear adaptive predictor for speech compression," in Proc. IEEE Int. Conf. Neural Netw., vol. 4. Piscataway, NJ, USA: IEEE, Jun. 1996, pp. 1998-2002. [Online]. Available: Http://dx.doi. org/10.1109/icnn.1996.549208
-
(1996)
Proc. IEEE Int. Conf. Neural Netw.
, vol.4
, pp. 1998-2002
-
-
Hunt, S.1
-
31
-
-
0033309597
-
Discriminative coding with predictive neural networks
-
Hertfordshire, U.K.: IET [Online]. Available:
-
C. Chavy, B. Gas, and J. L. Zarader, "Discriminative coding with predictive neural networks," in Proc. 9th Int. Conf. Art. Neural Netw., vol. 1. Hertfordshire, U.K.: IET, 1999, pp. 216-220. [Online]. Available: Http://dx.doi.org/10.1049/cp:19991111
-
(1999)
Proc. 9th Int. Conf. Art. Neural Netw.
, vol.1
, pp. 216-220
-
-
Chavy, C.1
Gas, B.2
Zarader, J.L.3
-
32
-
-
33745745020
-
Adaptive hybrid speech coding with a MLP/LPC structure
-
June 2-4 1999 Proceedings, Volume II. Berlin, Germany: Springer [Online]. Available:
-
M. Faúndez-Zanuy, "Adaptive hybrid speech coding with a MLP/LPC structure," in Engineering Applications of Bio-Inspired Artificial Neural Networks: International Work-Conference on Artificial and Natural Neural Networks, IWANN'99 Alicante, Spain, June 2-4, 1999 Proceedings, Volume II. Berlin, Germany: Springer, 1999, pp. 814-823. [Online]. Available: Http://dx.doi.org/10.1007/BFb0100549
-
(1999)
Engineering Applications of Bio-Inspired Artificial Neural Networks: International Work-Conference On Artificial and Natural Neural Networks, IWANN'99 Alicante, Spain
, pp. 814-823
-
-
Faúndez-Zanuy, M.1
-
33
-
-
84962840611
-
Packet loss concealment based on deep neural networks for digital speech transmission
-
Feb. [Online]. Available:
-
B.-K. Lee and J.-H. Chang, "Packet loss concealment based on deep neural networks for digital speech transmission," IEEE/ACMTrans. Audio, Speech, Lang. Process., vol. 24, no. 2, pp. 378-387, Feb. 2016. [Online]. Available: Http://dx.doi.org/10.1109/taslp.2015.2509780
-
(2016)
IEEE/ACMTrans. Audio, Speech, Lang. Process.
, vol.24
, Issue.2
, pp. 378-387
-
-
Lee, B.-K.1
Chang, J.-H.2
-
34
-
-
84864942567
-
Complexity reduction of LDCELP speech coding in prediction of gain using neural networks
-
M. Sheikhan, V. T. Vakili, and S. Garoucy, "Complexity reduction of LDCELP speech coding in prediction of gain using neural networks," World Appl. Sci. J., vol. 7, no. 7, pp. 38-44, 2009.
-
(2009)
World Appl. Sci. J.
, vol.7
, Issue.7
, pp. 38-44
-
-
Sheikhan, M.1
Vakili, V.T.2
Garoucy, S.3
-
35
-
-
0026384943
-
A CELP codebook and search technique using a Hopfield net
-
Piscataway, NJ, USA: IEEE Apr. vol. 1. [Online]. Available:
-
M. G. Easton and C. C. Goodyear, "A CELP codebook and search technique using a Hopfield net," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 1. Piscataway, NJ, USA: IEEE, Apr. 1991, pp. 685-688 vol. 1. [Online]. Available: Http://dx.doi.org/10. 1109/icassp.1991.150432
-
(1991)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.1
, pp. 685-688
-
-
Easton, M.G.1
Goodyear, C.C.2
-
36
-
-
0028516668
-
Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding
-
Oct. [Online].Available:
-
L. Wu, M. Niranjan, and F. Fallside, "Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding," IEEE Trans. Audio, Speech, Lang. Process., vol. 2, no. 4, pp. 482-489, Oct. 1994. [Online].Available: Http://dx.doi.org/10.1109/89. 326608
-
(1994)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.2
, Issue.4
, pp. 482-489
-
-
Wu, L.1
Niranjan, M.2
Fallside, F.3
-
37
-
-
0024060644
-
Multiband excitation vocoder
-
Aug. [Online]. Available:
-
D.W. Griffin and J. S. Lim, "Multiband excitation vocoder," IEEE Trans. Audio, Speech, Lang. Process., vol. 36, no. 8, pp. 1223-1235, Aug. 1988. [Online]. Available: Http://dx.doi.org/10.1109/29.1651
-
(1988)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.36
, Issue.8
, pp. 1223-1235
-
-
Griffin, D.W.1
Lim, J.S.2
-
38
-
-
1642601602
-
A robust 800 bps MBE coder with VQ and MLP
-
Piscataway, NJ, USA: IEEE, Oct. [Online]. Available:
-
H. Cui and H. Jiang, "A robust 800 bps MBE coder with VQ and MLP," in Proc. Int. Conf. Commun. Technol. Proc., vol. 2. Piscataway, NJ, USA: IEEE, Oct. 1998, pp. 4. [Online]. Available: Http://dx.doi.org/10.1109/icct.1998.741011
-
(1998)
Proc. Int. Conf. Commun. Technol. Proc.
, vol.2
, pp. 4
-
-
Cui, H.1
Jiang, H.2
-
39
-
-
0020194708
-
An 800 bit/s vector quantization LPC vocoder
-
Oct. [Online]. Available:
-
D. Wong, B.-H. Juang, and A. Gray, "An 800 bit/s vector quantization LPC vocoder," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-30, no. 5, pp. 770-780, Oct. 1982. [Online]. Available: Http://dx.doi.org/10.1109/tassp.1982.1163960
-
(1982)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-30
, Issue.5
, pp. 770-780
-
-
Wong, D.1
Juang, B.-H.2
Gray, A.3
-
40
-
-
84871173623
-
Segment quantization for very-low-rate speech coding
-
Piscataway, NJ, USA: IEEE May [Online]. Available:
-
S. Roucos, R. Schwartz, and J. Makhoul, "Segment quantization for very-low-rate speech coding," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 7. Piscataway, NJ, USA: IEEE, May 1982, pp. 1565-1568. [Online]. Available: Http://dx.doi.org/10.1109/icassp.1982.1171472
-
(1982)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.7
, pp. 1565-1568
-
-
Roucos, S.1
Schwartz, R.2
Makhoul, J.3
-
41
-
-
0020550073
-
A segment vocoder at 150 b/s
-
Piscataway, NJ, USA: IEEE Apr. [Online]. Available:
-
S. Roucos, R. Schwartz, and J.Makhoul, "A segment vocoder at 150 b/s," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 8. Piscataway, NJ, USA: IEEE, Apr. 1983, pp. 61-64. [Online]. Available: Http://dx.doi.org/10.1109/icassp.1983.1172241
-
(1983)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.8
, pp. 61-64
-
-
Roucos, S.1
Schwartz, R.2
Makhoul, J.3
-
42
-
-
0020548652
-
Very low data rate speech compression with LPC vector and matrix quantization
-
Piscataway, NJ, USA: IEEE Apr. [Online]. Available:
-
D. Wong, B. Juang, and D. Cheng, "Very low data rate speech compression with LPC vector and matrix quantization," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 8. Piscataway, NJ, USA: IEEE, Apr. 1983, pp. 65-68. [Online]. Available: Http://dx.doi.org/10.1109/icassp.1983.1172244
-
(1983)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.8
, pp. 65-68
-
-
Wong, D.1
Juang, B.2
Cheng, D.3
-
43
-
-
0022084026
-
Matrix quantizer design for LPC speech using the generalized Llyod algorithm
-
Jun. [Online].Available:
-
C. Tsao and R. Gray, "Matrix quantizer design for LPC speech using the generalized Llyod algorithm," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 3, pp. 537-545, Jun. 1985. [Online].Available: Http://dx.doi.org/10.1109/tassp.1985.1164584
-
(1985)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-33
, Issue.3
, pp. 537-545
-
-
Tsao, C.1
Gray, R.2
-
44
-
-
0024075701
-
LPC speech coding based on variablelength segment quantization
-
Sep. [Online]. Available:
-
Y. Shiraki and M. Honda, "LPC speech coding based on variablelength segment quantization," IEEE Trans. Audio, Speech, Lang. Process., vol. 36, no. 9, pp. 1437-1444, Sep. 1988. [Online]. Available: Http://dx.doi.org/10.1109/29.90372
-
(1988)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.36
, Issue.9
, pp. 1437-1444
-
-
Shiraki, Y.1
Honda, M.2
-
45
-
-
84976552353
-
Speech Compression
-
Jun. [Online]. Available:
-
J. Gibson, "Speech Compression," Information, vol. 7, no. 2, p. 32, Jun. 2016. [Online]. Available: Http://dx.doi.org/10.3390/info7020032
-
(2016)
Information
, vol.7
, Issue.2
, pp. 32
-
-
Gibson, J.1
-
46
-
-
84946076199
-
Phonological vocoding using artificial neural networks
-
Apr.
-
M. Cernak, B. Potard, and P. N. Garner, "Phonological vocoding using artificial neural networks," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Apr. 2015, pp. 4844-4848.
-
(2015)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process.
, pp. 4844-4848
-
-
Cernak, M.1
Potard, B.2
Garner, P.N.3
-
47
-
-
80052637232
-
Demodulation as probabilistic inference
-
Nov.
-
R. E. Turner and M. Sahani, "Demodulation as probabilistic inference," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 8, pp. 2398-2411, Nov. 2011.
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.8
, pp. 2398-2411
-
-
Turner, R.E.1
Sahani, M.2
-
48
-
-
84903976036
-
A role for amplitude modulation phase relationships in speech rhythm perception
-
Jul.
-
V. Leong, M. A. Stone, R. E. Turner, and U. Goswami, "A role for amplitude modulation phase relationships in speech rhythm perception." J. Acoust. Soc. Amer., vol. 136, no. 1, pp. 366-381, Jul. 2014.
-
(2014)
J. Acoust. Soc. Amer.
, vol.136
, Issue.1
, pp. 366-381
-
-
Leong, V.1
Stone, M.A.2
Turner, R.E.3
Goswami, U.4
-
49
-
-
23944484420
-
An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex
-
Sep.
-
P. Lakatos, A. S. Shah, K. H. Knuth, I. Ulbert, G. Karmos, and C. E. Schroeder, "An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex." J. Neurophysiol., vol. 94, no. 3, pp. 1904-1911, Sep. 2005.
-
(2005)
J. Neurophysiol.
, vol.94
, Issue.3
, pp. 1904-1911
-
-
Lakatos, P.1
Shah, A.S.2
Knuth, K.H.3
Ulbert, I.4
Karmos, G.5
Schroeder, C.E.6
-
51
-
-
85090774282
-
From discontinuous to continuous F0 modelling in HMM-based speech synthesis
-
K. Yu, B. Thomson, S. Young, and T. Street, "From discontinuous to continuous F0 modelling In HMM-based speech synthesis," in Proc. 7th ISCA Speech Synthesis Workshop, 2010, pp. 94-99.
-
(2010)
Proc. 7th ISCA Speech Synthesis Workshop
, pp. 94-99
-
-
Yu, K.1
Thomson, B.2
Young, S.3
Street, T.4
-
52
-
-
85008023596
-
Continuous F0 modeling for HMM based statistical parametric speech synthesis
-
Jul. [Online]. Available:
-
K. Yu and S. Young, "Continuous F0 modeling for HMM based statistical parametric speech synthesis," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1071-1079, Jul. 2011. [Online]. Available: Http://dx.doi.org/10.1109/tasl.2010.2076805
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.5
, pp. 1071-1079
-
-
Yu, K.1
Young, S.2
-
53
-
-
84902953892
-
Using noisy speech to study the robustness of a continuous F0 modelling method in HMM-based speech synthesis
-
K. U. Ogbureke, J. P. Cabral, and J. Carson-Berndsen, "Using noisy speech to study the robustness of a continuous F0 modelling method in HMM-based speech synthesis," in Proc. Speech Prosody, 2012, pp. 67-70.
-
(2012)
Proc. Speech Prosody
, pp. 67-70
-
-
Ogbureke, K.U.1
Cabral, J.P.2
Carson-Berndsen, J.3
-
54
-
-
84959123110
-
Neuromorphic based oscillatory device for incremental syllable boundary detection
-
Sep.
-
A. Hyafil and M. Cernak, "Neuromorphic based oscillatory device for incremental syllable boundary detection," in Proc. Interspeech, Sep. 2015, pp. 1191-1195.
-
(2015)
Proc. Interspeech
, pp. 1191-1195
-
-
Hyafil, A.1
Cernak, M.2
-
55
-
-
84906268958
-
Syllable-based pitch encoding for low bit rate speech coding with recognition/synthesis architecture
-
Aug.
-
M. Cernak, X. Na, and P. N. Garner, "Syllable-based pitch encoding for low bit rate speech coding with recognition/synthesis architecture," in Proc. Interspeech, Aug. 2013, pp. 3449-3452.
-
(2013)
Proc. Interspeech
, pp. 3449-3452
-
-
Cernak, M.1
Na, X.2
Garner, P.N.3
-
56
-
-
84994246116
-
PhonVoc: A phonetic and phonological vocoding toolkit
-
M. Cernak and P. N. Garner, "PhonVoc: A phonetic and phonological vocoding toolkit," in Proc. Interspeech, 2016.
-
(2016)
Proc. Interspeech
-
-
Cernak, M.1
Garner, P.N.2
-
57
-
-
0012330750
-
The design for the wall street journal-based CSR corpus
-
D. B. Paul and J. M. Baker, "The design for the wall street journal-based CSR corpus," in Proc. Workshop Speech Nat. Lang., 1992, pp. 357-362.
-
(1992)
Proc. Workshop Speech Nat. Lang.
, pp. 357-362
-
-
Paul, D.B.1
Baker, J.M.2
-
58
-
-
6344222337
-
-
Philadelphia, PA, USA: Linguistic Data Consortium
-
J. Garofolo, S. Lamel, L. F. Fisher, W. M. Fiscus, G. Jonathon , and D. S. Pallett, "DARPA TIMIT acoustic-phonetic continuous speech corpus CD-ROM.NIST speech disc 1-1.1.NASA STI/Recon technical report, 93, LDC93S1," Philadelphia, PA, USA: Linguistic Data Consortium, 1993.
-
(1993)
DARPA TIMIT Acoustic-phonetic Continuous Speech Corpus CD-ROM.NIST Speech Disc 1-1.1.NASA STI/Recon Technical Report, 93, LDC93S1
-
-
Garofolo, J.1
Lamel, S.2
Fisher, L.F.3
Fiscus, W.M.4
Jonathon, G.5
Pallett, D.S.6
-
59
-
-
0003571407
-
-
Human Commun. Res. Centre, Univ. of Edinburgh, Tech. Rep.
-
A. Black, P. Taylor, and R. Caley, "The festival speech synthesis system," Human Commun. Res. Centre, Univ. of Edinburgh, Tech. Rep., 1997.
-
(1997)
The Festival Speech Synthesis System
-
-
Black, A.1
Taylor, P.2
Caley, R.3
-
60
-
-
70350498327
-
The HMM-based speech synthesis system version 2.0
-
H. Zen, "The HMM-based speech synthesis system version 2.0," in Proc. 6th ISCA Speech Synthesis Workshop, 2007, pp. 131-136.
-
(2007)
Proc. 6th ISCA Speech Synthesis Workshop
, pp. 131-136
-
-
Zen, H.1
-
61
-
-
85074721580
-
Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project
-
M. Wester, "Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project," in Proc. 7th ISCA Speech Synthesis Workshop, 2010, pp. 192-197.
-
(2010)
Proc. 7th ISCA Speech Synthesis Workshop
, pp. 192-197
-
-
Wester, M.1
-
62
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
Jul.
-
G. E. Hinton, S. Osindero, and Y. W. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, no. 7, pp. 1527-1554, Jul. 2006.
-
(2006)
Neural Comput.
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.W.3
-
64
-
-
84930661557
-
Speech encoding by coupled cortical theta and gamma oscillations
-
May [Online]. Available:
-
A. Hyafil, L. Fontolan, C. Kabdebon, B. Gutkin, A.-L. Giraud, and H. Brownell, "Speech encoding by coupled cortical theta and gamma oscillations," eLife, vol. 2015, no. 4, May 2015, Art. no. e06213. [Online]. Available: Http://dx.doi.org/10.7554/elife.06213
-
(2015)
ELife
, vol.2015
, Issue.4
-
-
Hyafil, A.1
Fontolan, L.2
Kabdebon, C.3
Gutkin, B.4
Giraud, A.-L.5
Brownell, H.6
-
65
-
-
84930614319
-
-
[Online]. Available:
-
W. M. Fisher, tsylb2. 1996. [Online]. Available: Http://www.nist. gov/speech/tools
-
(1996)
Tsylb2
-
-
Fisher, W.M.1
-
67
-
-
0027247004
-
Mel-cepstral distance measure for objective speech quality assessment
-
Piscataway, NJ, USA: IEEE May
-
R. F. Kubichek, "Mel-cepstral distance measure for objective speech quality assessment," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process, vol. 1. Piscataway, NJ, USA: IEEE, May 1993, pp. 125-128.
-
(1993)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.1
, pp. 125-128
-
-
Kubichek, R.F.1
-
69
-
-
85075908665
-
Speech quality assessment
-
J. Benesty,M.M. Sondhi, and Y. Huang, Eds. Berlin, Germany: Springer
-
V. Grancharov andW. B. Kleijn, "Speech quality assessment," in Springer Handbook of Speech Processing, J. Benesty,M.M. Sondhi, and Y. Huang, Eds. Berlin, Germany: Springer, 2008, pp. 83-100.
-
(2008)
Springer Handbook of Speech Processing
, pp. 83-100
-
-
Grancharov, V.1
Kleijn, W.B.2
-
70
-
-
79960916745
-
An algorithm for intelligibility prediction of time-frequency weighted noisy speech
-
[Online]. Available:
-
C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, "An algorithm for intelligibility prediction of time-frequency weighted noisy speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2125-2136, 2011. [Online]. Available: Http://dx.doi.org/10.1109/tasl.2011.2114881
-
(2011)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.19
, Issue.7
, pp. 2125-2136
-
-
Taal, C.H.1
Hendriks, R.C.2
Heusdens, R.3
Jensen, J.4
-
71
-
-
84942636431
-
Speech intelligibility evaluation for mobile phones
-
[Online]. Available:
-
S. Jørgensen, J. Cubick, and T. Dau, "Speech intelligibility evaluation for mobile phones," Acta Acust. United with Acust., vol. 105, pp. 1016-1025. [Online]. Available: Http://dx.doi.org/10.3813/aaa.918896
-
Acta Acust. United with Acust.
, vol.105
, pp. 1016-1025
-
-
Jørgensen, S.1
Cubick, J.2
Dau, T.3
-
72
-
-
78049365405
-
A shorttime objective intelligibility measure for time-frequency weighted noisy speech
-
Mar. [Online]. Available:
-
C. H. Taal, R. C. Hendriks, R. Heusdens, and J. Jensen, "A shorttime objective intelligibility measure for time-frequency weighted noisy speech," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process., Mar. 2010, pp. 4214-4217. [Online]. Available: Http://dx.doi.org/10.1109/icassp.2010.5495701
-
(2010)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process.
, pp. 4214-4217
-
-
Taal, C.H.1
Hendriks, R.C.2
Heusdens, R.3
Jensen, J.4
-
75
-
-
84959104369
-
Compressing deep neural networks using a rank-constrained topology
-
P. Nakkiran, R. Alvarez, R. Prabhavalkar, and C. Parada, "Compressing deep neural networks using a rank-constrained topology," in Proc. Interspeech, 2015, pp. 1473-1477.
-
(2015)
Proc. Interspeech
, pp. 1473-1477
-
-
Nakkiran, P.1
Alvarez, R.2
Prabhavalkar, R.3
Parada, C.4
-
77
-
-
85027574156
-
Small-footprint deep neural networks with highway connections for speech recognition
-
vol. abs/1512.04280 [Online]. Available:
-
L. Lu and S. Renals, "Small-footprint deep neural networks with highway connections for speech recognition," CoRR, vol. abs/1512.04280, 2015. [Online]. Available: Http://arxiv.org/abs/1512.04280
-
(2015)
CoRR
-
-
Lu, L.1
Renals, S.2
-
78
-
-
84960944045
-
Deepear: Robust smartphone audio sensing in unconstrained acoustic environments using deep learning
-
[Online]. Available:
-
N. D. Lane, P. Georgiev, and L. Qendro, "Deepear: Robust smartphone audio sensing in unconstrained acoustic environments using deep learning," in Proc. ACM Int. Joint Conf. Pervasive Ubiquitous Comput., 2015, pp. 283-294. [Online]. Available: Http://doi.acm. org/10.1145/2750858.2804262
-
(2015)
Proc. ACM Int. Joint Conf. Pervasive Ubiquitous Comput.
, pp. 283-294
-
-
Lane, N.D.1
Georgiev, P.2
Qendro, L.3
|