-
1
-
-
33646255447
-
Further intelligibility results from human listening tests using the short-time phase spectrum
-
Alsteris, L.D., Paliwal, K.K., Further intelligibility results from human listening tests using the short-time phase spectrum. Speech Commun. 48:6 (2006), 727–736.
-
(2006)
Speech Commun.
, vol.48
, Issue.6
, pp. 727-736
-
-
Alsteris, L.D.1
Paliwal, K.K.2
-
2
-
-
0018656516
-
Epoch extraction from linear prediction residual for identification of closed glottis interval
-
Ananthapadmanabha, T., Yegnanarayana, B., Epoch extraction from linear prediction residual for identification of closed glottis interval. Acoust. Speech Signal Process. IEEE Trans. 27:4 (1979), 309–319.
-
(1979)
Acoust. Speech Signal Process. IEEE Trans.
, vol.27
, Issue.4
, pp. 309-319
-
-
Ananthapadmanabha, T.1
Yegnanarayana, B.2
-
3
-
-
0003435075
-
Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms
-
Oxford University Press Oxford, UK
-
Bäck, T., Evolutionary Algorithms in Theory and Practice: Evolution Strategies, Evolutionary Programming, Genetic Algorithms. 1996, Oxford University Press, Oxford, UK.
-
(1996)
-
-
Bäck, T.1
-
4
-
-
38749108114
-
Private emotions versus social interaction: a data-driven approach towards analysing emotion in speech
-
Batliner, A., Steidl, S., Hacker, C., Nöth, E., Private emotions versus social interaction: a data-driven approach towards analysing emotion in speech. User Model. User-Adapt. Interact. 18:1–2 (2008), 175–206, 10.1007/s11257-007-9039-4.
-
(2008)
User Model. User-Adapt. Interact.
, vol.18
, Issue.1-2
, pp. 175-206
-
-
Batliner, A.1
Steidl, S.2
Hacker, C.3
Nöth, E.4
-
5
-
-
33846449094
-
Optimal selection of wavelet-packet-based features using genetic algorithm in pathological assessment of patients’ speech signal with unilateral vocal fold paralysis
-
Behroozmand, R., Almasganj, F., Optimal selection of wavelet-packet-based features using genetic algorithm in pathological assessment of patients’ speech signal with unilateral vocal fold paralysis. Comput. Biol. Med. 37:4 (2007), 474–485, 10.1016/j.compbiomed.2006.08.016.
-
(2007)
Comput. Biol. Med.
, vol.37
, Issue.4
, pp. 474-485
-
-
Behroozmand, R.1
Almasganj, F.2
-
6
-
-
67650565075
-
Springer Handbook of Speech Processing
-
Springer Science & Business Media
-
Benesty, J., Sondhi, M.M., Huang, Y., Springer Handbook of Speech Processing. 2008, Springer Science & Business Media.
-
(2008)
-
-
Benesty, J.1
Sondhi, M.M.2
Huang, Y.3
-
7
-
-
0037776118
-
Time Frequency Analysis
-
Gulf Professional Publishing
-
Boashash, B., Time Frequency Analysis. 2003, Gulf Professional Publishing.
-
(2003)
-
-
Boashash, B.1
-
9
-
-
3042691943
-
Empirical mode decomposition of voiced speech signal
-
IEEE
-
Bouzid, A., Ellouze, N., Empirical mode decomposition of voiced speech signal. Control, Communications and Signal Processing, 2004. First International Symposium on, 2004, IEEE, 603–606.
-
(2004)
Control, Communications and Signal Processing, 2004. First International Symposium on
, pp. 603-606
-
-
Bouzid, A.1
Ellouze, N.2
-
10
-
-
38549125582
-
Voiced speech analysis by empirical mode decomposition
-
Springer
-
Bouzid, A., Ellouze, N., Voiced speech analysis by empirical mode decomposition. Advances in Nonlinear Speech Processing, 2007, Springer, 213–220.
-
(2007)
Advances in Nonlinear Speech Processing
, pp. 213-220
-
-
Bouzid, A.1
Ellouze, N.2
-
11
-
-
0027874671
-
Am-fm energy detection and separation in noise using multiband energy operators
-
Bovik, A.C., Maragos, P., Quatieri, T.F., Am-fm energy detection and separation in noise using multiband energy operators. Signal Process. IEEE Trans. 41:12 (1993), 3245–3265.
-
(1993)
Signal Process. IEEE Trans.
, vol.41
, Issue.12
, pp. 3245-3265
-
-
Bovik, A.C.1
Maragos, P.2
Quatieri, T.F.3
-
12
-
-
79953288449
-
Data driven design of filter bank for speech recognition
-
Springer Lecture Notes in Computer Science
-
Burget, L., Heřmanský, H., Data driven design of filter bank for speech recognition. Text, Speech and Dialogue, 2001, Springer, 299–304 Lecture Notes in Computer Science.
-
(2001)
Text, Speech and Dialogue
, pp. 299-304
-
-
Burget, L.1
Heřmanský, H.2
-
13
-
-
33745202280
-
A database of German emotional speech.
-
Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W.F., Weiss, B., A database of German emotional speech. Interspeech, 5, 2005, 1517–1520.
-
(2005)
Interspeech
, vol.5
, pp. 1517-1520
-
-
Burkhardt, F.1
Paeschke, A.2
Rolfes, M.3
Sendlmeier, W.F.4
Weiss, B.5
-
14
-
-
44949251671
-
Data-driven design of front-end filter bank for Lombard speech recognition
-
Pittsburgh, Pennsylvania
-
BǒrilH. and Fousek, P. and Pollák, P., Data-driven design of front-end filter bank for Lombard speech recognition. Proceedings of INTERSPEECH 2006 - ICSLP, 2006, 381–384 Pittsburgh, Pennsylvania.
-
(2006)
Proceedings of INTERSPEECH 2006 - ICSLP
, pp. 381-384
-
-
BǒrilH. and Fousek, P. and Pollák, P.,1
-
15
-
-
67649119677
-
Optimizing feature complementarity by evolution strategy: application to automatic speaker verification
-
Charbuillet, C., Gas, B., Chetouani, M., Zarader, J., Optimizing feature complementarity by evolution strategy: application to automatic speaker verification. Speech Commun. 51:9 (2009), 724–731.
-
(2009)
Speech Commun.
, vol.51
, Issue.9
, pp. 724-731
-
-
Charbuillet, C.1
Gas, B.2
Chetouani, M.3
Zarader, J.4
-
16
-
-
84857276262
-
Emd-based filtering (emdf) of low-frequency noise for speech enhancement
-
Chatlani, N., Soraghan, J.J., Emd-based filtering (emdf) of low-frequency noise for speech enhancement. Audio Speech Lang. Process. IEEE Trans. 20:4 (2012), 1158–1166.
-
(2012)
Audio Speech Lang. Process. IEEE Trans.
, vol.20
, Issue.4
, pp. 1158-1166
-
-
Chatlani, N.1
Soraghan, J.J.2
-
17
-
-
4544254654
-
A technique to improve the empirical mode decomposition in the Hilbert-Huang transform
-
Chen, Y., Feng, M.Q., A technique to improve the empirical mode decomposition in the Hilbert-Huang transform. Earthquake Eng. Eng. Vib. 2:1 (2003), 75–85.
-
(2003)
Earthquake Eng. Eng. Vib.
, vol.2
, Issue.1
, pp. 75-85
-
-
Chen, Y.1
Feng, M.Q.2
-
18
-
-
0003733873
-
Time-Frequency Analysis
-
Prentice Hall PTR Englewood Cliffs, NJ:
-
Cohen, L., Time-Frequency Analysis. 1406, 1995, Prentice Hall PTR Englewood Cliffs, NJ:.
-
(1995)
, vol.1406
-
-
Cohen, L.1
-
19
-
-
0026686048
-
Entropy-based algorithms for best basis selection
-
Coifman, R., Wickerhauser, M.V., Entropy-based algorithms for best basis selection. IEEE Trans. Inf. Theory 38:2 (1992), 713–718.
-
(1992)
IEEE Trans. Inf. Theory
, vol.38
, Issue.2
, pp. 713-718
-
-
Coifman, R.1
Wickerhauser, M.V.2
-
20
-
-
84904579720
-
Improved complete ensemble emd: a suitable tool for biomedical signal processing
-
Colominas, M.A., Schlotthauer, G., Torres, M.E., Improved complete ensemble emd: a suitable tool for biomedical signal processing. Biomed. Signal Process. Control 14 (2014), 19–29.
-
(2014)
Biomed. Signal Process. Control
, vol.14
, pp. 19-29
-
-
Colominas, M.A.1
Schlotthauer, G.2
Torres, M.E.3
-
21
-
-
84933676946
-
An unconstrained optimization approach to empirical mode decomposition
-
Colominas, M.A., Schlotthauer, G., Torres, M.E., An unconstrained optimization approach to empirical mode decomposition. Digit. Signal Process. 40 (2015), 164–175.
-
(2015)
Digit. Signal Process.
, vol.40
, pp. 164-175
-
-
Colominas, M.A.1
Schlotthauer, G.2
Torres, M.E.3
-
23
-
-
57649245616
-
The chains corpus: characterizing individual speakers
-
Cummins, F., Grimaldi, M., Leonard, T., Simko, J., The chains corpus: characterizing individual speakers. Proceedings of SPECOM, 6, 2006, 431–435.
-
(2006)
Proceedings of SPECOM
, vol.6
, pp. 431-435
-
-
Cummins, F.1
Grimaldi, M.2
Leonard, T.3
Simko, J.4
-
24
-
-
85009725897
-
Onditas perceptualmente diseñadas para el reconocimiento automático del habla
-
Rosario, Argentina
-
Dabin, A., Milone, D.H., Rufiner, H.L., Onditas perceptualmente diseñadas para el reconocimiento automático del habla. Proceedings of 7th Argentine Symposium on Artificial Intelligence, 2005, 249–260 Rosario, Argentina.
-
(2005)
Proceedings of 7th Argentine Symposium on Artificial Intelligence
, pp. 249-260
-
-
Dabin, A.1
Milone, D.H.2
Rufiner, H.L.3
-
25
-
-
0031012371
-
Acoustic characteristics of the piriform fossa in models and humans
-
Dang, J., Honda, K., Acoustic characteristics of the piriform fossa in models and humans. J. Acoust. Soc. Am. 101:1 (1997), 456–465.
-
(1997)
J. Acoust. Soc. Am.
, vol.101
, Issue.1
, pp. 456-465
-
-
Dang, J.1
Honda, K.2
-
26
-
-
33646770000
-
The use of a masking signal to improve empirical mode decomposition
-
IEEE
-
Deering, R., Kaiser, J.F., The use of a masking signal to improve empirical mode decomposition. Acoustics, Speech, and Signal Processing, 2005. Proceedings.(ICASSP’05). IEEE International Conference on, 4, 2005, IEEE, iv–485.
-
(2005)
Acoustics, Speech, and Signal Processing, 2005. Proceedings.(ICASSP’05). IEEE International Conference on
, vol.4
, pp. iv-485
-
-
Deering, R.1
Kaiser, J.F.2
-
27
-
-
0003424145
-
Discrete-Time Processing of Speech Signals
-
Macmillan Publishing NewYork
-
Deller, J.R., Proakis, J.G., Hansen, J.H., Discrete-Time Processing of Speech Signals. 1993, Macmillan Publishing, NewYork.
-
(1993)
-
-
Deller, J.R.1
Proakis, J.G.2
Hansen, J.H.3
-
28
-
-
33947677733
-
A database of vocal tract resonance trajectories for research in speech processing
-
IEEE I–I
-
Deng, L., Cui, X., Pruvenok, R., Chen, Y., Momen, S., Alwan, A., A database of vocal tract resonance trajectories for research in speech processing. Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, 1, 2006, IEEE I–I.
-
(2006)
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
, vol.1
-
-
Deng, L.1
Cui, X.2
Pruvenok, R.3
Chen, Y.4
Momen, S.5
Alwan, A.6
-
29
-
-
77949617300
-
Speaker identification based on robust am-fm features
-
IEEE
-
Deshpande, M.S., Holambe, R.S., Speaker identification based on robust am-fm features. Emerging Trends in Engineering and Technology (ICETET), 2009 2nd International Conference on, 2009, IEEE, 880–884.
-
(2009)
Emerging Trends in Engineering and Technology (ICETET), 2009 2nd International Conference on
, pp. 880-884
-
-
Deshpande, M.S.1
Holambe, R.S.2
-
30
-
-
0036298770
-
Modulation features for speech recognition
-
IEEE I–377
-
Dimitriadis, D., Maragos, P., Potamianos, A., Modulation features for speech recognition. Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, 1, 2002, IEEE I–377.
-
(2002)
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
, vol.1
-
-
Dimitriadis, D.1
Maragos, P.2
Potamianos, A.3
-
31
-
-
27644455860
-
Robust am-fm features for speech recognition
-
Dimitriadis, D., Maragos, P., Potamianos, A., Robust am-fm features for speech recognition. Signal Process. Lett. IEEE 12:9 (2005), 621–624.
-
(2005)
Signal Process. Lett. IEEE
, vol.12
, Issue.9
, pp. 621-624
-
-
Dimitriadis, D.1
Maragos, P.2
Potamianos, A.3
-
32
-
-
70450198169
-
Glottal closure and opening instant detection from speech signals.
-
Drugman, T., Dutoit, T., Glottal closure and opening instant detection from speech signals. Interspeech, 2009, 2891–2894.
-
(2009)
Interspeech
, pp. 2891-2894
-
-
Drugman, T.1
Dutoit, T.2
-
33
-
-
84863419425
-
Detection of glottal closure instants from speech signals: a quantitative review
-
Drugman, T., Thomas, M., Gudnason, J., Naylor, P., Dutoit, T., Detection of glottal closure instants from speech signals: a quantitative review. Audio Speech Lang. Process. IEEE Trans. 20:3 (2012), 994–1006.
-
(2012)
Audio Speech Lang. Process. IEEE Trans.
, vol.20
, Issue.3
, pp. 994-1006
-
-
Drugman, T.1
Thomas, M.2
Gudnason, J.3
Naylor, P.4
Dutoit, T.5
-
34
-
-
0037363455
-
Approximations with evolutionary pursuit
-
Ferreira da Silva, A.R., Approximations with evolutionary pursuit. Signal Process. 83:3 (2003), 465–481.
-
(2003)
Signal Process.
, vol.83
, Issue.3
, pp. 465-481
-
-
Ferreira da Silva, A.R.1
-
36
-
-
23344453279
-
Empirical mode decompositions as data-driven wavelet-like expansions
-
Flandrin, P., Goncalves, P., Empirical mode decompositions as data-driven wavelet-like expansions. Int. J. Wavelets Multiresolution Inf. Process. 2:04 (2004), 477–496.
-
(2004)
Int. J. Wavelets Multiresolution Inf. Process.
, vol.2
, Issue.4
, pp. 477-496
-
-
Flandrin, P.1
Goncalves, P.2
-
37
-
-
85115671775
-
Emd equivalent filter banks, from interpretation to applications
-
Flandrin, P., Gonçalves, P., Rilling, G., Emd equivalent filter banks, from interpretation to applications. Hilbert-Huang TransformAppl., 2005, 57–74.
-
(2005)
Hilbert-Huang TransformAppl.
, pp. 57-74
-
-
Flandrin, P.1
Gonçalves, P.2
Rilling, G.3
-
38
-
-
34547520971
-
Detrending and Denoising with Empirical Mode Decompositions
-
Citeseer
-
Flandrin, P., Gonçalves, P., Rilling, G., et al. Detrending and Denoising with Empirical Mode Decompositions. 2004, Citeseer.
-
(2004)
-
-
Flandrin, P.1
Gonçalves, P.2
Rilling, G.3
-
39
-
-
0442326792
-
Empirical mode decomposition as a filter bank
-
Flandrin, P., Rilling, G., Goncalves, P., Empirical mode decomposition as a filter bank. Signal Process. Lett. IEEE 11:2 (2004), 112–114.
-
(2004)
Signal Process. Lett. IEEE
, vol.11
, Issue.2
, pp. 112-114
-
-
Flandrin, P.1
Rilling, G.2
Goncalves, P.3
-
40
-
-
47049116566
-
Comparative evaluation of various MFCC implementations on the speaker verification task
-
Ganchev, T., Fakotakis, N., Kokkinakis, G., Comparative evaluation of various MFCC implementations on the speaker verification task. Proceedings of the SPECOM, 1, 2005, 191–194.
-
(2005)
Proceedings of the SPECOM
, vol.1
, pp. 191-194
-
-
Ganchev, T.1
Fakotakis, N.2
Kokkinakis, G.3
-
41
-
-
0003548585
-
DARPA TIMIT acoustic phonetic continuous speech corpus CDROM
-
Garofolo, J.S., Lamel, L.F., Fisher, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L., DARPA TIMIT acoustic phonetic continuous speech corpus CDROM. 1993.
-
(1993)
-
-
Garofolo, J.S.1
Lamel, L.F.2
Fisher, W.M.3
Fiscus, J.G.4
Pallett, D.S.5
Dahlgren, N.L.6
-
42
-
-
66149120614
-
Speaker identification using instantaneous frequencies
-
Grimaldi, M., Cummins, F., Speaker identification using instantaneous frequencies. Audio Speech Lang. Process. IEEE Trans. 16:6 (2008), 1097–1111.
-
(2008)
Audio Speech Lang. Process. IEEE Trans.
, vol.16
, Issue.6
, pp. 1097-1111
-
-
Grimaldi, M.1
Cummins, F.2
-
43
-
-
84888271524
-
Speech denoising based on empirical mode decomposition and improved thresholding
-
Springer
-
Hadhami, I., Bouzid, A., Speech denoising based on empirical mode decomposition and improved thresholding. Advances in Nonlinear Speech Processing, 2013, Springer, 200–207.
-
(2013)
Advances in Nonlinear Speech Processing
, pp. 200-207
-
-
Hadhami, I.1
Bouzid, A.2
-
44
-
-
79956708123
-
Speech Production and Speech Modelling
-
Springer Science & Business Media
-
Hardcastle, W.J., Marchal, A., Speech Production and Speech Modelling. 55, 1990, Springer Science & Business Media.
-
(1990)
, vol.55
-
-
Hardcastle, W.J.1
Marchal, A.2
-
45
-
-
84865743286
-
Robust speaker recognition in non-stationary room environments based on empirical mode decomposition.
-
Hasan, T., Hansen, J.H., Robust speaker recognition in non-stationary room environments based on empirical mode decomposition. INTERSPEECH, 2011, 2733–2736.
-
(2011)
INTERSPEECH
, pp. 2733-2736
-
-
Hasan, T.1
Hansen, J.H.2
-
46
-
-
58049207757
-
Suppression of residual noise from speech signals using empirical mode decomposition
-
Hasan, T., Hasan, M.K., Suppression of residual noise from speech signals using empirical mode decomposition. Signal Process. Lett. IEEE 16:1 (2009), 2–5.
-
(2009)
Signal Process. Lett. IEEE
, vol.16
, Issue.1
, pp. 2-5
-
-
Hasan, T.1
Hasan, M.K.2
-
47
-
-
0028996912
-
Text-dependent speaker recognition using the information in the higher frequency band
-
IEEE
-
Hayakawa, S., Itakura, F., Text-dependent speaker recognition using the information in the higher frequency band. Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on, 1, 1994, IEEE, I–137.
-
(1994)
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
, vol.1
, pp. I-137
-
-
Hayakawa, S.1
Itakura, F.2
-
48
-
-
79952707334
-
Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech
-
He, L., Lech, M., Maddage, N.C., Allen, N.B., Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech. Biomed. Signal Process. Control 6:2 (2011), 139–146.
-
(2011)
Biomed. Signal Process. Control
, vol.6
, Issue.2
, pp. 139-146
-
-
He, L.1
Lech, M.2
Maddage, N.C.3
Allen, N.B.4
-
49
-
-
84979784313
-
Advances in Non-Linear Modeling for Speech Processing
-
Springer Science & Business Media
-
Holambe, R.S., Deshpande, M.S., Advances in Non-Linear Modeling for Speech Processing. 2012, Springer Science & Business Media.
-
(2012)
-
-
Holambe, R.S.1
Deshpande, M.S.2
-
50
-
-
77954686615
-
Visualisation of hypopharyngeal cavities and vocal-tract acoustic modelling
-
Honda, K., Kitamura, T., Takemoto, H., Adachi, S., Mokhtari, P., Takano, S., Nota, Y., Hirata, H., Fujimoto, I., Shimada, Y., et al. Visualisation of hypopharyngeal cavities and vocal-tract acoustic modelling. Comput. Methods Biomech. Biomed. Eng. 13:4 (2010), 443–453.
-
(2010)
Comput. Methods Biomech. Biomed. Eng.
, vol.13
, Issue.4
, pp. 443-453
-
-
Honda, K.1
Kitamura, T.2
Takemoto, H.3
Adachi, S.4
Mokhtari, P.5
Takano, S.6
Nota, Y.7
Hirata, H.8
Fujimoto, I.9
Shimada, Y.10
-
51
-
-
84893241518
-
Feature normalization using MVAW processing for spoken language recognition
-
Huang, C.-L., Matsuda, S., Hori, C., Feature normalization using MVAW processing for spoken language recognition. Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific, 2013, 1–4, 10.1109/APSIPA.2013.6694104.
-
(2013)
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
, pp. 1-4
-
-
Huang, C.-L.1
Matsuda, S.2
Hori, C.3
-
52
-
-
33846169080
-
Speech formant frequency estimation based on Hilbert-Huang transform
-
Huang, H., Chen, X.-x., Speech formant frequency estimation based on Hilbert-Huang transform. J.-ZHEJIANG Univ. Eng. Sci., 40(11), 2006, 1926.
-
(2006)
J.-ZHEJIANG Univ. Eng. Sci.
, vol.40
, Issue.11
, pp. 1926
-
-
Huang, H.1
Chen, X.-X.2
-
53
-
-
32644438199
-
Speech pitch determination based on Hilbert-Huang transform
-
Huang, H., Pan, J., Speech pitch determination based on Hilbert-Huang transform. Signal Process. 86:4 (2006), 792–803.
-
(2006)
Signal Process.
, vol.86
, Issue.4
, pp. 792-803
-
-
Huang, H.1
Pan, J.2
-
54
-
-
85009732552
-
Empirical mode decomposition and Hilbert spectral analysis
-
Huang, N.E., Empirical mode decomposition and Hilbert spectral analysis. 1998.
-
(1998)
-
-
Huang, N.E.1
-
55
-
-
85115665605
-
Hilbert-Huang Transform and Its Applications
-
World Scientific
-
Huang, N.E., Shen, S.S., Hilbert-Huang Transform and Its Applications. 5, 2005, World Scientific.
-
(2005)
, vol.5
-
-
Huang, N.E.1
Shen, S.S.2
-
56
-
-
5444236478
-
The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis
-
Huang, N.E., Shen, Z., Long, S.R., Wu, M.C., Shih, H.H., Zheng, Q., Yen, N.-C., Tung, C.C., Liu, H.H., The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. London. Ser.A 454:1971 (1998), 903–995.
-
(1998)
Proc. R. Soc. London. Ser.A
, vol.454
, Issue.1971
, pp. 903-995
-
-
Huang, N.E.1
Shen, Z.2
Long, S.R.3
Wu, M.C.4
Shih, H.H.5
Zheng, Q.6
Yen, N.-C.7
Tung, C.C.8
Liu, H.H.9
-
57
-
-
1542357546
-
A confidence limit for the empirical mode decomposition and Hilbert spectral analysis
-
Huang, N.E., Wu, M.-L.C., Long, S.R., Shen, S.S., Qu, W., Gloersen, P., Fan, K.L., A confidence limit for the empirical mode decomposition and Hilbert spectral analysis. Proc. R. Soc. London. Ser.A 459:2037 (2003), 2317–2345.
-
(2003)
Proc. R. Soc. London. Ser.A
, vol.459
, Issue.2037
, pp. 2317-2345
-
-
Huang, N.E.1
Wu, M.-L.C.2
Long, S.R.3
Shen, S.S.4
Qu, W.5
Gloersen, P.6
Fan, K.L.7
-
58
-
-
85009773294
-
Empirical mode decomposition for advanced speech signal processing
-
Islam Molla, M.K., Das, S., Hamid, M.E., Hirose, K., Empirical mode decomposition for advanced speech signal processing. J. Signal Process. 17:6 (2013), 215–229.
-
(2013)
J. Signal Process.
, vol.17
, Issue.6
, pp. 215-229
-
-
Islam Molla, M.K.1
Das, S.2
Hamid, M.E.3
Hirose, K.4
-
59
-
-
0033328948
-
Teager energy based feature parameters for speech recognition in car noise
-
Jabloun, F., Cetin, A.E., Erzin, E., Teager energy based feature parameters for speech recognition in car noise. Signal Process. Lett. IEEE 6:10 (1999), 259–261.
-
(1999)
Signal Process. Lett. IEEE
, vol.6
, Issue.10
, pp. 259-261
-
-
Jabloun, F.1
Cetin, A.E.2
Erzin, E.3
-
60
-
-
0028996918
-
Measuring fine structure in speech: application to speaker identification
-
IEEE
-
Jankowski Jr, C., Quatieri, T., Reynolds, D., Measuring fine structure in speech: application to speaker identification. Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on, 1, 1995, IEEE, 325–328.
-
(1995)
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
, vol.1
, pp. 325-328
-
-
Jankowski Jr, C.1
Quatieri, T.2
Reynolds, D.3
-
61
-
-
84979272271
-
Classification of environmental background noise sources using Hilbert-Huang transform
-
Jhanwar, D., Sharma, K.K., Modani, S., Classification of environmental background noise sources using Hilbert-Huang transform. Int. J. Signal Process. Syst., 1, 2013.
-
(2013)
Int. J. Signal Process. Syst.
, vol.1
-
-
Jhanwar, D.1
Sharma, K.K.2
Modani, S.3
-
62
-
-
79959823356
-
A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification
-
Jung, C.-S., Han, K.J., Seo, H., Narayanan, S.S., Kang, H.-G., A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification. Eleventh Annual Conference of the International Speech Communication Association, 2010.
-
(2010)
Eleventh Annual Conference of the International Speech Communication Association
-
-
Jung, C.-S.1
Han, K.J.2
Seo, H.3
Narayanan, S.S.4
Kang, H.-G.5
-
63
-
-
0001059592
-
Some observations on vocal tract operation from a fluid flow point of view
-
Kaiser, J.F., Some observations on vocal tract operation from a fluid flow point of view. Vocal Fold Physiol., 1983, 358–386.
-
(1983)
Vocal Fold Physiol.
, pp. 358-386
-
-
Kaiser, J.F.1
-
64
-
-
0025635254
-
On a simple algorithm to calculate the energy'of a signal
-
Kaiser, J.F., On a simple algorithm to calculate the energy'of a signal. Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, 1990, 381–384.
-
(1990)
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
, pp. 381-384
-
-
Kaiser, J.F.1
-
65
-
-
84879074231
-
Pathological speech signal analysis and classification using empirical mode decomposition
-
Kaleem, M., Ghoraani, B., Guergachi, A., Krishnan, S., Pathological speech signal analysis and classification using empirical mode decomposition. Med. Biol. Eng. Comput. 51:7 (2013), 811–821.
-
(2013)
Med. Biol. Eng. Comput.
, vol.51
, Issue.7
, pp. 811-821
-
-
Kaleem, M.1
Ghoraani, B.2
Guergachi, A.3
Krishnan, S.4
-
66
-
-
84929094020
-
Speech vs music discrimination using empirical mode decomposition
-
IEEE
-
Khonglah, B.K., Sharma, R., Mahadeva Prasanna, S., Speech vs music discrimination using empirical mode decomposition. Communications (NCC), 2015 Twenty First National Conference on, 2015, IEEE, 1–6.
-
(2015)
Communications (NCC), 2015 Twenty First National Conference on
, pp. 1-6
-
-
Khonglah, B.K.1
Sharma, R.2
Mahadeva Prasanna, S.3
-
67
-
-
12844282873
-
Individual variation of the hypopharyngeal cavities and its acoustic effects
-
Kitamura, T., Honda, K., Takemoto, H., Individual variation of the hypopharyngeal cavities and its acoustic effects. Acoust. Sci. Technol. 26:1 (2005), 16–26.
-
(2005)
Acoust. Sci. Technol.
, vol.26
, Issue.1
, pp. 16-26
-
-
Kitamura, T.1
Honda, K.2
Takemoto, H.3
-
68
-
-
43949145296
-
Improved emd using doubly-iterative sifting and high order spline interpolation
-
Kopsinis, Y., McLaughlin, S., Improved emd using doubly-iterative sifting and high order spline interpolation. EURASIP J. Adv. Signal Process., 2008, 2008, 120.
-
(2008)
EURASIP J. Adv. Signal Process.
, vol.2008
, pp. 120
-
-
Kopsinis, Y.1
McLaughlin, S.2
-
69
-
-
63449122839
-
Development of emd-based denoising methods inspired by wavelet thresholding
-
Kopsinis, Y., McLaughlin, S., Development of emd-based denoising methods inspired by wavelet thresholding. Signal Process. IEEE Trans. 57:4 (2009), 1351–1362.
-
(2009)
Signal Process. IEEE Trans.
, vol.57
, Issue.4
, pp. 1351-1362
-
-
Kopsinis, Y.1
McLaughlin, S.2
-
70
-
-
79955928904
-
Speech emotion recognition using novel hht-teo based features
-
Li, X., Li, X., Speech emotion recognition using novel hht-teo based features. J. Comput. 6:5 (2011), 989–998.
-
(2011)
J. Comput.
, vol.6
, Issue.5
, pp. 989-998
-
-
Li, X.1
Li, X.2
-
71
-
-
34547545161
-
Physiological feature extraction for text independent speaker identification using non-uniform subband processing
-
IEEE
-
Lu, X., Dang, J., Physiological feature extraction for text independent speaker identification using non-uniform subband processing. Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, 4, 2007, IEEE, IV–461.
-
(2007)
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
, vol.4
, pp. IV-461
-
-
Lu, X.1
Dang, J.2
-
72
-
-
40249090511
-
An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification
-
Lu, X., Dang, J., An investigation of dependencies between frequency components and speaker characteristics for text-independent speaker identification. Speech Commun. 50:4 (2008), 312–322.
-
(2008)
Speech Commun.
, vol.50
, Issue.4
, pp. 312-322
-
-
Lu, X.1
Dang, J.2
-
73
-
-
77449135882
-
Speech endpoint detection in strong noisy environment based on the Hilbert-Huang transform
-
IEEE
-
Lu, Z., Liu, B., Shen, L., Speech endpoint detection in strong noisy environment based on the Hilbert-Huang transform. Mechatronics and Automation, 2009. ICMA 2009. International Conference on, 2009, IEEE, 4322–4326.
-
(2009)
Mechatronics and Automation, 2009. ICMA 2009. International Conference on
, pp. 4322-4326
-
-
Lu, Z.1
Liu, B.2
Shen, L.3
-
74
-
-
84996728830
-
On separating amplitude from frequency modulations using energy operators
-
IEEE
-
Maragos, P., Kaiser, J.F., Quatieri, T.F., On separating amplitude from frequency modulations using energy operators. Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on, 2, 1992, IEEE, 1–4.
-
(1992)
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
, vol.2
, pp. 1-4
-
-
Maragos, P.1
Kaiser, J.F.2
Quatieri, T.F.3
-
75
-
-
0027676955
-
Energy separation in signal modulations with application to speech analysis
-
Maragos, P., Kaiser, J.F., Quatieri, T.F., Energy separation in signal modulations with application to speech analysis. Signal Process. IEEE Trans. 41:10 (1993), 3024–3051.
-
(1993)
Signal Process. IEEE Trans.
, vol.41
, Issue.10
, pp. 3024-3051
-
-
Maragos, P.1
Kaiser, J.F.2
Quatieri, T.F.3
-
76
-
-
0028460895
-
Comparison of text-independent speaker recognition methods using vq-distortion and discrete/continuous hmm's
-
Matsui, T., Furui, S., Comparison of text-independent speaker recognition methods using vq-distortion and discrete/continuous hmm's. Speech Audio Process. IEEE Trans. 2:3 (1994), 456–459.
-
(1994)
Speech Audio Process. IEEE Trans.
, vol.2
, Issue.3
, pp. 456-459
-
-
Matsui, T.1
Furui, S.2
-
77
-
-
84863772450
-
Speech analysis/synthesis based on a sinusoidal representation
-
McAulay, R., Quatieri, T.F., Speech analysis/synthesis based on a sinusoidal representation. Acoust. Speech Signal Process. IEEE Trans. 34:4 (1986), 744–754.
-
(1986)
Acoust. Speech Signal Process. IEEE Trans.
, vol.34
, Issue.4
, pp. 744-754
-
-
McAulay, R.1
Quatieri, T.F.2
-
78
-
-
84873634448
-
Nonlinear methods for speech analysis and synthesis
-
McLaughlin, S., Maragos, P., Nonlinear methods for speech analysis and synthesis. Adv.Nonlinear SignalImage Process., 6, 2006, 103.
-
(2006)
Adv.Nonlinear SignalImage Process.
, vol.6
, pp. 103
-
-
McLaughlin, S.1
Maragos, P.2
-
79
-
-
79952279619
-
Assessment of pain expression in infant cry signals using empirical mode decomposition
-
Mijovic, B., Silva, M., Van den Bergh, B., Allegaert, K., Aerts, J.-M., Berckmans, D., Van Huffel, S., et al. Assessment of pain expression in infant cry signals using empirical mode decomposition. Methods Inf. Med. 49:5 (2010), 448–452.
-
(2010)
Methods Inf. Med.
, vol.49
, Issue.5
, pp. 448-452
-
-
Mijovic, B.1
Silva, M.2
Van den Bergh, B.3
Allegaert, K.4
Aerts, J.-M.5
Berckmans, D.6
Van Huffel, S.7
-
80
-
-
84867211310
-
Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model.
-
Molla, M.K.I., Hirose, K., Minematsu, N., Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model. INTERSPEECH, 2008, 2530–2533.
-
(2008)
INTERSPEECH
, pp. 2530-2533
-
-
Molla, M.K.I.1
Hirose, K.2
Minematsu, N.3
-
81
-
-
85032750831
-
Auditory perception and cognition
-
Munkong, R., Juang, B.-H., Auditory perception and cognition. Signal Process. Mag. IEEE 25:3 (2008), 98–117, 10.1109/MSP.2008.918418.
-
(2008)
Signal Process. Mag. IEEE
, vol.25
, Issue.3
, pp. 98-117
-
-
Munkong, R.1
Juang, B.-H.2
-
82
-
-
0037751491
-
-
PhD dissertation, Department of Computer Science and Engineering, Indian Institute of Technology, Madras, India Ph.D. thesis
-
Murthy, H.A., Algorithms for Processing Fourier Transform Phase of Signals, 1992, PhD dissertation, Department of Computer Science and Engineering, Indian Institute of Technology, Madras, India Ph.D. thesis.
-
(1992)
Algorithms for Processing Fourier Transform Phase of Signals
-
-
Murthy, H.A.1
-
83
-
-
0024681756
-
Effectiveness of representation of signals through group delay functions
-
Murthy, K.M., Yegnanarayana, B., Effectiveness of representation of signals through group delay functions. Signal Process. 17:2 (1989), 141–150.
-
(1989)
Signal Process.
, vol.17
, Issue.2
, pp. 141-150
-
-
Murthy, K.M.1
Yegnanarayana, B.2
-
84
-
-
65249091627
-
Epoch extraction from speech signals
-
Murty, K.S.R., Yegnanarayana, B., Epoch extraction from speech signals. Audio Speech Lang. Process. IEEE Trans. 16:8 (2008), 1602–1613.
-
(2008)
Audio Speech Lang. Process. IEEE Trans.
, vol.16
, Issue.8
, pp. 1602-1613
-
-
Murty, K.S.R.1
Yegnanarayana, B.2
-
85
-
-
41049089736
-
Estimation of glottal closure instants in voiced speech using the dypsa algorithm
-
Naylor, P.A., Kounoudes, A., Gudnason, J., Brookes, M., Estimation of glottal closure instants in voiced speech using the dypsa algorithm. Audio Speech Lang. Process. IEEE Trans. 15:1 (2007), 34–43.
-
(2007)
Audio Speech Lang. Process. IEEE Trans.
, vol.15
, Issue.1
, pp. 34-43
-
-
Naylor, P.A.1
Kounoudes, A.2
Gudnason, J.3
Brookes, M.4
-
86
-
-
84899710850
-
A new approach of audio emotion recognition
-
Ooi, C.S., Seng, K.P., Ang, L.-M., Chew, L.W., A new approach of audio emotion recognition. Expert Syst. Appl. 41:13 (2014), 5858–5869, 10.1016/j.eswa.2014.03.026.
-
(2014)
Expert Syst. Appl.
, vol.41
, Issue.13
, pp. 5858-5869
-
-
Ooi, C.S.1
Seng, K.P.2
Ang, L.-M.3
Chew, L.W.4
-
87
-
-
85009100883
-
Usefulness of phase spectrum in human speech perception.
-
Paliwal, K.K., Alsteris, L.D., Usefulness of phase spectrum in human speech perception. INTERSPEECH, 2003.
-
(2003)
INTERSPEECH
-
-
Paliwal, K.K.1
Alsteris, L.D.2
-
88
-
-
13544259544
-
On the usefulness of stft phase spectrum in human listening tests
-
Paliwal, K.K., Alsteris, L.D., On the usefulness of stft phase spectrum in human listening tests. Speech Commun. 45:2 (2005), 153–170.
-
(2005)
Speech Commun.
, vol.45
, Issue.2
, pp. 153-170
-
-
Paliwal, K.K.1
Alsteris, L.D.2
-
89
-
-
78049294305
-
Adaptive am–fm signal decomposition with application to speech analysis
-
Pantazis, Y., Rosec, O., Stylianou, Y., Adaptive am–fm signal decomposition with application to speech analysis. IEEE Trans. Audio Speech Lang. Process. 19:2 (2011), 290–300.
-
(2011)
IEEE Trans. Audio Speech Lang. Process.
, vol.19
, Issue.2
, pp. 290-300
-
-
Pantazis, Y.1
Rosec, O.2
Stylianou, Y.3
-
90
-
-
0141626061
-
The wavelet tutorial
-
Polikar, R., The wavelet tutorial. 1996.
-
(1996)
-
-
Polikar, R.1
-
91
-
-
0030008906
-
Speech formant frequency and bandwidth tracking using multiband energy demodulation
-
Potamianos, A., Maragos, P., Speech formant frequency and bandwidth tracking using multiband energy demodulation. J. Acoust. Soc. Am. 99:6 (1996), 3795–3806.
-
(1996)
J. Acoust. Soc. Am.
, vol.99
, Issue.6
, pp. 3795-3806
-
-
Potamianos, A.1
Maragos, P.2
-
92
-
-
84887301670
-
Epoch extraction based on integrated linear prediction residual using plosion index
-
Prathosh, A., Ananthapadmanabha, T., Ramakrishnan, A., Epoch extraction based on integrated linear prediction residual using plosion index. Audio Speech Lang. Process. IEEE Trans. 21:12 (2013), 2471–2480.
-
(2013)
Audio Speech Lang. Process. IEEE Trans.
, vol.21
, Issue.12
, pp. 2471-2480
-
-
Prathosh, A.1
Ananthapadmanabha, T.2
Ramakrishnan, A.3
-
93
-
-
84893027328
-
A bag-of-tones model with MFCC features for musical genre classification
-
Motoda H. Wu Z. Cao L. Zaiane O. Yao M. Wang W. Springer Berlin Heidelberg
-
Qin, Z., Liu, W., Wan, T., A bag-of-tones model with MFCC features for musical genre classification. Motoda, H., Wu, Z., Cao, L., Zaiane, O., Yao, M., Wang, W., (eds.) Advanced Data Mining and Applications Lecture Notes in Computer Science, 8346, 2013, Springer Berlin Heidelberg, 564–575, 10.1007/978-3-642-53914-5_48.
-
(2013)
Advanced Data Mining and Applications, Lecture Notes in Computer Science
, vol.8346
, pp. 564-575
-
-
Qin, Z.1
Liu, W.2
Wan, T.3
-
94
-
-
0031238152
-
Am-fm separation using auditory-motivated filters
-
Quatieri, T.F., Hanna, T.E., O'Leary, G.C., Am-fm separation using auditory-motivated filters. IEEE Trans.SpeechAudio Process. 5:5 (1997), 465–480.
-
(1997)
IEEE Trans.SpeechAudio Process.
, vol.5
, Issue.5
, pp. 465-480
-
-
Quatieri, T.F.1
Hanna, T.E.2
O'Leary, G.C.3
-
95
-
-
0003425258
-
Digital Processing of Speech Signals
-
Prentice-hall Englewood Cliffs
-
Rabiner, L.R., Schafer, R.W., Digital Processing of Speech Signals. 100, 1978, Prentice-hall Englewood Cliffs.
-
(1978)
, vol.100
-
-
Rabiner, L.R.1
Schafer, R.W.2
-
96
-
-
76249085823
-
Introduction to digital speech processing
-
Rabiner, L.R., Schafer, R.W., Introduction to digital speech processing. Found.TrendsSignal Process. 1:1 (2007), 1–194.
-
(2007)
Found.TrendsSignal Process.
, vol.1
, Issue.1
, pp. 1-194
-
-
Rabiner, L.R.1
Schafer, R.W.2
-
97
-
-
84885066718
-
Hierarchical clustering and classification of emotions in human speech using confusion matrices
-
Springer
-
Reyes-Vargas, M., Sánchez-Gutiérrez, M., Rufiner, L., Albornoz, M., Vignolo, L., Martínez-Licona, F., Goddard-Close, J., Hierarchical clustering and classification of emotions in human speech using confusion matrices. Lecture Notes in Artificial Intelligence, 8113, 2013, Springer, 162–169.
-
(2013)
Lecture Notes in Artificial Intelligence
, vol.8113
, pp. 162-169
-
-
Reyes-Vargas, M.1
Sánchez-Gutiérrez, M.2
Rufiner, L.3
Albornoz, M.4
Vignolo, L.5
Martínez-Licona, F.6
Goddard-Close, J.7
-
98
-
-
84876432523
-
-
PhD thesis, Ecole normale supérieure de Lyon Ph.D. thesis
-
Rilling, G., Décompositions modales empiriques, 2007, PhD thesis, Ecole normale supérieure de Lyon Ph.D. thesis.
-
(2007)
Décompositions modales empiriques
-
-
Rilling, G.1
-
99
-
-
33947638915
-
On the influence of sampling on the empirical mode decomposition.
-
Rilling, G., Flandrin, P., On the influence of sampling on the empirical mode decomposition. ICASSP (3), 2006, 444–447.
-
(2006)
ICASSP (3)
, pp. 444-447
-
-
Rilling, G.1
Flandrin, P.2
-
100
-
-
85008018510
-
One or two frequencies? The empirical mode decomposition answers
-
Rilling, G., Flandrin, P., One or two frequencies? The empirical mode decomposition answers. Signal Process. IEEE Trans. 56:1 (2008), 85–95.
-
(2008)
Signal Process. IEEE Trans.
, vol.56
, Issue.1
, pp. 85-95
-
-
Rilling, G.1
Flandrin, P.2
-
101
-
-
33646819710
-
Empirical mode decomposition, fractional Gaussian noise and Hurst exponent estimation.
-
Rilling, G., Flandrin, P., Gonçalves, P., Empirical mode decomposition, fractional Gaussian noise and Hurst exponent estimation. ICASSP (4), 2005, 489–492.
-
(2005)
ICASSP (4)
, pp. 489-492
-
-
Rilling, G.1
Flandrin, P.2
Gonçalves, P.3
-
102
-
-
1642457413
-
On empirical mode decomposition and its algorithms
-
NSIP-03, Grado (I)
-
Rilling, G., Flandrin, P., Goncalves, P., et al. On empirical mode decomposition and its algorithms. IEEE-EURASIP Workshop on Nonlinear Signal and Image Processing, 3, 2003, NSIP-03, Grado (I), 8–11.
-
(2003)
IEEE-EURASIP Workshop on Nonlinear Signal and Image Processing
, vol.3
, pp. 8-11
-
-
Rilling, G.1
Flandrin, P.2
Goncalves, P.3
-
103
-
-
0031361032
-
A method of wavelet selection in phoneme recognition
-
Rufiner, H., Goddard, J., A method of wavelet selection in phoneme recognition. Proceedings of the 40th Midwest Symposium on Circuits and Systems, 2, 1997, 889–891.
-
(1997)
Proceedings of the 40th Midwest Symposium on Circuits and Systems
, vol.2
, pp. 889-891
-
-
Rufiner, H.1
Goddard, J.2
-
104
-
-
84877703493
-
Wavelet adaptation for automatic voice disorders sorting
-
Saeedi, N.E., Almasganj, F., Wavelet adaptation for automatic voice disorders sorting. Comput. Biol. Med. 43:6 (2013), 699–704, 10.1016/j.compbiomed.2013.03.006.
-
(2013)
Comput. Biol. Med.
, vol.43
, Issue.6
, pp. 699-704
-
-
Saeedi, N.E.1
Almasganj, F.2
-
105
-
-
84872166152
-
A novel windowing technique for efficient computation of MFCC for speaker recognition
-
Sahidullah, M., Saha, G., A novel windowing technique for efficient computation of MFCC for speaker recognition. Signal Process. Lett. IEEE 20:2 (2013), 149–152, 10.1109/LSP.2012.2235067.
-
(2013)
Signal Process. Lett. IEEE
, vol.20
, Issue.2
, pp. 149-152
-
-
Sahidullah, M.1
Saha, G.2
-
106
-
-
84893688455
-
Learning filter banks within a deep neural network framework
-
Sainath, T., Kingsbury, B., Mohamed, A.-R., Ramabhadran, B., Learning filter banks within a deep neural network framework. Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on, 2013, 297–302, 10.1109/ASRU.2013.6707746.
-
(2013)
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
, pp. 297-302
-
-
Sainath, T.1
Kingsbury, B.2
Mohamed, A.-R.3
Ramabhadran, B.4
-
107
-
-
0000453879
-
Local discriminant bases and their applications
-
Saito, N., Coifman, R., Local discriminant bases and their applications. J. Math. Imaging Vis. 5:4 (1995), 337–358, 10.1007/BF01250288.
-
(1995)
J. Math. Imaging Vis.
, vol.5
, Issue.4
, pp. 337-358
-
-
Saito, N.1
Coifman, R.2
-
108
-
-
77950139087
-
Voice fundamental frequency extraction algorithm based on ensemble empirical mode decomposition and entropies
-
Springer
-
Schlotthauer, G., Torres, M., Rufiner, H., Voice fundamental frequency extraction algorithm based on ensemble empirical mode decomposition and entropies. World Congress on Medical Physics and Biomedical Engineering, September 7–12, 2009, Munich, Germany, 2010, Springer, 984–987.
-
(2010)
World Congress on Medical Physics and Biomedical Engineering, September 7–12, 2009, Munich, Germany
, pp. 984-987
-
-
Schlotthauer, G.1
Torres, M.2
Rufiner, H.3
-
109
-
-
77952083041
-
A new algorithm for instantaneous f0 speech extraction based on ensemble empirical mode decomposition
-
Schlotthauer, G., Torres, M.E., Rufiner, H.L., A new algorithm for instantaneous f0 speech extraction based on ensemble empirical mode decomposition. Proceedings of 17th European Signal Procesing Conference, 2009, 2347–2351.
-
(2009)
Proceedings of 17th European Signal Procesing Conference
, pp. 2347-2351
-
-
Schlotthauer, G.1
Torres, M.E.2
Rufiner, H.L.3
-
110
-
-
77952023058
-
Pathological voice analysis and classification based on empirical mode decomposition
-
Springer
-
Schlotthauer, G., Torres, M.E., Rufiner, H.L., Pathological voice analysis and classification based on empirical mode decomposition. Development of Multimodal Interfaces: Active Listening and Synchrony, 2010, Springer, 364–381.
-
(2010)
Development of Multimodal Interfaces: Active Listening and Synchrony
, pp. 364-381
-
-
Schlotthauer, G.1
Torres, M.E.2
Rufiner, H.L.3
-
111
-
-
70349223037
-
An auditory-based feature for robust speech recognition
-
Shao, Y., Jin, Z., Wang, D., Srinivasan, S., An auditory-based feature for robust speech recognition. Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on, 2009, 4625–4628, 10.1109/ICASSP.2009.4960661.
-
(2009)
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
, pp. 4625-4628
-
-
Shao, Y.1
Jin, Z.2
Wang, D.3
Srinivasan, S.4
-
112
-
-
84979763198
-
A better decomposition of speech obtained using modified empirical mode decomposition
-
Sharma, R., Prasanna, S.M., A better decomposition of speech obtained using modified empirical mode decomposition. Digit. Signal Process. 58 (2016), 26–39 http://dx.doi.org/10.1016/j.dsp.2016.07.012.
-
(2016)
Digit. Signal Process.
, vol.58
, pp. 26-39
-
-
Sharma, R.1
Prasanna, S.M.2
-
114
-
-
84924194710
-
Analysis of electroglottograph signal using ensemble empirical mode decomposition
-
IEEE
-
Sharma, R., Ramesh, K., Prasanna, S., Analysis of electroglottograph signal using ensemble empirical mode decomposition. India Conference (INDICON), 2014 Annual IEEE, 2014, IEEE, 1–6.
-
(2014)
India Conference (INDICON), 2014 Annual IEEE
, pp. 1-6
-
-
Sharma, R.1
Ramesh, K.2
Prasanna, S.3
-
115
-
-
79954431608
-
Stressed speech processing: human vs automatic in non-professional speakers scenario
-
Shukla, S., Prasanna, S., Dandapat, S., Stressed speech processing: human vs automatic in non-professional speakers scenario. Communications (NCC), 2011 National Conference on, 2011, 1–5.
-
(2011)
Communications (NCC), 2011 National Conference on
, pp. 1-5
-
-
Shukla, S.1
Prasanna, S.2
Dandapat, S.3
-
116
-
-
4444368779
-
Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition
-
Skowronski, M., Harris, J., Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition. J. Acoust. Soc. Am. 116:3 (2004), 1774–1780.
-
(2004)
J. Acoust. Soc. Am.
, vol.116
, Issue.3
, pp. 1774-1780
-
-
Skowronski, M.1
Harris, J.2
-
117
-
-
0029375490
-
Determination of instants of significant excitation in speech using group delay function
-
Smits, R., Yegnanarayana, B., Determination of instants of significant excitation in speech using group delay function. Speech Audio Process. IEEE Trans. 3:5 (1995), 325–333.
-
(1995)
Speech Audio Process. IEEE Trans.
, vol.3
, Issue.5
, pp. 325-333
-
-
Smits, R.1
Yegnanarayana, B.2
-
118
-
-
34548794790
-
Determination of instants of significant excitation in speech using Hilbert envelope and group delay function
-
Sreenivasa Rao, K., Prasanna, S., Yegnanarayana, B., Determination of instants of significant excitation in speech using Hilbert envelope and group delay function. Signal Process. Lett. IEEE 14:10 (2007), 762–765.
-
(2007)
Signal Process. Lett. IEEE
, vol.14
, Issue.10
, pp. 762-765
-
-
Sreenivasa Rao, K.1
Prasanna, S.2
Yegnanarayana, B.3
-
119
-
-
70449388050
-
Automatic Classification of Emotion-Related User States in Spontaneous Children's Speech
-
Logos Verlag
-
Steidl, S., Automatic Classification of Emotion-Related User States in Spontaneous Children's Speech. 2009, Logos Verlag.
-
(2009)
-
-
Steidl, S.1
-
120
-
-
84922740791
-
Joint variable frame rate and length analysis for speech recognition under adverse conditions
-
Tan, Z.-H., Kraljevski, I., Joint variable frame rate and length analysis for speech recognition under adverse conditions. Comput. Electr. Eng. 40:7 (2014), 2139–2149.
-
(2014)
Comput. Electr. Eng.
, vol.40
, Issue.7
, pp. 2139-2149
-
-
Tan, Z.-H.1
Kraljevski, I.2
-
121
-
-
0019075685
-
Some observations on oral air flow during phonation
-
Teager, H., Some observations on oral air flow during phonation. Acoust. Speech Signal Process. IEEE Trans. 28:5 (1980), 599–601.
-
(1980)
Acoust. Speech Signal Process. IEEE Trans.
, vol.28
, Issue.5
, pp. 599-601
-
-
Teager, H.1
-
122
-
-
0003236089
-
Evidence for nonlinear sound production mechanisms in the vocal tract
-
Springer
-
Teager, H., Teager, S., Evidence for nonlinear sound production mechanisms in the vocal tract. Speech Production and Speech Modelling, 1990, Springer, 241–261.
-
(1990)
Speech Production and Speech Modelling
, pp. 241-261
-
-
Teager, H.1
Teager, S.2
-
123
-
-
85008529793
-
Estimation of glottal closing and opening instants in voiced speech using the yaga algorithm
-
Thomas, M.R., Gudnason, J., Naylor, P.A., Estimation of glottal closing and opening instants in voiced speech using the yaga algorithm. Audio Speech Lang. Process. IEEE Trans. 20:1 (2012), 82–91.
-
(2012)
Audio Speech Lang. Process. IEEE Trans.
, vol.20
, Issue.1
, pp. 82-91
-
-
Thomas, M.R.1
Gudnason, J.2
Naylor, P.A.3
-
124
-
-
85009773075
-
Clasificación de fonemas mediante paquetes de onditas orientadas perceptualmente
-
México
-
Torres, H.M., Rufiner, H.L., Clasificación de fonemas mediante paquetes de onditas orientadas perceptualmente. Anales del 1er Congreso Latinoamericano de Ingeniería Biomédica, Mazatlán 98, 1, 1998, 163–166 México.
-
(1998)
Anales del 1er Congreso Latinoamericano de Ingeniería Biomédica, Mazatlán 98
, vol.1
, pp. 163-166
-
-
Torres, H.M.1
Rufiner, H.L.2
-
125
-
-
0034443356
-
Automatic speaker identification by means of mel cepstrum, wavelets and wavelets packets
-
Paper No. TU–E201–02
-
Torres, H.M., Rufiner, H.L., Automatic speaker identification by means of mel cepstrum, wavelets and wavelets packets. Proceedings of the Chicago 2000 World Congress IEEE EMBS, 2000 Paper No. TU–E201–02.
-
(2000)
Proceedings of the Chicago 2000 World Congress IEEE EMBS
-
-
Torres, H.M.1
Rufiner, H.L.2
-
126
-
-
80051634709
-
A complete ensemble empirical mode decomposition with adaptive noise
-
IEEE
-
Torres, M.E., Colominas, M.A., Schlotthauer, G., Flandrin, P., A complete ensemble empirical mode decomposition with adaptive noise. Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, 2011, IEEE, 4144–4147.
-
(2011)
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
, pp. 4144-4147
-
-
Torres, M.E.1
Colominas, M.A.2
Schlotthauer, G.3
Flandrin, P.4
-
128
-
-
33746410556
-
Emotional speech recognition: resources, features, and methods
-
Ververidis, D., Kotropoulos, C., Emotional speech recognition: resources, features, and methods. Speech Commun. 48:9 (2006), 1162–1181, 10.1016/j.specom.2006.04.003.
-
(2006)
Speech Commun.
, vol.48
, Issue.9
, pp. 1162-1181
-
-
Ververidis, D.1
Kotropoulos, C.2
-
129
-
-
84979622932
-
Multi-objective optimisation of wavelet features for phoneme recognition
-
Vignolo, L.D., Rufiner, H.L., Milone, D.H., Multi-objective optimisation of wavelet features for phoneme recognition. IET Signal Proc. 10:6 (2016), 685–691, 10.1049/iet-spr.2015.0568.
-
(2016)
IET Signal Proc.
, vol.10
, Issue.6
, pp. 685-691
-
-
Vignolo, L.D.1
Rufiner, H.L.2
Milone, D.H.3
-
130
-
-
67650523177
-
Genetic optimization of cepstrum filterbank for phoneme classification
-
INSTICC Press Porto (Portugal)
-
Vignolo, L., Rufiner, H., Milone, D., Goddard, J., Genetic optimization of cepstrum filterbank for phoneme classification. Proceedings of the Second International Conference on Bio-inspired Systems and Signal Processing (Biosignals 2009), 2009, INSTICC Press, Porto (Portugal), 179–185.
-
(2009)
Proceedings of the Second International Conference on Bio-inspired Systems and Signal Processing (Biosignals 2009)
, pp. 179-185
-
-
Vignolo, L.1
Rufiner, H.2
Milone, D.3
Goddard, J.4
-
131
-
-
84872842796
-
Genetic wavelet packets for speech recognition
-
Vignolo, L.D., Milone, D.H., Rufiner, H.L., Genetic wavelet packets for speech recognition. Expert Syst. Appl. 40:6 (2013), 2350–2359, 10.1016/j.eswa.2012.10.050.
-
(2013)
Expert Syst. Appl.
, vol.40
, Issue.6
, pp. 2350-2359
-
-
Vignolo, L.D.1
Milone, D.H.2
Rufiner, H.L.3
-
132
-
-
84982684046
-
Feature optimisation for stress recognition in speech
-
Vignolo, L.D., Prasanna, S.M., Dandapat, S., Rufiner, H.L., Milone, D.H., Feature optimisation for stress recognition in speech. Pattern Recognit. Lett. 84 (2016), 1–7.
-
(2016)
Pattern Recognit. Lett.
, vol.84
, pp. 1-7
-
-
Vignolo, L.D.1
Prasanna, S.M.2
Dandapat, S.3
Rufiner, H.L.4
Milone, D.H.5
-
133
-
-
79954578971
-
Evolutionary cepstral coefficients
-
Vignolo, L.D., Rufiner, H.L., Milone, D.H., Goddard, J.C., Evolutionary cepstral coefficients. Appl. Soft Comput. 11:4 (2011), 3419–3428, 10.1016/j.asoc.2011.01.012.
-
(2011)
Appl. Soft Comput.
, vol.11
, Issue.4
, pp. 3419-3428
-
-
Vignolo, L.D.1
Rufiner, H.L.2
Milone, D.H.3
Goddard, J.C.4
-
134
-
-
79953280364
-
Evolutionary splines for Cepstral filterbank optimization in phoneme classification
-
Vignolo, L.D., Rufiner, H.L., Milone, D.H., Goddard, J.C., Evolutionary splines for Cepstral filterbank optimization in phoneme classification. EURASIP J. Adv. Signal Process. 2011 (2011), 8:1–8:14.
-
(2011)
EURASIP J. Adv. Signal Process.
, vol.2011
, pp. 81-8:14
-
-
Vignolo, L.D.1
Rufiner, H.L.2
Milone, D.H.3
Goddard, J.C.4
-
135
-
-
79959978998
-
Best basis-based wavelet packet entropy feature extraction and hierarchical eeg classification for epileptic detection
-
Wang, D., Miao, D., Xie, C., Best basis-based wavelet packet entropy feature extraction and hierarchical eeg classification for epileptic detection. Expert Syst. Appl. 38:11 (2011), 14314–14320, 10.1016/j.eswa.2011.05.096.
-
(2011)
Expert Syst. Appl.
, vol.38
, Issue.11
, pp. 14314-14320
-
-
Wang, D.1
Miao, D.2
Xie, C.3
-
136
-
-
80052083690
-
On intrinsic mode function
-
Wang, G., CHEN, X.-Y., Qiao, F.-L., Wu, Z., Huang, N.E., On intrinsic mode function. Adv. Adapt. Data Anal. 2:03 (2010), 277–293.
-
(2010)
Adv. Adapt. Data Anal.
, vol.2
, Issue.3
, pp. 277-293
-
-
Wang, G.1
CHEN, X.-Y.2
Qiao, F.-L.3
Wu, Z.4
Huang, N.E.5
-
137
-
-
79151483819
-
Speaker identification system using empirical mode decomposition and an artificial neural network
-
Wu, J.-D., Tsai, Y.-J., Speaker identification system using empirical mode decomposition and an artificial neural network. Expert Syst. Appl. 38:5 (2011), 6112–6117.
-
(2011)
Expert Syst. Appl.
, vol.38
, Issue.5
, pp. 6112-6117
-
-
Wu, J.-D.1
Tsai, Y.-J.2
-
138
-
-
17944381277
-
Improved MFCC-based feature for robust speaker identification
-
Wu, Z., Cao, Z., Improved MFCC-based feature for robust speaker identification. Tsinghua Sci. Technol. 10:2 (2005), 158–161.
-
(2005)
Tsinghua Sci. Technol.
, vol.10
, Issue.2
, pp. 158-161
-
-
Wu, Z.1
Cao, Z.2
-
139
-
-
2542525254
-
A study of the characteristics of white noise using the empirical mode decomposition method
-
Wu, Z., Huang, N.E., A study of the characteristics of white noise using the empirical mode decomposition method. Proc. R. Soc. London. Ser.A 460:2046 (2004), 1597–1611.
-
(2004)
Proc. R. Soc. London. Ser.A
, vol.460
, Issue.2046
, pp. 1597-1611
-
-
Wu, Z.1
Huang, N.E.2
-
140
-
-
80052078099
-
Ensemble empirical mode decomposition: a noise-assisted data analysis method
-
Wu, Z., Huang, N.E., Ensemble empirical mode decomposition: a noise-assisted data analysis method. Adv. Adapt. Data Anal. 1:01 (2009), 1–41.
-
(2009)
Adv. Adapt. Data Anal.
, vol.1
, Issue.1
, pp. 1-41
-
-
Wu, Z.1
Huang, N.E.2
-
141
-
-
28444477853
-
A novel pitch period detection algorithm based on Hilbert-Huang transform
-
Springer
-
Yang, Z., Huang, D., Yang, L., A novel pitch period detection algorithm based on Hilbert-Huang transform. Advances in Biometric Person Authentication, 2005, Springer, 586–593.
-
(2005)
Advances in Biometric Person Authentication
, pp. 586-593
-
-
Yang, Z.1
Huang, D.2
Yang, L.3
-
142
-
-
67649852867
-
Weighting of mel sub-bands based on SNR/entropy for robust ASR
-
Yeganeh, H., Ahadi, S., Mirrezaie, S., Ziaei, A., Weighting of mel sub-bands based on SNR/entropy for robust ASR. Signal Processing and Information Technology, 2008. ISSPIT 2008. IEEE International Symposium on, 2008, 292–296.
-
(2008)
Signal Processing and Information Technology, 2008. ISSPIT 2008. IEEE International Symposium on
, pp. 292-296
-
-
Yeganeh, H.1
Ahadi, S.2
Mirrezaie, S.3
Ziaei, A.4
-
143
-
-
0023670962
-
Applications of group delay functions in speech processing
-
Yegnanarayana, B., Madhu Murthy, K., Murthy, H.A., Applications of group delay functions in speech processing. J. Inst. Electron. Telecommun. Eng. 34 (1988), 20–29.
-
(1988)
J. Inst. Electron. Telecommun. Eng.
, vol.34
, pp. 20-29
-
-
Yegnanarayana, B.1
Madhu Murthy, K.2
Murthy, H.A.3
-
144
-
-
22544440896
-
Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system
-
Yegnanarayana, B., Prasanna, S., Zachariah, J.M., Gupta, C.S., Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system. SpeechAudio Process. IEEE Trans. 13:4 (2005), 575–582.
-
(2005)
SpeechAudio Process. IEEE Trans.
, vol.13
, Issue.4
, pp. 575-582
-
-
Yegnanarayana, B.1
Prasanna, S.2
Zachariah, J.M.3
Gupta, C.S.4
-
145
-
-
79956369785
-
Complementary ensemble empirical mode decomposition: a novel noise enhanced data analysis method
-
Yeh, J.-R., Shieh, J.-S., Huang, N.E., Complementary ensemble empirical mode decomposition: a novel noise enhanced data analysis method. Adv. Adapt. Data Anal. 2:02 (2010), 135–156.
-
(2010)
Adv. Adapt. Data Anal.
, vol.2
, Issue.2
, pp. 135-156
-
-
Yeh, J.-R.1
Shieh, J.-S.2
Huang, N.E.3
-
146
-
-
79952092015
-
Optimized discriminative transformations for speech features based on minimum classification error
-
Zamani, B., Akbari, A., Nasersharif, B., Jalalvand, A., Optimized discriminative transformations for speech features based on minimum classification error. Pattern Recognit. Lett. 32:7 (2011), 948–955, 10.1016/j.patrec.2011.01.017.
-
(2011)
Pattern Recognit. Lett.
, vol.32
, Issue.7
, pp. 948-955
-
-
Zamani, B.1
Akbari, A.2
Nasersharif, B.3
Jalalvand, A.4
-
147
-
-
0035506942
-
Comparison of different implementations of MFCC
-
Zheng, F., Zhang, G., Song, Z., Comparison of different implementations of MFCC. J. Comput. Sci. Technol. 16:6 (2001), 582–589, 10.1007/BF02943243.
-
(2001)
J. Comput. Sci. Technol.
, vol.16
, Issue.6
, pp. 582-589
-
-
Zheng, F.1
Zhang, G.2
Song, Z.3
-
148
-
-
84897134590
-
A novel speech emotion recognition method via incomplete sparse least square regression
-
1–1
-
Zheng, W., Xin, M., Wang, X., Wang, B., A novel speech emotion recognition method via incomplete sparse least square regression. Signal Process. Lett. IEEE, PP(99), 2014, 10.1109/LSP.2014.2308954 1–1.
-
(2014)
Signal Process. Lett. IEEE
, vol.PP
, Issue.99
-
-
Zheng, W.1
Xin, M.2
Wang, X.3
Wang, B.4
-
149
-
-
46449092074
-
Robust analysis and weighting on MFCC components for speech recognition and speaker identification
-
Zhou, X., Fu, Y., Liu, M., Hasegawa-Johnson, M., Huang, T., Robust analysis and weighting on MFCC components for speech recognition and speaker identification. Multimedia and Expo, 2007 IEEE International Conference on, 2007, 188–191.
-
(2007)
Multimedia and Expo, 2007 IEEE International Conference on
, pp. 188-191
-
-
Zhou, X.1
Fu, Y.2
Liu, M.3
Hasegawa-Johnson, M.4
Huang, T.5
-
150
-
-
0033690878
-
On the use of variable frame rate analysis in speech recognition
-
IEEE
-
Zhu, Q., Alwan, A., On the use of variable frame rate analysis in speech recognition. Acoustics, Speech, and Signal Processing, 2000. ICASSP’00. Proceedings. 2000 IEEE International Conference on, 3, 2000, IEEE, 1783–1786.
-
(2000)
Acoustics, Speech, and Signal Processing, 2000. ICASSP’00. Proceedings. 2000 IEEE International Conference on
, vol.3
, pp. 1783-1786
-
-
Zhu, Q.1
Alwan, A.2
|