-
2
-
-
33646255447
-
Further intelligibility results from human listening tests using the short-time phase spectrum
-
Alsteris L D, Paliwal K K 2006 Further intelligibility results from human listening tests using the short-time phase spectrum. Speech Commun. 48: 727-736.
-
(2006)
Speech Commun.
, vol.48
, pp. 727-736
-
-
Alsteris, L.D.1
Paliwal, K.K.2
-
5
-
-
33947159989
-
Chirp group delay analysis of speech signals
-
Bozkurt B, Couvreur L, Dutoit T 2007 Chirp group delay analysis of speech signals. Speech Commun. 49(3): 159-176.
-
(2007)
Speech Commun.
, vol.49
, Issue.3
, pp. 159-176
-
-
Bozkurt, B.1
Couvreur, L.2
Dutoit, T.3
-
6
-
-
84856254882
-
-
Proc. National Conference on Communications, Mumbai, India
-
Chevireddy S, Murthy H A, Chandrasekhar C 2008a A syllable-based segment vocoder. Proc. National Conference on Communications, Mumbai, India, 442-445.
-
(2008)
A Syllable-based Segment Vocoder
, pp. 442-445
-
-
Chevireddy, S.1
Murthy, H.A.2
Chandrasekhar, C.3
-
8
-
-
0017542202
-
The cepstrum: A guide to processing
-
Childers D G 1977 The cepstrum: A guide to processing. Proc. IEEE 68: 1428-1443.
-
(1977)
Proc. IEEE
, vol.68
, pp. 1428-1443
-
-
Childers, D.G.1
-
9
-
-
84856254879
-
-
CUED 2002 HTK Speech Recognition Toolkit
-
CUED 2002 HTK Speech Recognition Toolkit. http://htk.eng.cam.ac.uk.
-
-
-
-
10
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Davis S, Mermelstein 1980 Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech, Signal Process 28: 357-366.
-
(1980)
IEEE Trans. Acoust. Speech, Signal Process
, vol.28
, pp. 357-366
-
-
Davis, S.1
Mermelstein2
-
11
-
-
28244462378
-
-
DDNews, India, Speech and Vision Lab, IIT Madras, Chennai
-
DDNews 2001 Database for Indian languages. India, Speech and Vision Lab, IIT Madras, Chennai.
-
(2001)
Database For Indian Languages
-
-
-
12
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
Dupont S, Luettin J 2000 Audio-visual speech modeling for continuous speech recognition. IEEE Trans. Multimedia 2(3) 141-151.
-
(2000)
IEEE Trans. Multimedia
, vol.2
, Issue.3
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
13
-
-
85016587886
-
-
Proc. IEEE Int. Conf. Acoust. Speech Signal Process, San Francisco, California, USA
-
Godfrey J J, Holliman E C, McDaniel J 1992 SWITCHBOARD: Telephone speech corpus for research and development. Proc. IEEE Int. Conf. Acoust. Speech Signal Process, San Francisco, California, USA, 1. 517-520.
-
(1992)
SWITCHBOARD: Telephone Speech Corpus For Research and Development
, vol.1
, pp. 517-520
-
-
Godfrey, J.J.1
Holliman, E.C.2
McDaniel, J.3
-
14
-
-
0033357399
-
Speaking in short hand - A syllable centric perspective for understanding pronounciation variation
-
Greenberg S 1999 Speaking in short hand - A syllable centric perspective for understanding pronounciation variation. Speech Commun. 29: 159-176.
-
(1999)
Speech Commun.
, vol.29
, pp. 159-176
-
-
Greenberg, S.1
-
15
-
-
0002076795
-
-
Proc. Int. Conf. Spoken Language Process, Philadelphia, USA
-
Greenberg S, Hollenback J, Ellis D 1996 Insights into spoken language gleaned from phonetic transcription of the switchboard corpus. Proc. Int. Conf. Spoken Language Process, Philadelphia, USA, 24-27.
-
(1996)
Insights Into Spoken Language Gleaned From Phonetic Transcription of the Switchboard Corpus
, pp. 24-27
-
-
Greenberg, S.1
Hollenback, J.2
Ellis, D.3
-
19
-
-
0025041264
-
Perceptually linear predictive (plp) analysis of speech
-
Hermansky H 1990 Perceptually linear predictive (plp) analysis of speech. J. of the Acoust. Soc. of Am. 87: 1738-1752.
-
(1990)
J. Of the Acoust. Soc. Of Am
, vol.87
, pp. 1738-1752
-
-
Hermansky, H.1
-
21
-
-
77952210851
-
-
Proc. National Conference on Communications, Chennai, India
-
Janakiram R, Kumar C J, Murthy H A 2010 Robust syllable segmentation its application to syllable-centric continuous speech recognition. Proc. National Conference on Communications, Chennai, India, 276-280.
-
(2010)
Robust Syllable Segmentation Its Application to Syllable-centric Continuous Speech Recognition
, pp. 276-280
-
-
Janakiram, R.1
Kumar, C.J.2
Murthy, H.A.3
-
23
-
-
1842475640
-
Automatic segmentation of continuous speech using minimum phase group delay functions
-
Kamakshi Prasad V, Nagarajan T, Murthy H A 2004 Automatic segmentation of continuous speech using minimum phase group delay functions. Speech Commun. 42: 429-446.
-
(2004)
Speech Commun.
, vol.42
, pp. 429-446
-
-
Kamakshi Prasad, V.1
Nagarajan, T.2
Murthy, H.A.3
-
26
-
-
77952154201
-
-
Proc. National Conference on Communication, Chennai, India
-
Kumar J C, Janakiraman R, Murthy H A 2010 Kl divergence based feature switching in the linguistic search space for automatic speech recognition. Proc. National Conference on Communication, Chennai, India, 281-285.
-
(2010)
Kl Divergence Based Feature Switching In the Linguistic Search Space For Automatic Speech Recognition
, pp. 281-285
-
-
Kumar, J.C.1
Janakiraman, R.2
Murthy, H.A.3
-
27
-
-
28244462055
-
-
Proc. SPCOM, Bangalore, India
-
Lakshmi Sarada G, Nagarajan T, Murthy H A 2004 Multiple frame size and multiple frame rate feature extraction for speech recognition. Proc. SPCOM, Bangalore, India, 592-595.
-
(2004)
Multiple Frame Size and Multiple Frame Rate Feature Extraction For Speech Recognition
, pp. 592-595
-
-
Lakshmi Sarada, G.1
Nagarajan, T.2
Murthy, H.A.3
-
30
-
-
0141590458
-
Training of stream weights for the decoding of speech using parallel feature streams
-
Li X, Stern R 2003 Training of stream weights for the decoding of speech using parallel feature streams. Proc. IEEE Int. Conf. Acoust. Speech Signal Process, 1: 832-835.
-
(2003)
Proc. IEEE Int. Conf. Acoust. Speech Signal Process
, vol.1
, pp. 832-835
-
-
Li, X.1
Stern, R.2
-
31
-
-
0018478297
-
Spectral root homomorphic deconvolution system
-
Lim J 1979 Spectral root homomorphic deconvolution system. IEEE Trans. Acoust. Speech Signal Process 27: 223-233.
-
(1979)
IEEE Trans. Acoust. Speech Signal Process
, vol.27
, pp. 223-233
-
-
Lim, J.1
-
34
-
-
0026204672
-
Formant extraction from minimum phase group delay function
-
Murthy H A, Yegnanarayana B 1991 Formant extraction from minimum phase group delay function. Speech Commun. 10: 209-221.
-
(1991)
Speech Commun.
, vol.10
, pp. 209-221
-
-
Murthy, H.A.1
Yegnanarayana, B.2
-
35
-
-
0024681756
-
Effectiveness of representation of signals through group delay functions
-
Murthy K V M, Yegnanarayana B 1989 Effectiveness of representation of signals through group delay functions. Elsevier Signal Process. 17: 141-150.
-
(1989)
Elsevier Signal Process.
, vol.17
, pp. 141-150
-
-
Murthy, K.V.M.1
Yegnanarayana, B.2
-
36
-
-
85009197974
-
-
Proc. EUROSPEECH, Geneva, Switzerland
-
Nagarajan T, Murthy H A, Hegde R M 2003 Segmentation of speech into syllable-like units. Proc. EUROSPEECH, Geneva, Switzerland, 2893-2896.
-
(2003)
Segmentation of Speech Into Syllable-like Units
, pp. 2893-2896
-
-
Nagarajan, T.1
Murthy, H.A.2
Hegde, R.M.3
-
37
-
-
84856276506
-
-
Proc. SPCOM, Bangalore, India
-
Nagarajan T, Prasad V K, Murthy H A 2001 The minimum phase signal derived from the magnitude spectrum and its applications to speech segmentation. Proc. SPCOM, Bangalore, India, 95-101.
-
(2001)
The Minimum Phase Signal Derived From the Magnitude Spectrum and Its Applications to Speech Segmentation
, pp. 95-101
-
-
Nagarajan, T.1
Prasad, V.K.2
Murthy, H.A.3
-
38
-
-
0035790960
-
-
Proc. IEEE Fourth Workshop on Multimedia Signal Processing, Cannes, France
-
Neti C P, Luettin G, Matthews J, Vergyri J H G 2001 Large-vocabulary audio-visual speech recognition: A summary of the johns hopkins summer 2000 workshop. Proc. IEEE Fourth Workshop on Multimedia Signal Processing, Cannes, France, 619-624.
-
(2001)
Large-vocabulary Audio-visual Speech Recognition: A Summary of the Johns Hopkins Summer 2000 Workshop
, pp. 619-624
-
-
Neti, C.P.1
Luettin, G.2
Matthews, J.3
Vergyri, J.H.G.4
-
39
-
-
84856279105
-
-
NIST 2003 The NIST year 2003 speaker recognition evaluation plan
-
NIST 2003 The NIST year 2003 speaker recognition evaluation plan. http://www.itl.nist.gov/iad/mig/tests/sre/2003/index.html.
-
-
-
-
40
-
-
0014055288
-
Cepstrum pitch determination
-
Noll AM 1967 Cepstrum pitch determination. J. Acoust. Soc. Am. 41(2): 179-195.
-
(1967)
J. Acoust. Soc. Am
, vol.41
, Issue.2
, pp. 179-195
-
-
Noll, A.M.1
-
41
-
-
84856254875
-
The OGI multi-language telephone speech corpus
-
OGI, Proc. Int. Conf. Spoken Lang., Banff, Alberta
-
OGI 1992 The OGI multi-language telephone speech corpus. Proc. Int. Conf. Spoken Lang., Banff, Alberta.
-
(1992)
-
-
-
44
-
-
70450194107
-
-
Proc. INTERSPEECH, Brighton, U. K
-
Padmanabhan R, Parthasarthi S H K, Murthy H A 2009 Robustness of phase based features for speaker recognition. Proc. INTERSPEECH, Brighton, U. K., 2355-2358.
-
(2009)
Robustness of Phase Based Features For Speaker Recognition
, pp. 2355-2358
-
-
Padmanabhan, R.1
Parthasarthi, S.H.K.2
Murthy, H.A.3
-
45
-
-
13544259544
-
On the usefulness of stft phase spectrum in human listening tests
-
Paliwal K K, Alsteris L D 2005 On the usefulness of stft phase spectrum in human listening tests. Speech Commun. 45 153-170.
-
(2005)
Speech Commun.
, vol.45
, pp. 153-170
-
-
Paliwal, K.K.1
Alsteris, L.D.2
-
47
-
-
0030363953
-
-
Proc. Int. Conf. Spoken Language Process., Philadelphia, USA
-
Pfitzinger H R, Burger S, Heid S 1996 Syllable detection in read and spontaneous speech. Proc. Int. Conf. Spoken Language Process., Philadelphia, USA, 1261-1264.
-
(1996)
Syllable Detection In Read and Spontaneous Speech
, pp. 1261-1264
-
-
Pfitzinger, H.R.1
Burger, S.2
Heid, S.3
-
48
-
-
77952228240
-
-
Proc. National Conference on Communication, Chennai, India
-
Pradhan A, Chevireddy S, Veezhinathan K, Murthy H A 2010 A low-bit rate segment vocoder using minimum residual energy criteria. Proc. National Conference on Communication, Chennai, India, 246-250.
-
(2010)
A Low-bit Rate Segment Vocoder Using Minimum Residual Energy Criteria
, pp. 246-250
-
-
Pradhan, A.1
Chevireddy, S.2
Veezhinathan, K.3
Murthy, H.A.4
-
49
-
-
65249112285
-
Vowel onset point detection using source, spectral peaks and modulation spectrum energies
-
Prasanna S, Reddy S B, Krishnamoorthy P 2009 Vowel onset point detection using source, spectral peaks and modulation spectrum energies. IEEE Trans. Audio Speech Language Process. 17(4): 556-565.
-
(2009)
IEEE Trans. Audio Speech Language Process.
, vol.17
, Issue.4
, pp. 556-565
-
-
Prasanna, S.1
Reddy, S.B.2
Krishnamoorthy, P.3
-
50
-
-
3943055955
-
The chirp z-transform algorithm and its application
-
Rabiner L R, Schafer R W 1969 The chirp z-transform algorithm and its application. Bell Syst. Tech. J. 48(5): 1249-1292.
-
(1969)
Bell Syst. Tech. J.
, vol.48
, Issue.5
, pp. 1249-1292
-
-
Rabiner, L.R.1
Schafer, R.W.2
-
51
-
-
85009193716
-
-
Proc. EUROSPEECH, Geneva, Switzerland
-
Ramasubramanian V, Jayaram A K V S, Sreenivas T V 2003 Language identification using parallel sub-word recognition - an ergodic hmm equivalence. Proc. EUROSPEECH, Geneva, Switzerland, 1357-1360.
-
(2003)
Language Identification Using Parallel Sub-word Recognition - An Ergodic Hmm Equivalence
, pp. 1357-1360
-
-
Ramasubramanian, V.1
Jayaram, A.K.V.S.2
Sreenivas, T.V.3
-
52
-
-
84856254389
-
-
Proc. National Conference on Communications, Kharagpur, India
-
Rao M N, Thomas S, Nagarajan T, Murthy H A 2005 Text-to-speech synthesis using syllable-like units. Proc. National Conference on Communications, Kharagpur, India, 227-280.
-
(2005)
Text-to-speech Synthesis Using Syllable-like Units
, pp. 227-280
-
-
Rao, M.N.1
Thomas, S.2
Nagarajan, T.3
Murthy, H.A.4
-
53
-
-
84856254874
-
-
Proc. EUSIPCO 2008, Lausanne, Switzerland
-
Rasipuram R, Hegde R M, Murthy H A 2008 Incorporating acoustic diversity into the linguistic feature space for syllable recognition. Proc. EUSIPCO 2008, Lausanne, Switzerland, www.eurasip.org/Proceedings/Eusipco/papers/1569104561.pdf.
-
(2008)
Incorporating Acoustic Diversity Into the Linguistic Feature Space For Syllable Recognition
-
-
Rasipuram, R.1
Hegde, R.M.2
Murthy, H.A.3
-
56
-
-
84856272996
-
Acoustic-phonetic continuous speech corpus
-
TIMIT, National Institute of Standards and Technology Speech Disc 1-1. 1. Fisher W, Doddington G, Goudie Marshall K M, Proc. DARPA Workshop on Speech Recognition, California
-
TIMIT 1990 Acoustic-phonetic continuous speech corpus. National Institute of Standards and Technology Speech Disc 1-1. 1. Fisher W, Doddington G, Goudie Marshall K M 1986 The DARPA speech recognition research database: Specifications and status. Proc. DARPA Workshop on Speech Recognition, California, 93-99.
-
(1990)
The DARPA Speech Recognition Research Database: Specifications and Status
, pp. 93-99
-
-
-
58
-
-
0017969757
-
Formant extraction from linear prediction phase spectra
-
Yegnanarayana B 1979 Formant extraction from linear-prediction phase spectra. J. Acoust. Soc. Am. 63: 1638-1640.
-
(1979)
J. Acoust. Soc. Am.
, vol.63
, pp. 1638-1640
-
-
Yegnanarayana, B.1
-
59
-
-
0026923568
-
Significance of group delay functions in spectrum estimation
-
Yegnanarayana B, Murthy H A 1992 Significance of group delay functions in spectrum estimation. IEEE Trans. Signal Process. 40(9): 2281-2289.
-
(1992)
IEEE Trans. Signal Process.
, vol.40
, Issue.9
, pp. 2281-2289
-
-
Yegnanarayana, B.1
Murthy, H.A.2
-
62
-
-
0029733178
-
Comparison of four approaches to automatic language identification of telephone speech
-
Zissman M A 1996 Comparison of four approaches to automatic language identification of telephone speech. IEEE Trans. Speech Audio Process 4(1): 31-44.
-
(1996)
IEEE Trans. Speech Audio Process
, vol.4
, Issue.1
, pp. 31-44
-
-
Zissman, M.A.1
|