-
1
-
-
0342321463
-
The THISL broadcast news retrieval system
-
Abberley, D., Kirby, D., Renals, S., & Robinson, T. (1999). The THISL broadcast news retrieval system. In Proceedings of the ESCA ETRW Workshop Accessing Information in Spoken Audio (pp. 14-19).
-
(1999)
Proceedings of the ESCA ETRW Workshop Accessing Information in Spoken Audio
, pp. 14-19
-
-
Abberley, D.1
Kirby, D.2
Renals, S.3
Robinson, T.4
-
3
-
-
0031177213
-
Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
-
Ahadi, S. M., & Woodland, P. C. (1997). Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 11, 187-206.
-
(1997)
Computer Speech and Language
, vol.11
, pp. 187-206
-
-
Ahadi, S.M.1
Woodland, P.C.2
-
5
-
-
44949259254
-
A robust fusion method for multilingual spoken document retrieval systems employing tiered resources
-
Pittsburgh
-
Akbacak, M., & Hansen, J. H. L. (2006). A robust fusion method for multilingual spoken document retrieval systems employing tiered resources. In Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006, Pittsburgh (pp. 1177-1180).
-
(2006)
Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006
, pp. 1177-1180
-
-
Akbacak, M.1
Hansen, J.H.L.2
-
6
-
-
50449102573
-
Environmental sniffing: Noise knowledge estimation for robust speech systems
-
Akbacak, M., & Hansen, J. H. L. (2007). Environmental sniffing: Noise knowledge estimation for robust speech systems. IEEE Transactions on Audio, Speech and Language Processing, 15(2), 465-477.
-
(2007)
IEEE Transactions on Audio, Speech and Language Processing
, vol.15
, Issue.2
, pp. 465-477
-
-
Akbacak, M.1
Hansen, J.H.L.2
-
7
-
-
33947113758
-
Advances in phone-based modeling for automatic accent classification
-
Angkititrakul, P., & Hansen, J. H. L. (2006). Advances in phone-based modeling for automatic accent classification. IEEE Trans. Audio, Speech & Language Proc., 14(2), 634-646.
-
(2006)
IEEE Trans. Audio, Speech & Language Proc
, vol.14
, Issue.2
, pp. 634-646
-
-
Angkititrakul, P.1
Hansen, J.H.L.2
-
8
-
-
50249182472
-
Discriminative in-set/out-of-set speaker recognition
-
Angkititrakul, P., & Hansen, J. H. L. (2007). Discriminative in-set/out-of-set speaker recognition. IEEE Transactions on Audio, Speech and Language Processing, 15(2), 498-508.
-
(2007)
IEEE Transactions on Audio, Speech and Language Processing
, vol.15
, Issue.2
, pp. 498-508
-
-
Angkititrakul, P.1
Hansen, J.H.L.2
-
9
-
-
0030757418
-
A study of temporal features and frequency characteristics in American English foreign accent
-
Arslan, L. M., & Hansen, J. H. L. (1997). A study of temporal features and frequency characteristics in American English foreign accent. The Journal of the Acoustical Society of America, 102(1), 28-40.
-
(1997)
The Journal of the Acoustical Society of America
, vol.102
, Issue.1
, pp. 28-40
-
-
Arslan, L.M.1
Hansen, J.H.L.2
-
10
-
-
0034229795
-
A comparative study of traditional and newly proposed features for recognition of speech under stress
-
Bou-Ghazale, S. E., & Hansen, J. H. L. (2000). A comparative study of traditional and newly proposed features for recognition of speech under stress. IEEE Transactions on Speech & Audio Processing, 8(4), 429-442.
-
(2000)
IEEE Transactions on Speech & Audio Processing
, vol.8
, Issue.4
, pp. 429-442
-
-
Bou-Ghazale, S.E.1
Hansen, J.H.L.2
-
12
-
-
85135272864
-
Maximum a posteriori linear regression for hidden Markov model adaptation
-
Budapest
-
Chesta, C., Siohan, O., & Lee, C. H. (1999). Maximum a posteriori linear regression for hidden Markov model adaptation. In Proceedings of Eurospeech-99, Budapest (pp. 203-206).
-
(1999)
Proceedings of Eurospeech-99
, pp. 203-206
-
-
Chesta, C.1
Siohan, O.2
Lee, C.H.3
-
13
-
-
84874875877
-
Maximum a posterior linear regression with elliptically symmetric matrix priors
-
Chou, W. (1999). Maximum a posterior linear regression with elliptically symmetric matrix priors. In Proceedings of Eurospeech (pp. 1-4).
-
(1999)
Proceedings of Eurospeech
, pp. 1-4
-
-
Chou, W.1
-
15
-
-
85009150731
-
Building a test collection for speech-driven web retrieval
-
Geneva
-
Fujii, A., & Itou, K. (2003). Building a test collection for speech-driven Web retrieval. In Proceedings of Eurospeech-2003, Geneva (pp. 1153-1156).
-
(2003)
Proceedings of Eurospeech-2003
, pp. 1153-1156
-
-
Fujii, A.1
Itou, K.2
-
16
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
Gauvain, J.-L., & Lee, C.-H. (1994). Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. on Speech and Audio Proc., 2, 291-298.
-
(1994)
IEEE Trans. on Speech and Audio Proc
, vol.2
, pp. 291-298
-
-
Gauvain, J.-L.1
Lee, C.-H.2
-
17
-
-
0030283741
-
Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
-
Hansen, J. H. L. (1996). Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition. Speech Communication, Special Issue on Speech Under Stress, 20(2), 151-170.
-
(1996)
Speech Communication, Special Issue on Speech Under Stress
, vol.20
, Issue.2
, pp. 151-170
-
-
Hansen, J.H.L.1
-
18
-
-
85008020310
-
SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word
-
Hansen, J. H. L., Huang, R., Zhou, B., Seadle, M., Deller, J. R., Jr., Gurijala, A. R., et al. (2005). SpeechFind: Advances in spoken document retrieval for a national gallery of the spoken word. IEEE Trans. on Speech and Audio Proc., 13(5), 712-730.
-
(2005)
IEEE Trans. on Speech and Audio Proc
, vol.13
, Issue.5
, pp. 712-730
-
-
Hansen, J.H.L.1
Huang, R.2
Zhou, B.3
Seadle, M.4
Deller Jr., J.R.5
Gurijala, A.R.6
-
19
-
-
85009083936
-
Audio stream phrase recognition for a national gallery of the spoken word: 'One small step'
-
Beijing
-
Hansen, J. H. L., Zhou, B., Akbacak, M., Sarikaya, R., & Pellom, B. (2000). Audio stream phrase recognition for a national gallery of the spoken word: 'One small step'. In Proceedings of the ICSLP-2000: Inter. Conf. Spoken Lang. Proc., Beijing (Vol. 3, pp. 1089-1092).
-
(2000)
Proceedings of the ICSLP-2000: Inter. Conf. Spoken Lang. Proc
, vol.3
, pp. 1089-1092
-
-
Hansen, J.H.L.1
Zhou, B.2
Akbacak, M.3
Sarikaya, R.4
Pellom, B.5
-
20
-
-
0033705979
-
Automatic speech summarization based on word significance and linguistic likelihood
-
Hori, C., & Furui, S. (2000). Automatic speech summarization based on word significance and linguistic likelihood. In Proceedings of the IEEE ICASSP-00: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 3, pp. 1579-1582).
-
(2000)
Proceedings of the IEEE ICASSP-00: Inter. Conf. Acoust. Speech, Sig. Proc
, vol.3
, pp. 1579-1582
-
-
Hori, C.1
Furui, S.2
-
21
-
-
34047274787
-
Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora
-
Huang, R., & Hansen, J. H. L. (2006). Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora. IEEE Trans. Audio, Speech and Language Processing, 14(3), 907-919.
-
(2006)
IEEE Trans. Audio, Speech and Language Processing
, vol.14
, Issue.3
, pp. 907-919
-
-
Huang, R.1
Hansen, J.H.L.2
-
22
-
-
64149085238
-
Dialect/accent classification using unrestricted audio
-
Huang, R., & Hansen, J. H. L. (2007). Dialect/accent classification using unrestricted audio. IEEE Trans. on Audio Speech and Language Processing, 15(2), 453-464.
-
(2007)
IEEE Trans. on Audio Speech and Language Processing
, vol.15
, Issue.2
, pp. 453-464
-
-
Huang, R.1
Hansen, J.H.L.2
-
23
-
-
84898168713
-
-
Paper presented at the EUSIPCO-2004, 12th European Signal Processing Conference, Vienna, Austria, Paper
-
Kurimo, M., Zhou, B., Huang, R., & Hansen, J. H. L. (2004). Language modeling structures in audio transcription for retrieval of historical speeches. Paper presented at the EUSIPCO-2004, 12th European Signal Processing Conference, Vienna, Austria (Paper 1530).
-
(2004)
Language Modeling Structures in Audio Transcription for Retrieval of Historical Speeches
, pp. 1530
-
-
Kurimo, M.1
Zhou, B.2
Huang, R.3
Hansen, J.H.L.4
-
24
-
-
0141496213
-
Unsupervised language model adaptation for broadcast news
-
Langzhou, C., Gauvain, J.-L., Lamel, L., & Adda, G. (2003). Unsupervised language model adaptation for broadcast news. In Proceedings of the IEEE ICASSP-03: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 220-223).
-
(2003)
Proceedings of the IEEE ICASSP-03: Inter. Conf. Acoust. Speech, Sig. Proc
, vol.1
, pp. 220-223
-
-
Langzhou, C.1
Gauvain, J.-L.2
Lamel, L.3
Adda, G.4
-
25
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
Leggetter, C., & Woodland, P. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 9, 171-185.
-
(1995)
Computer Speech and Language
, vol.9
, pp. 171-185
-
-
Leggetter, C.1
Woodland, P.2
-
27
-
-
0036816475
-
Content analysis for audio classification and segmentation
-
Lu, L., Zhang, H., & Jiang, H. (2002). Content analysis for audio classification and segmentation. IEEE Trans. Speech & Audio Proc., 10(7), 504-516.
-
(2002)
IEEE Trans. Speech & Audio Proc
, vol.10
, Issue.7
, pp. 504-516
-
-
Lu, L.1
Zhang, H.2
Jiang, H.3
-
28
-
-
79951784751
-
Automatic summarization of broadcast news using structural features
-
Geneva
-
Maskey, S. R., & Hirschberg, J. (2003). Automatic summarization of broadcast news using structural features. In Proceedings of Eurospeech-2003, Geneva (pp. 1173-1176).
-
(2003)
Proceedings of Eurospeech-2003
, pp. 1173-1176
-
-
Maskey, S.R.1
Hirschberg, J.2
-
29
-
-
0141702097
-
Towards domain independent speaker clustering
-
Moh, Y., Nguyen, P., & Junqua, J.-C. (2003). Towards domain independent speaker clustering. In Proceedings of the IEEE ICASSP-03: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 2, pp. 85-88).
-
(2003)
Proceedings of the IEEE ICASSP-03: Inter. Conf. Acoust. Speech, Sig. Proc
, vol.2
, pp. 85-88
-
-
Moh, Y.1
Nguyen, P.2
Junqua, J.-C.3
-
30
-
-
0034857759
-
Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
-
Mori, K., & Nakagawa, S. (2001). Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition. In Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 413-416).
-
(2001)
Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc
, vol.1
, pp. 413-416
-
-
Mori, K.1
Nakagawa, S.2
-
31
-
-
0035441593
-
Spoken language recognition-a step toward multilinguality in speech processing
-
Navratil, J. (2001). Spoken language recognition-a step toward multilinguality in speech processing. IEEE Transactions on Speech & Audio Processing, 9, 678-685.
-
(2001)
IEEE Transactions on Speech & Audio Processing
, vol.9
, pp. 678-685
-
-
Navratil, J.1
-
32
-
-
0032665630
-
Experiments in topic indexing of broadcast news using neural networks
-
Neukirchen, C., Willett, D., & Rigoll, G. (1999). Experiments in topic indexing of broadcast news using neural networks. In Proceedings of the IEEE ICASSP-99: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 2, pp. 1093-1096).
-
(1999)
Proceedings of the IEEE ICASSP-99: Inter. Conf. Acoust. Speech, Sig. Proc
, vol.2
, pp. 1093-1096
-
-
Neukirchen, C.1
Willett, D.2
Rigoll, G.3
-
36
-
-
0033688848
-
High resolution speech feature parameterization for monophone based stressed speech recognition
-
Sarikaya, R., & Hansen, J. H. L. (2000). High resolution speech feature parameterization for monophone based stressed speech recognition. IEEE Signal Processing Letters, 7(7), 182-185.
-
(2000)
IEEE Signal Processing Letters
, vol.7
, Issue.7
, pp. 182-185
-
-
Sarikaya, R.1
Hansen, J.H.L.2
-
37
-
-
85009271609
-
Towards automatic closed captioning: Low latency real time broadcast news transcription
-
Denver
-
Saraclar, M., Riley, M., Bocchieri, E., & Goffin, V. (2002). Towards automatic closed captioning: Low latency real time broadcast news transcription. In Proceedings of the ICSLP-2002: Inter. Conf. Spoken Lang., Denver (pp. 1741-1744).
-
(2002)
Proceedings of the ICSLP-2002: Inter. Conf. Spoken Lang
, pp. 1741-1744
-
-
Saraclar, M.1
Riley, M.2
Bocchieri, E.3
Goffin, V.4
-
38
-
-
85050187568
-
Lattice-based search for spoken utterance retrieval
-
Boston
-
Saraclar, M., & Sproat, R. (2004). Lattice-based search for spoken utterance retrieval. In Proceedings of the HLT-NAACL 2004, Boston (pp. 129-136).
-
(2004)
Proceedings of the HLT-NAACL 2004
, pp. 129-136
-
-
Saraclar, M.1
Sproat, R.2
-
41
-
-
0036461005
-
Structural maximum a posteriori linear regression for fast HMM adaptation
-
Siohan, O., Myrvoll, T. A., & Lee, C. H. (2002). Structural maximum a posteriori linear regression for fast HMM adaptation. Computer Speech and Language, 16(1), 5-24.
-
(2002)
Computer Speech and Language
, vol.16
, Issue.1
, pp. 5-24
-
-
Siohan, O.1
Myrvoll, T.A.2
Lee, C.H.3
-
42
-
-
44949221428
-
Analysis of Lombard effect under different types and levels of background noise with application to in-set speaker ID systems
-
Pittsburgh
-
Varadarajan, V. S., & Hansen, J. H. L. (2006). Analysis of Lombard effect under different types and levels of background noise with application to in-set speaker ID systems. In Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006, Pittsburgh (pp. 937-940).
-
(2006)
Proceedings of the ISCA INTERSPEECH-2006/ICSLP-2006
, pp. 937-940
-
-
Varadarajan, V.S.1
Hansen, J.H.L.2
-
43
-
-
0032678104
-
Probabilistic models for topic detection and tracking
-
Walls, F., Jin, H., Sista, S., & Schwartz, R. (1999). Probabilistic models for topic detection and tracking. In Proceedings of the IEEE ICASSP-99: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 521-524).
-
(1999)
Proceedings of the IEEE ICASSP-99: Inter. Conf. Acoust. Speech, Sig. Proc
, vol.1
, pp. 521-524
-
-
Walls, F.1
Jin, H.2
Sista, S.3
Schwartz, R.4
-
44
-
-
0034852839
-
Multi-scale-audio indexing for translingual spoken document retrieval
-
Wang, H.-M., Meng, H., Schone, P., Chen, B., & Lo, W.-K. (2001). Multi-scale-audio indexing for translingual spoken document retrieval. In Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc. (Vol. 1, pp. 605-608).
-
(2001)
Proceedings of the IEEE ICASSP-01: Inter. Conf. Acoust. Speech, Sig. Proc
, vol.1
, pp. 605-608
-
-
Wang, H.-M.1
Meng, H.2
Schone, P.3
Chen, B.4
Lo, W.-K.5
-
47
-
-
85009164449
-
A new perspective on feature extraction for robust in-vehicle speech recognition
-
Geneva
-
Yapanel, U., & Hansen, J. H. L. (2003). A new perspective on feature extraction for robust in-vehicle speech recognition. In Proceedings of Eurospeech-03, Geneva (pp. 1281-1284).
-
(2003)
Proceedings of Eurospeech-03
, pp. 1281-1284
-
-
Yapanel, U.1
Hansen, J.H.L.2
-
49
-
-
22544475615
-
Efficient audio stream segmentation via the T2 statistic based Bayesian information criterion
-
Zhou, B., & Hansen, J. H. L. (2005a). Efficient audio stream segmentation via the T2 statistic based Bayesian information criterion. IEEE Trans. Speech & Audio Proc., 13(4), 467-474.
-
(2005)
IEEE Trans. Speech & Audio Proc
, vol.13
, Issue.4
, pp. 467-474
-
-
Zhou, B.1
Hansen, J.H.L.2
-
50
-
-
22544443963
-
Rapid discriminative acoustic modeling based on eigenspace mapping for fast speaker adaptation
-
Zhou, B., & Hansen, J. H. L. (2005b). Rapid discriminative acoustic modeling based on eigenspace mapping for fast speaker adaptation. IEEE Trans. Speech & Audio Proc., 13(4), 554-564.
-
(2005)
IEEE Trans. Speech & Audio Proc
, vol.13
, Issue.4
, pp. 554-564
-
-
Zhou, B.1
Hansen, J.H.L.2