-
4
-
-
77951493947
-
Robust entropy-based endpoint detection for speech recognition in noisy environments
-
paper 0232
-
J.L. Shen, J.W. Hung, and L.S. Lee, "Robust Entropy-Based Endpoint Detection for Speech Recognition in Noisy Environments", Proc. Int. Conf. Spoken Language Process., paper 0232, 1998.
-
(1998)
Proc. Int. Conf. Spoken Language Process.
-
-
Shen, J.L.1
Hung, J.W.2
Lee, L.S.3
-
5
-
-
0032301331
-
A voice activity detection algorithm for communication systems with dynamically varying background acoustic noises
-
I.D. Lee, H.P. Stern, S.A. Mahmoud, "A Voice Activity Detection Algorithm for Communication Systems with Dynamically Varying Background Acoustic Noises," Proc. Veh. Technol. Conf., 1998.
-
(1998)
Proc. Veh. Technol. Conf.
-
-
Lee, I.D.1
Stern, H.P.2
Mahmoud, S.A.3
-
6
-
-
0032762471
-
A statistical model-based voice activity detection
-
J. Sohn, N.S. Kim, and W. Sung, "A Statistical Model-Based Voice Activity Detection", IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, 1999.
-
(1999)
IEEE Signal Process. Lett.
, vol.6
, Issue.1
, pp. 1-3
-
-
Sohn, J.1
Kim, N.S.2
Sung, W.3
-
7
-
-
33846259282
-
Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
-
A. Davis, S. Nordholm, R. Togneri, "Statistical Voice Activity Detection Using Low-Variance Spectrum Estimation and an Adaptive Threshold", IEEE Trans. on Signal Proc., vol. 14, no. 2, pp. 412-424, 2006.
-
(2006)
IEEE Trans. on Signal Proc.
, vol.14
, Issue.2
, pp. 412-424
-
-
Davis, A.1
Nordholm, S.2
Togneri, R.3
-
8
-
-
0002531237
-
Design and preparation of the 1996 hub-4 broadcast news benchmark test corpora
-
J.S. Garofolo, J.G. Fiscus, W.M. Fisher, "Design and preparation of the 1996 hub-4 broadcast news benchmark test corpora," in Proceedings of the DARPA Speech Recognition Workshop, pp. 15-21, 1997.
-
(1997)
Proceedings of the DARPA Speech Recognition Workshop
, pp. 15-21
-
-
Garofolo, J.S.1
Fiscus, J.G.2
Fisher, W.M.3
-
9
-
-
84973386174
-
Corpus description of the ESTER evaluation campaign for the rich transcription of French broadcast news
-
S. Galliano, E. Geoffrois, G. Gravier, J.F. Bonastre, D. Mostefa, K. Choukri. "Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News". In Proceedings of the 5th International Conference on Language Resources and Evaluation 2006.
-
(2006)
Proceedings of the 5th International Conference on Language Resources and Evaluation
-
-
Galliano, S.1
Geoffrois, E.2
Gravier, G.3
Bonastre, J.F.4
Mostefa, D.5
Choukri, K.6
-
10
-
-
33745196882
-
AUDIMUS.MEDIA: A broadcast news speech recognition system for the European Portuguese language
-
Portugal
-
H. Meinedo, D. Caseiro, J. Neto, I. Trancoso. "AUDIMUS.MEDIA: a broadcast news speech recognition system for the European Portuguese language". In Proceedings of PROPOR 2003, Portugal, 2003.
-
(2003)
Proceedings of PROPOR 2003
-
-
Meinedo, H.1
Caseiro, D.2
Neto, J.3
Trancoso, I.4
-
12
-
-
56149126159
-
The RWTH 2007 TC-STAR evaluation system for European English and Spanish
-
J. Loof, Ch. Gollan, S. Hahn, G. Heigold, B. Hoffmeister, Ch. Plahl, D. Rybach, R. Schlüter and H. Ney. "The RWTH 2007 TC-STAR Evaluation System for European English and Spanish". Interspeech 2007.
-
Interspeech 2007
-
-
Loof, J.1
Gollan, Ch.2
Hahn, S.3
Heigold, G.4
Hoffmeister, B.5
Plahl, Ch.6
Rybach, D.7
Schlüter, R.8
Ney, H.9
-
13
-
-
84867198850
-
Towards automatic learning in LVCSR: Rapid development of a Persian broadcast transcription system
-
C. Gollan, H. Ney, "Towards automatic learning in LVCSR: Rapid development of a Persian broadcast transcription system," Interspeech '08.
-
Interspeech '08
-
-
Gollan, C.1
Ney, H.2
-
15
-
-
0012577933
-
The Limsi SDR system for TREC-9
-
Gaithersburg, Md, USA
-
J.-L. Gauvain, L. Lamel, C. Barras, G. Adda, and Y. de Kercadio, "The Limsi SDR system for TREC-9," in Proc. 9th Text Retrieval Conference, TREC-9, pp. 335-341, Gaithersburg, Md, USA, 2000.
-
(2000)
Proc. 9th Text Retrieval Conference, TREC-9
, pp. 335-341
-
-
Gauvain, J.-L.1
Lamel, L.2
Barras, C.3
Adda, G.4
De Kercadio, Y.5
-
16
-
-
85009062679
-
The ICSI-SRI-UW metadata extraction system
-
Korea
-
Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, B. Peskin, and M. Harper. "The ICSI-SRI-UW Metadata Extraction System". ICSLP 2004, International Conf. on Spoken Language Processing, Korea. 2004.
-
(2004)
ICSLP 2004, International Conf. on Spoken Language Processing
-
-
Liu, Y.1
Shriberg, E.2
Stolcke, A.3
Hillard, D.4
Ostendorf, M.5
Peskin, B.6
Harper, M.7
-
18
-
-
0346921386
-
Punctuation annotation using statistical prosody models
-
H. Christensen, Y. Gotoh, and S. Renals, "Punctuation annotation using statistical prosody models," in Proc. of the ISCA Workshop on Prosody in Speech Recognition and Understanding, pp. 35-40, 2001.
-
(2001)
Proc. of the ISCA Workshop on Prosody in Speech Recognition and Understanding
, pp. 35-40
-
-
Christensen, H.1
Gotoh, Y.2
Renals, S.3
-
19
-
-
84919457977
-
The use of prosody in a combined system for punctuation generation and speech recognition
-
J. Kim, P. C. Woodland, "The use of prosody in a combined system for punctuation generation and speech recognition," Proc. Eurospeech '01.
-
Proc. Eurospeech '01
-
-
Kim, J.1
Woodland, P.C.2
-
21
-
-
0034275920
-
Prosody based automatic segmentation of speech into sentences and topics
-
E. Shriberg, A. Stolcke, D. Hakkani-Tür, and G. Tür, "Prosody based automatic segmentation of speech into sentences and topics," Speech Communication, vol. 32, no. 1-2, pp. 127-154, 2000.
-
(2000)
Speech Communication
, vol.32
, Issue.1-2
, pp. 127-154
-
-
Shriberg, E.1
Stolcke, A.2
Hakkani-Tür, D.3
Tür, G.4
-
22
-
-
70349218123
-
Speaker diarization in meeting audio
-
Taipei, April 19-24
-
T. L. Nwe, H. Sun, H. Li, S. Rahardja, "Speaker Diarization in Meeting Audio", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, April 19-24, 2009.
-
(2009)
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009)
-
-
Nwe, T.L.1
Sun, H.2
Li, H.3
Rahardja, S.4
-
23
-
-
84956534244
-
The IBM RT07 evaluation systems for speaker diarization on lecture meetings
-
Springer
-
J. Huang, E. Marcheret, K. Visweswariah, G. Potamianos, "The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings", in Multimodal Technologies for Perception of Humans, Springer, 2008.
-
(2008)
Multimodal Technologies for Perception of Humans
-
-
Huang, J.1
Marcheret, E.2
Visweswariah, K.3
Potamianos, G.4
-
25
-
-
78650898482
-
LIUM SpkDiarization: An open source toolkit for diarization
-
Dallas
-
S. Meignier, T. Merlin. "LIUM SpkDiarization: An Open Source Toolkit For Diarization". CMU Sphinx Workshop 2010, Dallas, 2010.
-
(2010)
CMU Sphinx Workshop 2010
-
-
Meignier, S.1
Merlin, T.2
-
27
-
-
0001848274
-
Development of Spanish Corpora for Speech Research (Albayzin)
-
Italy, 1991
-
F. Casacuberta, R. Garcia, J. Llisterri, C. Nadeu, J.M. Pardo, A. Rubio: "Development of Spanish Corpora for Speech Research (Albayzin)". Workshop on International Cooperation and Standardization of Speech Databases and Speech I/O Assessment Methods, Italy, 1991.
-
Workshop on International Cooperation and Standardization of Speech Databases and Speech I/O Assessment Methods
-
-
Casacuberta, F.1
Garcia, R.2
Llisterri, J.3
Nadeu, C.4
Pardo, J.M.5
Rubio, A.6
-
28
-
-
76749092270
-
The WEKA data mining software: An update
-
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, I. H. Witten. "The WEKA Data Mining Software: An Update"; SIGKDD Explorations, Volume 11, Issue 1. 2009.
-
(2009)
SIGKDD Explorations
, vol.11
, Issue.1
-
-
Hall, M.1
Frank, E.2
Holmes, G.3
Pfahringer, B.4
Reutemann, P.5
Witten, I.H.6
-
29
-
-
79551543112
-
Multext-prosody
-
(Ed.), CD-ROM Distributed by ELRA/ELDA
-
E. Campione (Ed.), Multext-Prosody. A multilingual prosodic database. CD-ROM Distributed by ELRA/ELDA. 1999.
-
(1999)
A Multilingual Prosodic Database
-
-
Campione, E.1