-
1
-
-
85008533443
-
RT-2002 Evaluation Plan (Version 1. 0)
-
[Online]. Available: http://www.itl.nist.gov/iad/mig//tests/rt/2002/index.html
-
J. Fiscus and J. Garofolo, “RT-2002 Evaluation Plan (Version 1. 0),” NIST, 2002 [Online]. Available: http://www.itl.nist.gov/iad/mig//tests/rt/2002/index.html.
-
(2002)
NIST
-
-
Fiscus, J.1
Garofolo, J.2
-
2
-
-
0001979427
-
Meeting browser: Tracking and summarizing meetings
-
A. Waibel, M. Bett, M. Finke, and R. Stiefelhagen, “Meeting browser: Tracking and summarizing meetings,” in Proc. DARPA Broadcast News Transcript. Understand. Workshop, Lansdowne, VA, 1998, pp. 281–286.
-
(1998)
Proc. DARPA Broadcast News Transcript. Understand. Workshop, Lansdowne, VA
, pp. 281-286
-
-
Waibel, A.1
Bett, M.2
Finke, M.3
Stiefelhagen, R.4
-
3
-
-
44849090969
-
Recognition and understanding of meetings: The AMI and AMIDA projects
-
S. Renals, T. Hain, and H. Bourlard, “Recognition and understanding of meetings: The AMI and AMIDA projects,” in Proc. ASRU′07, 2007, pp. 238–247.
-
(2007)
Proc. ASRU′07
, pp. 238-247
-
-
Renals, S.1
Hain, T.2
Bourlard, H.3
-
4
-
-
78650672858
-
Accessing a large multimodal corpus using an automatic content linking device
-
New York: Springer
-
A. Popescu-Belis, J. Carletta, J. Kilgour, and P. Poller, “Accessing a large multimodal corpus using an automatic content linking device,” in Multimodal Corpora: From Models of Natural Interaction to Systems and Applications. New York: Springer, 2009, vol. 5509, pp. 189–206.
-
(2009)
Multimodal Corpora: From Models of Natural Interaction to Systems and Applications
, vol.5509
, pp. 189-206
-
-
Popescu-Belis, A.1
Carletta, J.2
Kilgour, J.3
Poller, P.4
-
5
-
-
33745536025
-
The 2005 AMI system for the transcription of speech in meetings
-
ser. Lecture Notes in Computer Science. Edinburgh, U. K.: Springer-Verlag
-
T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, I. McCowan, D. Moore, V. Wan, R. Ordelman, and S. Renals, “The 2005 AMI system for the transcription of speech in meetings,” in Machine Learning for Multimodal Interaction, ser. Lecture Notes in Computer Science. Edinburgh, U. K.: Springer-Verlag, 2005, vol. 3869, pp. 450–462.
-
(2005)
Machine Learning for Multimodal Interaction
, vol.3869
, pp. 450-462
-
-
Hain, T.1
Burget, L.2
Dines, J.3
Garau, G.4
Karafiat, M.5
Lincoln, M.6
McCowan, I.7
Moore, D.8
Wan, V.9
Ordelman, R.10
Renals, S.11
-
6
-
-
79959838190
-
The2007AMI (DA) system for meetingtran-scription
-
ser. Lec-ture Notes in Computer Science. New York: Springer-Verlag
-
T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, D. van Leeuwen, and V. Wan, “The2007AMI (DA) system for meetingtran-scription,” in Machine Learning for Multimodal Interaction, ser. Lec-ture Notes in Computer Science. New York: Springer-Verlag, 2007.
-
(2007)
Machine Learning for Multimodal Interaction
-
-
Hain, T.1
Burget, L.2
Dines, J.3
Garau, G.4
Karafiat, M.5
Lincoln, M.6
van Leeuwen, D.7
Wan, V.8
-
7
-
-
85009275134
-
The ISL meeting corpus: The impact of meeting type on speech style
-
S. Burger, V. MacLaren, and H. Yu, “The ISL meeting corpus: The impact of meeting type on speech style,” in Proc. ICSLP, 2002.
-
(2002)
Proc. ICSLP
-
-
Burger, S.1
MacLaren, V.2
Yu, H.3
-
8
-
-
33846265193
-
The AMI meeting corpus
-
J. Carletta, S. Ashby, S. Bourban, M. Guillemot, M. Kronenthal, G. Lathoud, M. Lincoln, I. McCowan, T. Hain, W. Kraaij, W. Post, J. Kadlec, P. Wellner, M. Flynn, and D. Reidsma, “The AMI meeting corpus,” in Proc. MLMI′05, Edinburgh, U. K., 2005.
-
(2005)
Proc. MLMI′05, Edinburgh, U. K.
-
-
Carletta, J.1
Ashby, S.2
Bourban, S.3
Guillemot, M.4
Kronenthal, M.5
Lathoud, G.6
Lincoln, M.7
McCowan, I.8
Hain, T.9
Kraaij, W.10
Post, W.11
Kadlec, J.12
Wellner, P.13
Flynn, M.14
Reidsma, D.15
-
9
-
-
84924134587
-
The NIST meeting room pilot corpus
-
J. Garofolo, C. Laprun, M. Miche, V. Stanford, and E. Tabassi, “The NIST meeting room pilot corpus,” in Proc. LREC′04, 2004.
-
(2004)
Proc. LREC′04
-
-
Garofolo, J.1
Laprun, C.2
Miche, M.3
Stanford, V.4
Tabassi, E.5
-
10
-
-
0141814662
-
The ICSI meeting corpus
-
A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters, “The ICSI meeting corpus,” in Proc. IEEE ICASSP, 2003, pp. 364–367.
-
(2003)
Proc. IEEE ICASSP
, pp. 364-367
-
-
Janin, A.1
Baron, D.2
Edwards, J.3
Ellis, D.4
Gelbart, D.5
Morgan, N.6
Peskin, B.7
Pfau, T.8
Shriberg, E.9
Stolcke, A.10
Wooters, C.11
-
11
-
-
33947697215
-
Strategies for language model web-data collection
-
V. Wan and T. Hain, “Strategies for language model web-data collection,” in Proc. ICASSP′06, 2006, pp. 1069–1072.
-
(2006)
Proc. ICASSP′06
, pp. 1069-1072
-
-
Wan, V.1
Hain, T.2
-
12
-
-
56149083259
-
The AMI meeting transcription system: Progress and performance
-
ser. Lecture Notes in Computer Science. New York: Springer
-
T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, J. Vepa, and V. Wan, “The AMI meeting transcription system: Progress and performance,” in Machine Learning for Multimodal Interaction, ser. Lecture Notes in Computer Science. New York: Springer, 2006, pp. 419–431.
-
(2006)
Machine Learning for Multimodal Interaction
, pp. 419-431
-
-
Hain, T.1
Burget, L.2
Dines, J.3
Garau, G.4
Karafiat, M.5
Lincoln, M.6
Vepa, J.7
Wan, V.8
-
13
-
-
47749119617
-
The ICSI RT07s speaker diariza-tion system
-
ser. LNCS. New York: Springer-Verlag
-
C. Wooters and M. Huijbregts, “The ICSI RT07s speaker diariza-tion system,” in Machine Learning for Multimodal Interaction, ser. LNCS. New York: Springer-Verlag, 2007, vol. 4625, pp. 509–519.
-
(2007)
Machine Learning for Multimodal Interaction
, vol.4625
, pp. 509-519
-
-
Wooters, C.1
Huijbregts, M.2
-
14
-
-
33745217149
-
Transcription of conference room meetings: An investigation
-
T. Hain, G. G. J. Dines, M. Karafiat, D. Moore, V. Wan, R. Ordelman, and S. Renals, “Transcription of conference room meetings: An investigation,” in Proc. Interspeech′05, 2005, pp. 1661–1664.
-
(2005)
Proc. Interspeech′05
, pp. 1661-1664
-
-
Hain, T.1
Dines, G.G.J.2
Karafiat, M.3
Moore, D.4
Wan, V.5
Ordelman, R.6
Renals, S.7
-
15
-
-
33745515429
-
Further progress in meeting recognition: The ICSI-SRI spring 2005 speech-to-text evaluation system
-
ser. LNCS. New York: Springer Verlag
-
A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A. Mandal, B. Peskin, C. Wooters, and J. Zheng, “Further progress in meeting recognition: The ICSI-SRI spring 2005 speech-to-text evaluation system,” in Machine Learning for Multimodal Interaction, ser. LNCS. New York: Springer Verlag, 2005, pp. 463–475.
-
(2005)
Machine Learning for Multimodal Interaction
, pp. 463-475
-
-
Stolcke, A.1
Anguera, X.2
Boakye, K.3
Cetin, O.4
Grezl, F.5
Janin, A.6
Mandal, A.7
Peskin, B.8
Wooters, C.9
Zheng, J.10
-
16
-
-
44949090835
-
Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
-
I. Bulyko, M. Ostendorf, and A. Stolcke, “Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures,” in Proc. Human Lang. Technol. Conf., 2003.
-
(2003)
Proc. Human Lang. Technol. Conf.
-
-
Bulyko, I.1
Ostendorf, M.2
Stolcke, A.3
-
18
-
-
34547548247
-
The AMI system for the transcription of speech in meetings
-
T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, V. Wan, and J. Vepa, “The AMI system for the transcription of speech in meetings,” in Proc. ICASSP′07, 2007, vol. 1, pp. 357–360.
-
(2007)
Proc. ICASSP′07
, vol.1
, pp. 357-360
-
-
Hain, T.1
Burget, L.2
Dines, J.3
Garau, G.4
Karafiat, M.5
Lincoln, M.6
Wan, V.7
Vepa, J.8
-
19
-
-
40249083942
-
The segmentation of multi-channel meeting recordings for automatic speech recognition
-
J. Dines, J. Vepa, and T. Hain, “The segmentation of multi-channel meeting recordings for automatic speech recognition,” Proc. Inter-speech′06, 2006.
-
(2006)
Proc. Inter-speech′06
-
-
Dines, J.1
Vepa, J.2
Hain, T.3
-
20
-
-
44849123928
-
Robust speaker diarization for meetings
-
Ph. D. dissertation, UPC, Barcelona, Spain
-
X. Anguera, “Robust speaker diarization for meetings,” Ph. D. dissertation, UPC, Barcelona, Spain, 2006.
-
(2006)
-
-
Anguera, X.1
-
21
-
-
77249176190
-
The AMI speaker diarization system for IST RT06s meeting data
-
D. A. van Leeuwen and M. Huijbregts, “The AMI speaker diarization system for IST RT06s meeting data,” in Proc. MLMI′06, 2006, pp. 371–384.
-
(2006)
Proc. MLMI′06
, pp. 371-384
-
-
van Leeuwen, D.A.1
Huijbregts, M.2
-
22
-
-
4544265717
-
Discriminative Training for Large Vocabulary Speech, Recognition
-
Ph. D. dissertation, Cambridge Univ., Cambridge, U. K.
-
D. Povey, “Discriminative Training for Large Vocabulary Speech, Recognition,” Ph. D. dissertation, Cambridge Univ., Cambridge, U. K., 2004.
-
(2004)
-
-
Povey, D.1
-
23
-
-
84959118000
-
The Fisher corpus: A resource for the next generations of speech-to-text
-
C. Cieri, D. Miller, and K. Walker, “The Fisher corpus: A resource for the next generations of speech-to-text,” in Proc. LREC′04: 4th Inte. Conf. Lang. Resources Eval., Lisbon, Portugal, 2004.
-
(2004)
Proc. LREC′04: 4th Inte. Conf. Lang. Resources Eval., Lisbon, Portugal
-
-
Cieri, C.1
Miller, D.2
Walker, K.3
-
24
-
-
47749135184
-
Application of CMLLR in narrow band wide band adapted systems
-
M. Karafiat, L. Burget, T. Hain, and J. Cernocky, “Application of CMLLR in narrow band wide band adapted systems,” in Proc 8th Int. Conf. Interspeech′07, Antwerp, Belgium, 2007, p. 4.
-
(2007)
Proc 8th Int. Conf. Interspeech′07, Antwerp, Belgium
, pp. 4
-
-
Karafiat, M.1
Burget, L.2
Hain, T.3
Cernocky, J.4
-
25
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
Apr.
-
J. -L. Gauvain and C. -H. Lee, “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,” IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291–298, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process.
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.-L.1
Lee, C.-H.2
-
26
-
-
0003871508
-
Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition
-
Ph. D. dissertation, John Hopkins Univ., Baltimore, MD
-
N. Kumar, “Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,” Ph. D. dissertation, John Hopkins Univ., Baltimore, MD, 1997.
-
(1997)
-
-
Kumar, N.1
-
27
-
-
70450211380
-
Investigation into bottle-neck features for meeting speech recognition
-
F. Grezl, M. Karafiat, and L. Burget, “Investigation into bottle-neck features for meeting speech recognition,” in Proc. Interspeech′09, 2009, no. 9, pp. 2947–2950.
-
(2009)
Proc. Interspeech′09
, Issue.9
, pp. 2947-2950
-
-
Grezl, F.1
Karafiat, M.2
Burget, L.3
-
28
-
-
33745211419
-
Improvements to fMPE for discriminative training of features
-
D. Povey, “Improvements to fMPE for discriminative training of features,” in Proc. Interspeech, 2005, pp. 2977–2980.
-
(2005)
Proc. Interspeech
, pp. 2977-2980
-
-
Povey, D.1
-
29
-
-
44949102463
-
Recent progress on the discriminative region-dependent transform for speech feature extraction
-
B. Zhang, S. Matsoukas, and R. Schwartz, “Recent progress on the discriminative region-dependent transform for speech feature extraction,” in Proc. Interspeech′06, 2006.
-
(2006)
Proc. Interspeech′06
-
-
Zhang, B.1
Matsoukas, S.2
Schwartz, R.3
-
30
-
-
70450174924
-
Real-time ASR from meetings
-
P. N. Garner, J. Dines, T. Hain, A. E. Hannani, M. Karafiat, D. Kor-chagin, M. Lincoln, V. Wan, and L. Zhang, “Real-time ASR from meetings,” in Proc. Interspeech, 2009.
-
(2009)
Proc. Interspeech
-
-
Garner, P.N.1
Dines, J.2
Hain, T.3
Hannani, A.E.4
Karafiat, M.5
Kor-chagin, D.6
Lincoln, M.7
Wan, V.8
Zhang, L.9
-
32
-
-
74549128196
-
Automatic optimisation of speech decoder parameters
-
A. E. Hannani and T. Hain, “Automatic optimisation of speech decoder parameters,” IEEESignalProcessingLetters, vol. 17, pp. 95–98, 2010.
-
(2010)
IEEESignalProcessingLetters
, vol.17
, pp. 95-98
-
-
Hannani, A.E.1
Hain, T.2
-
33
-
-
0030263447
-
Mean and variance adaptation within the MLLR framework
-
M. J. Gales and P. Woodland, “Mean and variance adaptation within the MLLR framework,” Comput. Speech Lang., vol. 10, pp. 249–264, 1996.
-
(1996)
Comput. Speech Lang.
, vol.10
, pp. 249-264
-
-
Gales, M.J.1
Woodland, P.2
-
34
-
-
85008521982
-
The rich transcription 2007 meeting recognition evaluation
-
ser. Lecture Notes in Computer Science. New York: Springer-Verlag
-
J. G. Fiscus, J. Ajot, and J. S. Garofolo, “The rich transcription 2007 meeting recognition evaluation,” in Machine Learning for Multimodal Interaction, ser. Lecture Notes in Computer Science. New York: Springer-Verlag, 2007.
-
(2007)
Machine Learning for Multimodal Interaction
-
-
Fiscus, J.G.1
Ajot, J.2
Garofolo, J.S.3
|