-
1
-
-
34047250740
-
SuperEARS: Multi-site broadcast news system
-
presented at the, Palisades, NY, Nov
-
P. C. Woodland, H. Y. Chan, G. Evermann, M. J. F. Gales, D. Y. Kim, X. A. Liu, D. Mrva, K. C. Sim, L. Wang, K. Yu, J. Makhoul, R. Schwartz, L. Nguyen, S. Matsoukas, B. Xiang, M. Afify, S. Abdou, J.-L. Gauvain, L. Lamel, H. Schwenk, G. Adda, F. Lefevre, D. Vergyri, W. Wang, J. Zheng, A. Venkataraman, R. R. Gadde, and A. Stolcke, "SuperEARS: multi-site broadcast news system," presented at the Proc. Fall 2004 Rich Transcription Workshop (RT-04), Palisades, NY, Nov. 2004.
-
(2004)
Proc. Fall 2004 Rich Transcription Workshop (RT-04)
-
-
Woodland, P.C.1
Chan, H.Y.2
Evermann, G.3
Gales, M.J.F.4
Kim, D.Y.5
Liu, X.A.6
Mrva, D.7
Sim, K.C.8
Wang, L.9
Yu, K.10
Makhoul, J.11
Schwartz, R.12
Nguyen, L.13
Matsoukas, S.14
Xiang, B.15
Afify, M.16
Abdou, S.17
Gauvain, J.-L.18
Lamel, L.19
Schwenk, H.20
Adda, G.21
Lefevre, F.22
Vergyri, D.23
Wang, W.24
Zheng, J.25
Venkataraman, A.26
Gadde, R.R.27
Stolcke, A.28
more..
-
2
-
-
0036296863
-
Minimum phone error and I-smoothing for improved discriminative training
-
Orlando, FL, May
-
D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Orlando, FL, May 2002, pp. 105-108.
-
(2002)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 105-108
-
-
Povey, D.1
Woodland, P.C.2
-
3
-
-
84946740232
-
Recent advances in broadcast news transcription
-
St. Thomas, U.S. Virgin Islands, Nov
-
D. Y. Kim, G. Evermann, T. Hain, D. Mrva, S. E. Tranter, L. Wang, and P. C. Woodland, "Recent advances in broadcast news transcription," in Proc. IEEE ASRU Workshop, St. Thomas, U.S. Virgin Islands, Nov. 2003, pp. 105-110.
-
(2003)
Proc. IEEE ASRU Workshop
, pp. 105-110
-
-
Kim, D.Y.1
Evermann, G.2
Hain, T.3
Mrva, D.4
Tranter, S.E.5
Wang, L.6
Woodland, P.C.7
-
4
-
-
85009165931
-
MMI-MAP and MPE-MAP for acoustic model adaptation
-
Geneva, Switzerland, Sep
-
D. Povey, M. J. F. Gales, D. Y. Kim, and P. C. Woodland, "MMI-MAP and MPE-MAP for acoustic model adaptation," in Proc. Eur. Conf. Speech Commun. Technol., Geneva, Switzerland, Sep. 2003, pp. 1981-1984.
-
(2003)
Proc. Eur. Conf. Speech Commun. Technol
, pp. 1981-1984
-
-
Povey, D.1
Gales, M.J.F.2
Kim, D.Y.3
Woodland, P.C.4
-
5
-
-
4544253838
-
Improving broadcast news transcription by lightly supervised discriminative training
-
Montreal, QC, Canada, Mar
-
H. Y. Chan and P. C. Woodland, "Improving broadcast news transcription by lightly supervised discriminative training," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, Mar. 2004, pp. 737-740.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 737-740
-
-
Chan, H.Y.1
Woodland, P.C.2
-
6
-
-
33646792633
-
Investigation of acoustic modeling techniques for LVCSR systems
-
Philadelphia, PA, Mar
-
X. Liu, M. J. F. Gales, K. C. Sim, and K. Yu, "Investigation of acoustic modeling techniques for LVCSR systems," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Philadelphia, PA, Mar. 2005, pp. 849-852.
-
(2005)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 849-852
-
-
Liu, X.1
Gales, M.J.F.2
Sim, K.C.3
Yu, K.4
-
7
-
-
33745196067
-
-
NIST Speech Group, Online] Available
-
NIST Speech Group. (2004) Fall 2004 rich transcription RT-04f evaluation plan. [Online] Available: http://www.nist.gov/speech/tests/rt/rt2004/fall/
-
(2004)
Fall 2004 rich transcription RT-04f evaluation plan
-
-
-
8
-
-
34047261805
-
An overview of automatic speaker diarization systems
-
Sep
-
S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1555-1563, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.5
, pp. 1555-1563
-
-
Tranter, S.E.1
Reynolds, D.A.2
-
9
-
-
33646357306
-
The Cambridge University March 2005 speaker diarization system
-
Lisbon, Portugal, Sep
-
R. Sinha, S. E. Tranter, M. J. F. Gales, and P. C. Woodland, "The Cambridge University March 2005 speaker diarization system," in Proc. InterSpeech, Lisbon, Portugal, Sep. 2005, pp. 2347-2350.
-
(2005)
Proc. InterSpeech
, pp. 2347-2350
-
-
Sinha, R.1
Tranter, S.E.2
Gales, M.J.F.3
Woodland, P.C.4
-
10
-
-
34047250923
-
The 2004 BBN/LIMSI 10 × RT English broadcast news transcription system
-
Palisades, NY, Nov
-
L. Nguyen, S. Abdou, M. Afify, J. Makhoul, S. Matsoukas, R. Schwartz, B. Xiang, L. Lamel, J. Gauvain, G. Adda, H. Schwenk, and F. Lefevre, "The 2004 BBN/LIMSI 10 × RT English broadcast news transcription system," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), Palisades, NY, Nov. 2004.
-
(2004)
Proc. Fall 2004 Rich Transcription Workshop (RT-04)
-
-
Nguyen, L.1
Abdou, S.2
Afify, M.3
Makhoul, J.4
Matsoukas, S.5
Schwartz, R.6
Xiang, B.7
Lamel, L.8
Gauvain, J.9
Adda, G.10
Schwenk, H.11
Lefevre, F.12
-
11
-
-
0036567851
-
The LIMSI broadcast news transcription system
-
J.-L. Gauvain, L. Lamel, and G. Adda, "The LIMSI broadcast news transcription system," Comput. Speech Lang., pp. 89-108, 2002.
-
(2002)
Comput. Speech Lang
, pp. 89-108
-
-
Gauvain, J.-L.1
Lamel, L.2
Adda, G.3
-
12
-
-
34047259784
-
CTS decoding improvements at IBM
-
presented at the, St. Thomas, U.S. Virgin Islands, Dec
-
G. Saon, D. Povey, and G. Zweig, "CTS decoding improvements at IBM," presented at the Proc. EARS STT Workshop, St. Thomas, U.S. Virgin Islands, Dec. 2003, p. XXX.
-
(2003)
Proc. EARS STT Workshop
-
-
Saon, G.1
Povey, D.2
Zweig, G.3
-
13
-
-
0036460908
-
Lightly supervised and unsupervised acoustic model training
-
L. Lamel and J.-L. Gauvain, "Lightly supervised and unsupervised acoustic model training," Comput. Speech Lang., vol. 16, pp. 115-129, 2002.
-
(2002)
Comput. Speech Lang
, vol.16
, pp. 115-129
-
-
Lamel, L.1
Gauvain, J.-L.2
-
14
-
-
4544273245
-
Light supervision in acoustic model training
-
Montreal, QC, Canada, Mar
-
L. Nguyen and B. Xiang, "Light supervision in acoustic model training," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, Mar. 2004, pp. 185-188.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 185-188
-
-
Nguyen, L.1
Xiang, B.2
-
15
-
-
34047246426
-
Lightly supervised discriminative training for LVCSR,
-
Master's thesis, Cambridge Univ, Cambridge, U.K
-
H. Y. Chan, "Lightly supervised discriminative training for LVCSR," Master's thesis, Cambridge Univ., Cambridge, U.K., 2004.
-
(2004)
-
-
Chan, H.Y.1
-
16
-
-
0030638031
-
A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER)
-
J. G. Fiscus, "A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER)," in Proc. IEEE ASRU Workshop, 1997, pp. 347-352.
-
(1997)
Proc. IEEE ASRU Workshop
, pp. 347-352
-
-
Fiscus, J.G.1
-
17
-
-
84946728861
-
Design of fast LVCSR systems
-
St. Thomas, U.S. Virgin Islands, Nov
-
G. Evermann and P. C. Woodland, "Design of fast LVCSR systems," in Proc. IEEE ASRU Workshop, St. Thomas, U.S. Virgin Islands, Nov. 2003, pp. 7-12.
-
(2003)
Proc. IEEE ASRU Workshop
, pp. 7-12
-
-
Evermann, G.1
Woodland, P.C.2
-
18
-
-
34047255347
-
SRIs 2004 broadcast news speech to text system
-
Palisades, NY, Nov
-
A. Venkataraman, R. Gadde, A. Stolcke, D. Vergyri, W. Wang, and J. Zheng, "SRIs 2004 broadcast news speech to text system," in Proc. Fall 2004 Rich Transcription Workshop (RT-04), Palisades, NY, Nov. 2004.
-
(2004)
Proc. Fall 2004 Rich Transcription Workshop (RT-04)
-
-
Venkataraman, A.1
Gadde, R.2
Stolcke, A.3
Vergyri, D.4
Wang, W.5
Zheng, J.6
-
19
-
-
0141809272
-
E-HMM approach for learning and adapting sound models for speaker indexing
-
Crete, Greece, Jun
-
S. Meignier, J.-F. Bonastre, and S. Igounet, "E-HMM approach for learning and adapting sound models for speaker indexing," in Proc. Odyssey Speaker and Language Recognition Workshop, Crete, Greece, Jun. 2001, pp. 175-180.
-
(2001)
Proc. Odyssey Speaker and Language Recognition Workshop
, pp. 175-180
-
-
Meignier, S.1
Bonastre, J.-F.2
Igounet, S.3
-
20
-
-
33745185104
-
Combining speaker identification and BIC for speaker diarization
-
Lisbon, Portugal, Sep
-
X. Zhu, C. Barras, S. Meignier, and J.-L. Gauvain, "Combining speaker identification and BIC for speaker diarization," in Proc. InterSpeech, Lisbon, Portugal, Sep. 2005, pp. 2441-2444.
-
(2005)
Proc. InterSpeech
, pp. 2441-2444
-
-
Zhu, X.1
Barras, C.2
Meignier, S.3
Gauvain, J.-L.4
-
21
-
-
34047257943
-
-
S. E. Tranter, K. Yu, D. A. Reynolds, G. Evermann, D. Y. Kim, and P. C. Woodland, An investigation into the interactions between speaker diarization systems and automatic speech transcription, Cambridge Univ. Eng. Dept., Tech. Rep. CUED/F-INFENG/TR-464, 2003.
-
S. E. Tranter, K. Yu, D. A. Reynolds, G. Evermann, D. Y. Kim, and P. C. Woodland, "An investigation into the interactions between speaker diarization systems and automatic speech transcription," Cambridge Univ. Eng. Dept., Tech. Rep. CUED/F-INFENG/TR-464, 2003.
-
-
-
-
22
-
-
85128356454
-
Partitioning and transcription of broadcast news data
-
Sydney, Australia, Dec
-
J.-L. Gauvain, L. Lamel, and G. Adda, "Partitioning and transcription of broadcast news data," in Proc. Int. Conf. Spoken Lang. Process., vol. 4, Sydney, Australia, Dec. 1998, pp. 1335-1338.
-
(1998)
Proc. Int. Conf. Spoken Lang. Process
, vol.4
, pp. 1335-1338
-
-
Gauvain, J.-L.1
Lamel, L.2
Adda, G.3
-
23
-
-
33745217352
-
The BBN RT04 English broadcast news transcription system
-
Lisbon, Portugal
-
L. Nguyen, B. Xiang, M. Afify, S. Abdou, S. Matsoukas, R. Schwartz, and J. Makhoul, "The BBN RT04 English broadcast news transcription system," in Proc. InterSpeech, Lisbon, Portugal, 2005, pp. 1673-1676.
-
(2005)
Proc. InterSpeech
, pp. 1673-1676
-
-
Nguyen, L.1
Xiang, B.2
Afify, M.3
Abdou, S.4
Matsoukas, S.5
Schwartz, R.6
Makhoul, J.7
-
24
-
-
0036567794
-
The development of the HTK broadcast news transcription system: An overview
-
P. C. Woodland, "The development of the HTK broadcast news transcription system: An overview," Speech Commun., vol. 37, pp. 47-67, 2002.
-
(2002)
Speech Commun
, vol.37
, pp. 47-67
-
-
Woodland, P.C.1
-
25
-
-
33646788079
-
Development of the CU-HTK 2004 broadcast news transcription systems
-
Philadelphia, PA, Mar
-
D. Y. Kim, H. Y. Chan, G. Evermann, M. J. F. Gales, D. Mrva, K. C. Sim, and P. C. Woodland, "Development of the CU-HTK 2004 broadcast news transcription systems," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Philadelphia, PA, Mar. 2005, pp. 861-864.
-
(2005)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 861-864
-
-
Kim, D.Y.1
Chan, H.Y.2
Evermann, G.3
Gales, M.J.F.4
Mrva, D.5
Sim, K.C.6
Woodland, P.C.7
-
26
-
-
0003871508
-
Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,
-
Ph.D. dissertation, John Hopkins Univ, Baltimore, MD
-
N. Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, John Hopkins Univ., Baltimore, MD, 1997.
-
(1997)
-
-
Kumar, N.1
-
27
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markov models
-
May
-
M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
-
(1999)
IEEE Trans. Speech Audio Process
, vol.7
, Issue.3
, pp. 272-281
-
-
Gales, M.J.F.1
-
28
-
-
34047247141
-
-
S. J. Young, G. Evermann, M. J. F. Gales, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK Book, version 3.3. Cambridge, U.K, Cambridge Univ. Eng. Dept, 2005
-
S. J. Young, G. Evermann, M. J. F. Gales, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK Book, version 3.3. Cambridge, U.K.: Cambridge Univ. Eng. Dept., 2005.
-
-
-
-
29
-
-
0002144369
-
Tree-based state tying for high accuracy acoustic modeling
-
S. J. Young, J. Odell, and P. C. Woodland, "Tree-based state tying for high accuracy acoustic modeling," in Proc. ARPA Human Lang. Technol. Workshop, 1994, pp. 307-312.
-
(1994)
Proc. ARPA Human Lang. Technol. Workshop
, pp. 307-312
-
-
Young, S.J.1
Odell, J.2
Woodland, P.C.3
-
30
-
-
0036461035
-
Large scale discriminative training of hidden Markov models for speech recognition
-
P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models for speech recognition," Comput. Speech Lang., vol. 16, pp. 25-47, 2002.
-
(2002)
Comput. Speech Lang
, vol.16
, pp. 25-47
-
-
Woodland, P.C.1
Povey, D.2
-
31
-
-
0028419019
-
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
-
Apr
-
J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
-
(1994)
IEEE Trans. Speech Audio Process
, vol.2
, Issue.2
, pp. 291-298
-
-
Gauvain, J.-L.1
Lee, C.-H.2
-
32
-
-
84891308106
-
SRILM - an extensible language modeling toolkit
-
Denver, CO, Sep
-
A. Stolcke, "SRILM - an extensible language modeling toolkit," in Proc. Int. Conf. Spoken Lang. Process., Denver, CO, Sep. 2002, pp. 901-904.
-
(2002)
Proc. Int. Conf. Spoken Lang. Process
, pp. 901-904
-
-
Stolcke, A.1
-
34
-
-
0030366664
-
Iterative unsupervised adaptation using maximum likelihood linear regression
-
Philadelphia, PA
-
P. Woodland, D. Pye, and M. Gales, "Iterative unsupervised adaptation using maximum likelihood linear regression," in Proc. Int. Conf. Spoken Lang. Process., Philadelphia, PA, 1996, pp. 1133-1136.
-
(1996)
Proc. Int. Conf. Spoken Lang. Process
, pp. 1133-1136
-
-
Woodland, P.1
Pye, D.2
Gales, M.3
-
35
-
-
33745225187
-
The 2004 BBN 1 × RT recognition systems for English broadcast news and conversational telephone speech
-
Lisbon, Portugal, Sep
-
S. Matsoukas, R. Prasad, S. Laxminarayan, B. Xiang, L. Nguyen, and R. Schwartz, "The 2004 BBN 1 × RT recognition systems for English broadcast news and conversational telephone speech," in Proc. Inter-Speech, Lisbon, Portugal, Sep. 2005, pp. 1641-1644.
-
(2005)
Proc. Inter-Speech
, pp. 1641-1644
-
-
Matsoukas, S.1
Prasad, R.2
Laxminarayan, S.3
Xiang, B.4
Nguyen, L.5
Schwartz, R.6
-
36
-
-
85009192356
-
An architecture for rapid decoding of large vocabulary conversational speech
-
Geneva, Switzerland, Sep
-
G. Saon, G. Zweig, B. Kingsbury, L. Mangu, and U. Chaudhari, "An architecture for rapid decoding of large vocabulary conversational speech," in Proc. Eur. Conf. Speech Commun. Technol., Geneva, Switzerland, Sep. 2003, pp. 1977-1980.
-
(2003)
Proc. Eur. Conf. Speech Commun. Technol
, pp. 1977-1980
-
-
Saon, G.1
Zweig, G.2
Kingsbury, B.3
Mangu, L.4
Chaudhari, U.5
-
37
-
-
4544253834
-
Posterior probability decoding, confidence estimation, and system combination
-
College Park, MD, May
-
G. Evermann and P. C. Woodland, "Posterior probability decoding, confidence estimation, and system combination," in Proc. Speech Transcription Workshop, College Park, MD, May 2000.
-
(2000)
Proc. Speech Transcription Workshop
-
-
Evermann, G.1
Woodland, P.C.2
-
38
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
-
C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs," Comput. Speech Lang., vol. 9, pp. 171-186, 1995.
-
(1995)
Comput. Speech Lang
, vol.9
, pp. 171-186
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
39
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
-
(1998)
Comput. Speech Lang
, vol.12
, pp. 75-98
-
-
Gales, M.J.F.1
-
41
-
-
4544324761
-
Implicit pronunciation modeling in ASR
-
T. Hain, "Implicit pronunciation modeling in ASR," in Proc. ISCA ITRW PMLA, 2002.
-
(2002)
Proc. ISCA ITRW PMLA
-
-
Hain, T.1
-
42
-
-
4544373872
-
Basis superposition precision matrix modeling for large vocabulary continuous speech recognition
-
K. C. Sim and M. J. F. Gales, "Basis superposition precision matrix modeling for large vocabulary continuous speech recognition," in Proc. ICASSP, 2004, pp. 801-804.
-
(2004)
Proc. ICASSP
, pp. 801-804
-
-
Sim, K.C.1
Gales, M.J.F.2
-
43
-
-
4544253619
-
Adaptive training using structured transforms
-
K. Yu and M. J. F. Gales, "Adaptive training using structured transforms," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2004, pp. 317-320.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 317-320
-
-
Yu, K.1
Gales, M.J.F.2
-
44
-
-
33646821390
-
Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system
-
Philadelphia, PA, Mar
-
M. J. F. Gales, B. Jia, X. Liu, K. C. Sim, P. C. Woodland, and K. Yu, "Development of the CUHTK 2004 Mandarin conversational telephone speech transcription system," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Philadelphia, PA, Mar. 2005, pp. 841-844.
-
(2005)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 841-844
-
-
Gales, M.J.F.1
Jia, B.2
Liu, X.3
Sim, K.C.4
Woodland, P.C.5
Yu, K.6
-
45
-
-
85135271674
-
Finding consensus among words: Lattice-based word error minimization
-
L. Mangu, E. Brill, and A. Stolcke, "Finding consensus among words: Lattice-based word error minimization," in Proc. Eur. Conf. Speech Commun. Technol., 1999, pp. 495-498.
-
(1999)
Proc. Eur. Conf. Speech Commun. Technol
, pp. 495-498
-
-
Mangu, L.1
Brill, E.2
Stolcke, A.3
-
46
-
-
33947657269
-
Error analysis of the BN and CTS results
-
presented at the, St. Thomas, U.S. Virgin Islands, Dec
-
N. Duta and R. Schwartz, "Error analysis of the BN and CTS results," presented at the Proc. EARS STT Workshop, St. Thomas, U.S. Virgin Islands, Dec. 2003.
-
(2003)
Proc. EARS STT Workshop
-
-
Duta, N.1
Schwartz, R.2
-
47
-
-
0025680226
-
Tools for the analysis of benchmark speech recognition tests
-
D. S. Pallett, W. M. Fisher, and J. G. Fiscus, "Tools for the analysis of benchmark speech recognition tests," in Proc. IEEE Int. Conf. Acoust., Speech. Signal Process., 1990.
-
(1990)
Proc. IEEE Int. Conf. Acoust., Speech. Signal Process
-
-
Pallett, D.S.1
Fisher, W.M.2
Fiscus, J.G.3
|