SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 2, 2012, Pages 486-498

Transcribing Meetings with the AMIDA Systems

(10) Hain, Thomas a El Hannani, Asmaa a Wan, Vincent a Burget, Lukáš b Grézl, Frantisek b Karafiät, Martin b Dines, John c Garner, Philip N c Lincoln, Mike d Huijbregts, Marijn e

a UNIVERSITY OF SHEFFIELD (United Kingdom)

b BRNO UNIVERSITY OF TECHNOLOGY (Czech Republic)

c IDIAP RESEARCH INSTITUTE (Switzerland)

d UNIVERSITY OF EDINBURGH (United Kingdom)

e UNIVERSITY OF TWENTE (Netherlands)

Author keywords

AMI corpus; Juicer; meeting transcription; multiple distant microphone; resource optimisation; rich text

Indexed keywords

EID: 85008520364 PISSN: 15587916 EISSN: 15587924 Source Type: Journal
DOI: 10.1109/TASL.2011.2163395 Document Type: Article

Times cited : (114)

References (34)

1
- 85008533443
- RT-2002 Evaluation Plan (Version 1. 0)
- [Online]. Available: http://www.itl.nist.gov/iad/mig//tests/rt/2002/index.html
- J. Fiscus and J. Garofolo, “RT-2002 Evaluation Plan (Version 1. 0),” NIST, 2002 [Online]. Available: http://www.itl.nist.gov/iad/mig//tests/rt/2002/index.html.
- (2002) NIST
- Fiscus, J.¹ Garofolo, J.²

2
- 0001979427
- Meeting browser: Tracking and summarizing meetings
- A. Waibel, M. Bett, M. Finke, and R. Stiefelhagen, “Meeting browser: Tracking and summarizing meetings,” in Proc. DARPA Broadcast News Transcript. Understand. Workshop, Lansdowne, VA, 1998, pp. 281–286.
- (1998) Proc. DARPA Broadcast News Transcript. Understand. Workshop, Lansdowne, VA , pp. 281-286
- Waibel, A.¹ Bett, M.² Finke, M.³ Stiefelhagen, R.⁴

3
- 44849090969
- Recognition and understanding of meetings: The AMI and AMIDA projects
- S. Renals, T. Hain, and H. Bourlard, “Recognition and understanding of meetings: The AMI and AMIDA projects,” in Proc. ASRU′07, 2007, pp. 238–247.
- (2007) Proc. ASRU′07 , pp. 238-247
- Renals, S.¹ Hain, T.² Bourlard, H.³

4
- 78650672858
- Accessing a large multimodal corpus using an automatic content linking device
- New York: Springer
- A. Popescu-Belis, J. Carletta, J. Kilgour, and P. Poller, “Accessing a large multimodal corpus using an automatic content linking device,” in Multimodal Corpora: From Models of Natural Interaction to Systems and Applications. New York: Springer, 2009, vol. 5509, pp. 189–206.
- (2009) Multimodal Corpora: From Models of Natural Interaction to Systems and Applications , vol.5509 , pp. 189-206
- Popescu-Belis, A.¹ Carletta, J.² Kilgour, J.³ Poller, P.⁴

5
- 33745536025
- The 2005 AMI system for the transcription of speech in meetings
- ser. Lecture Notes in Computer Science. Edinburgh, U. K.: Springer-Verlag
- T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, I. McCowan, D. Moore, V. Wan, R. Ordelman, and S. Renals, “The 2005 AMI system for the transcription of speech in meetings,” in Machine Learning for Multimodal Interaction, ser. Lecture Notes in Computer Science. Edinburgh, U. K.: Springer-Verlag, 2005, vol. 3869, pp. 450–462.
- (2005) Machine Learning for Multimodal Interaction , vol.3869 , pp. 450-462
- Hain, T.¹ Burget, L.² Dines, J.³ Garau, G.⁴ Karafiat, M.⁵ Lincoln, M.⁶ McCowan, I.⁷ Moore, D.⁸ Wan, V.⁹ Ordelman, R.¹⁰ Renals, S.¹¹

6
- 79959838190
- The2007AMI (DA) system for meetingtran-scription
- ser. Lec-ture Notes in Computer Science. New York: Springer-Verlag
- T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, D. van Leeuwen, and V. Wan, “The2007AMI (DA) system for meetingtran-scription,” in Machine Learning for Multimodal Interaction, ser. Lec-ture Notes in Computer Science. New York: Springer-Verlag, 2007.
- (2007) Machine Learning for Multimodal Interaction
- Hain, T.¹ Burget, L.² Dines, J.³ Garau, G.⁴ Karafiat, M.⁵ Lincoln, M.⁶ van Leeuwen, D.⁷ Wan, V.⁸

7
- 85009275134
- The ISL meeting corpus: The impact of meeting type on speech style
- S. Burger, V. MacLaren, and H. Yu, “The ISL meeting corpus: The impact of meeting type on speech style,” in Proc. ICSLP, 2002.
- (2002) Proc. ICSLP
- Burger, S.¹ MacLaren, V.² Yu, H.³

8
- 33846265193
- The AMI meeting corpus
- J. Carletta, S. Ashby, S. Bourban, M. Guillemot, M. Kronenthal, G. Lathoud, M. Lincoln, I. McCowan, T. Hain, W. Kraaij, W. Post, J. Kadlec, P. Wellner, M. Flynn, and D. Reidsma, “The AMI meeting corpus,” in Proc. MLMI′05, Edinburgh, U. K., 2005.
- (2005) Proc. MLMI′05, Edinburgh, U. K.
- Carletta, J.¹ Ashby, S.² Bourban, S.³ Guillemot, M.⁴ Kronenthal, M.⁵ Lathoud, G.⁶ Lincoln, M.⁷ McCowan, I.⁸ Hain, T.⁹ Kraaij, W.¹⁰ Post, W.¹¹ Kadlec, J.¹² Wellner, P.¹³ Flynn, M.¹⁴ Reidsma, D.¹⁵

9
- 84924134587
- The NIST meeting room pilot corpus
- J. Garofolo, C. Laprun, M. Miche, V. Stanford, and E. Tabassi, “The NIST meeting room pilot corpus,” in Proc. LREC′04, 2004.
- (2004) Proc. LREC′04
- Garofolo, J.¹ Laprun, C.² Miche, M.³ Stanford, V.⁴ Tabassi, E.⁵

10
- 0141814662
- The ICSI meeting corpus
- A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, and C. Wooters, “The ICSI meeting corpus,” in Proc. IEEE ICASSP, 2003, pp. 364–367.
- (2003) Proc. IEEE ICASSP , pp. 364-367
- Janin, A.¹ Baron, D.² Edwards, J.³ Ellis, D.⁴ Gelbart, D.⁵ Morgan, N.⁶ Peskin, B.⁷ Pfau, T.⁸ Shriberg, E.⁹ Stolcke, A.¹⁰ Wooters, C.¹¹

11
- 33947697215
- Strategies for language model web-data collection
- V. Wan and T. Hain, “Strategies for language model web-data collection,” in Proc. ICASSP′06, 2006, pp. 1069–1072.
- (2006) Proc. ICASSP′06 , pp. 1069-1072
- Wan, V.¹ Hain, T.²

12
- 56149083259
- The AMI meeting transcription system: Progress and performance
- ser. Lecture Notes in Computer Science. New York: Springer
- T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, J. Vepa, and V. Wan, “The AMI meeting transcription system: Progress and performance,” in Machine Learning for Multimodal Interaction, ser. Lecture Notes in Computer Science. New York: Springer, 2006, pp. 419–431.
- (2006) Machine Learning for Multimodal Interaction , pp. 419-431
- Hain, T.¹ Burget, L.² Dines, J.³ Garau, G.⁴ Karafiat, M.⁵ Lincoln, M.⁶ Vepa, J.⁷ Wan, V.⁸

13
- 47749119617
- The ICSI RT07s speaker diariza-tion system
- ser. LNCS. New York: Springer-Verlag
- C. Wooters and M. Huijbregts, “The ICSI RT07s speaker diariza-tion system,” in Machine Learning for Multimodal Interaction, ser. LNCS. New York: Springer-Verlag, 2007, vol. 4625, pp. 509–519.
- (2007) Machine Learning for Multimodal Interaction , vol.4625 , pp. 509-519
- Wooters, C.¹ Huijbregts, M.²

14
- 33745217149
- Transcription of conference room meetings: An investigation
- T. Hain, G. G. J. Dines, M. Karafiat, D. Moore, V. Wan, R. Ordelman, and S. Renals, “Transcription of conference room meetings: An investigation,” in Proc. Interspeech′05, 2005, pp. 1661–1664.
- (2005) Proc. Interspeech′05 , pp. 1661-1664
- Hain, T.¹ Dines, G.G.J.² Karafiat, M.³ Moore, D.⁴ Wan, V.⁵ Ordelman, R.⁶ Renals, S.⁷

15
- 33745515429
- Further progress in meeting recognition: The ICSI-SRI spring 2005 speech-to-text evaluation system
- ser. LNCS. New York: Springer Verlag
- A. Stolcke, X. Anguera, K. Boakye, O. Cetin, F. Grezl, A. Janin, A. Mandal, B. Peskin, C. Wooters, and J. Zheng, “Further progress in meeting recognition: The ICSI-SRI spring 2005 speech-to-text evaluation system,” in Machine Learning for Multimodal Interaction, ser. LNCS. New York: Springer Verlag, 2005, pp. 463–475.
- (2005) Machine Learning for Multimodal Interaction , pp. 463-475
- Stolcke, A.¹ Anguera, X.² Boakye, K.³ Cetin, O.⁴ Grezl, F.⁵ Janin, A.⁶ Mandal, A.⁷ Peskin, B.⁸ Wooters, C.⁹ Zheng, J.¹⁰

16
- 44949090835
- Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures
- I. Bulyko, M. Ostendorf, and A. Stolcke, “Getting more mileage from web text sources for conversational speech language modeling using class-dependent mixtures,” in Proc. Human Lang. Technol. Conf., 2003.
- (2003) Proc. Human Lang. Technol. Conf.
- Bulyko, I.¹ Ostendorf, M.² Stolcke, A.³

17
- 0012611072
- Entropoy-based pruning of backoff language models
- A. Stolcke, “Entropoy-based pruning of backoff language models,” in Proc. DARPA Broadcast News Transcript. Understand. Workshop, 1998, pp. 270–274.
- (1998) Proc. DARPA Broadcast News Transcript. Understand. Workshop , pp. 270-274
- Stolcke, A.¹

18
- 34547548247
- The AMI system for the transcription of speech in meetings
- T. Hain, L. Burget, J. Dines, G. Garau, M. Karafiat, M. Lincoln, V. Wan, and J. Vepa, “The AMI system for the transcription of speech in meetings,” in Proc. ICASSP′07, 2007, vol. 1, pp. 357–360.
- (2007) Proc. ICASSP′07 , vol.1 , pp. 357-360
- Hain, T.¹ Burget, L.² Dines, J.³ Garau, G.⁴ Karafiat, M.⁵ Lincoln, M.⁶ Wan, V.⁷ Vepa, J.⁸

19
- 40249083942
- The segmentation of multi-channel meeting recordings for automatic speech recognition
- J. Dines, J. Vepa, and T. Hain, “The segmentation of multi-channel meeting recordings for automatic speech recognition,” Proc. Inter-speech′06, 2006.
- (2006) Proc. Inter-speech′06
- Dines, J.¹ Vepa, J.² Hain, T.³

20
- 44849123928
- Robust speaker diarization for meetings
- Ph. D. dissertation, UPC, Barcelona, Spain
- X. Anguera, “Robust speaker diarization for meetings,” Ph. D. dissertation, UPC, Barcelona, Spain, 2006.
- (2006)
- Anguera, X.¹

21
- 77249176190
- The AMI speaker diarization system for IST RT06s meeting data
- D. A. van Leeuwen and M. Huijbregts, “The AMI speaker diarization system for IST RT06s meeting data,” in Proc. MLMI′06, 2006, pp. 371–384.
- (2006) Proc. MLMI′06 , pp. 371-384
- van Leeuwen, D.A.¹ Huijbregts, M.²

22
- 4544265717
- Discriminative Training for Large Vocabulary Speech, Recognition
- Ph. D. dissertation, Cambridge Univ., Cambridge, U. K.
- D. Povey, “Discriminative Training for Large Vocabulary Speech, Recognition,” Ph. D. dissertation, Cambridge Univ., Cambridge, U. K., 2004.
- (2004)
- Povey, D.¹

23
- 84959118000
- The Fisher corpus: A resource for the next generations of speech-to-text
- C. Cieri, D. Miller, and K. Walker, “The Fisher corpus: A resource for the next generations of speech-to-text,” in Proc. LREC′04: 4th Inte. Conf. Lang. Resources Eval., Lisbon, Portugal, 2004.
- (2004) Proc. LREC′04: 4th Inte. Conf. Lang. Resources Eval., Lisbon, Portugal
- Cieri, C.¹ Miller, D.² Walker, K.³

24
- 47749135184
- Application of CMLLR in narrow band wide band adapted systems
- M. Karafiat, L. Burget, T. Hain, and J. Cernocky, “Application of CMLLR in narrow band wide band adapted systems,” in Proc 8th Int. Conf. Interspeech′07, Antwerp, Belgium, 2007, p. 4.
- (2007) Proc 8th Int. Conf. Interspeech′07, Antwerp, Belgium , pp. 4
- Karafiat, M.¹ Burget, L.² Hain, T.³ Cernocky, J.⁴

25
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J. -L. Gauvain and C. -H. Lee, “Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains,” IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291–298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

26
- 0003871508
- Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition
- Ph. D. dissertation, John Hopkins Univ., Baltimore, MD
- N. Kumar, “Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,” Ph. D. dissertation, John Hopkins Univ., Baltimore, MD, 1997.
- (1997)
- Kumar, N.¹

27
- 70450211380
- Investigation into bottle-neck features for meeting speech recognition
- F. Grezl, M. Karafiat, and L. Burget, “Investigation into bottle-neck features for meeting speech recognition,” in Proc. Interspeech′09, 2009, no. 9, pp. 2947–2950.
- (2009) Proc. Interspeech′09 , Issue.9 , pp. 2947-2950
- Grezl, F.¹ Karafiat, M.² Burget, L.³

28
- 33745211419
- Improvements to fMPE for discriminative training of features
- D. Povey, “Improvements to fMPE for discriminative training of features,” in Proc. Interspeech, 2005, pp. 2977–2980.
- (2005) Proc. Interspeech , pp. 2977-2980
- Povey, D.¹

29
- 44949102463
- Recent progress on the discriminative region-dependent transform for speech feature extraction
- B. Zhang, S. Matsoukas, and R. Schwartz, “Recent progress on the discriminative region-dependent transform for speech feature extraction,” in Proc. Interspeech′06, 2006.
- (2006) Proc. Interspeech′06
- Zhang, B.¹ Matsoukas, S.² Schwartz, R.³

30
- 70450174924
- Real-time ASR from meetings
- P. N. Garner, J. Dines, T. Hain, A. E. Hannani, M. Karafiat, D. Kor-chagin, M. Lincoln, V. Wan, and L. Zhang, “Real-time ASR from meetings,” in Proc. Interspeech, 2009.
- (2009) Proc. Interspeech
- Garner, P.N.¹ Dines, J.² Hain, T.³ Hannani, A.E.⁴ Karafiat, M.⁵ Kor-chagin, D.⁶ Lincoln, M.⁷ Wan, V.⁸ Zhang, L.⁹

31
- 79959826088
- Tracter: A lightweight dataflow framework
- Sep.
- P. N. Garner and J. Dines, “Tracter: A lightweight dataflow framework,” in Proc. Interspeech, Makuhari, Japan, Sep. 2010.
- (2010) Proc. Interspeech, Makuhari, Japan
- Garner, P.N.¹ Dines, J.²

32
- 74549128196
- Automatic optimisation of speech decoder parameters
- A. E. Hannani and T. Hain, “Automatic optimisation of speech decoder parameters,” IEEESignalProcessingLetters, vol. 17, pp. 95–98, 2010.
- (2010) IEEESignalProcessingLetters , vol.17 , pp. 95-98
- Hannani, A.E.¹ Hain, T.²

33
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M. J. Gales and P. Woodland, “Mean and variance adaptation within the MLLR framework,” Comput. Speech Lang., vol. 10, pp. 249–264, 1996.
- (1996) Comput. Speech Lang. , vol.10 , pp. 249-264
- Gales, M.J.¹ Woodland, P.²

34
- 85008521982
- The rich transcription 2007 meeting recognition evaluation
- ser. Lecture Notes in Computer Science. New York: Springer-Verlag
- J. G. Fiscus, J. Ajot, and J. S. Garofolo, “The rich transcription 2007 meeting recognition evaluation,” in Machine Learning for Multimodal Interaction, ser. Lecture Notes in Computer Science. New York: Springer-Verlag, 2007.
- (2007) Machine Learning for Multimodal Interaction
- Fiscus, J.G.¹ Ajot, J.² Garofolo, J.S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.