SCOPUS Information Search Platform

Computer Speech and Language

Volume 26, Issue 2, 2012, Pages 67-89

Integrating imperfect transcripts into speech recognition systems for building high-quality corpora

Authors (3): Lecouteux, Benjamin (a); Linarés, Georges (b); Oger, Stanislas (b)

(a) UNIV GRENOBLE ALPES (France)

(b) UNIVERSITY OF AVIGNON (France)

Author keywords

Acoustic model training; Speech processing; Text to speech alignment

Indexed keywords

ACOUSTIC MODEL; AUTOMATIC SPEECH RECOGNITION SYSTEM; CORRECT ERROR; DECODING STRATEGY; HIGH QUALITY; LOW-COST SOLUTION; SEARCH ALGORITHMS; SPEECH CORPORA; SPEECH RECOGNITION SYSTEMS; SPEECH SIGNALS; TEMPORAL INFORMATION; TEXT TO SPEECH; TRAINING CORPUS;

ALIGNMENT; COMMUNICATION CHANNELS (INFORMATION THEORY); SOFTWARE AGENTS; SPEECH COMMUNICATION; SPEECH PROCESSING;

SPEECH RECOGNITION;

EID: 80055054639 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2011.06.001 Document Type: Article

Times cited : (13)

References (52)

1
- 0000286376
- Using dynamic time warping to find patterns in time series
- D. Berndt, and J. Clifford Using dynamic time warping to find patterns in time series Workshop on Knowledge Discovery in Databases (KDD'94) 1994 359 370
- (1994) Workshop on Knowledge Discovery in Databases (KDD'94) , pp. 359-370
- Berndt, D.¹ Clifford, J.²

2
- 33646807492
- Alize, a free toolkit for speaker recognition
- J.-F. Bonastre, F. Wils, and S. Meignier Alize, a free toolkit for speaker recognition Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'05), vol. 1 2005 737 740
- (2005) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'05), Vol. 1 , pp. 737-740
- Bonastre, J.-F.¹ Wils, F.² Meignier, S.³

3
- 0028467635
- Automatic speech recognition in machine aided translation
- P. Brown, S. Chen, S.D. Pietra, V.D. Pietra, S. Kehler, and R. Mercer Automatic speech recognition in machine aided translation Computer Speech and Language 8 1994 177 187
- (1994) Computer Speech and Language , vol.8 , pp. 177-187
- Brown, P.¹ Chen, S.² Pietra, S.D.³ Pietra, V.D.⁴ Kehler, S.⁵ Mercer, R.⁶

4
- 33745188444
- Segmentation of recordings based on partial transcriptions
- P. Cardinal, G. Boulianne, and M. Comeau Segmentation of recordings based on partial transcriptions Proc. Interspeech'05 2005 3345 3348
- (2005) Proc. Interspeech'05 , pp. 3345-3348
- Cardinal, P.¹ Boulianne, G.² Comeau, M.³

5
- 4544253838
- Improving broadcast news transcription by lightly supervised discriminative training
- H.Y. Chan, and P. Woodland Improving broadcast news transcription by lightly supervised discriminative training Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04), vol. 1 2004 737 740
- (2004) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04), Vol. 1 , pp. 737-740
- Chan, H.Y.¹ Woodland, P.²

6
- 85009064174
- Dynamic language modeling for broadcast news
- L. Chen, J.-L. Gauvain, L. Lamel, and G. Adda Dynamic language modeling for broadcast news Proc. International Conference on Spoken Language Processing (ICSLP'04) 2004 1281 1284
- (2004) Proc. International Conference on Spoken Language Processing (ICSLP'04) , pp. 1281-1284
- Chen, L.¹ Gauvain, J.-L.² Lamel, L.³ Adda, G.⁴

7
- 4544315111
- Lightly supervised acoustic model training using consensus networks
- L. Chen, L. Lamel, and J.-L. Gauvain Lightly supervised acoustic model training using consensus networks Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04), vol. 1 2004 189 192
- (2004) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04), Vol. 1 , pp. 189-192
- Chen, L.¹ Lamel, L.² Gauvain, J.-L.³

8
- 80055060274
- Evaluation of ASR systems, algorithms and databases
- G. Chollet Evaluation of ASR systems, algorithms and databases Speech Recognition and Coding: New Advances and Trends 1995 32 40
- (1995) Speech Recognition and Coding: New Advances and Trends , pp. 32-40
- Chollet, G.¹

9
- 0030715425
- Language model adaptation using mixtures and an exponentially decaying cache
- P. Clarkson, and A. Robinson Language model adaptation using mixtures and an exponentially decaying cache Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'97), vol. 2 1997 799 802
- (1997) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'97), Vol. 2 , pp. 799-802
- Clarkson, P.¹ Robinson, A.²

10
- 0004116989
- MIT Press
- T. Cormen, C. Leiserson, R. Rivest, and C. Stein Introduction to Algorithms 2001 MIT Press
- (2001) Introduction to Algorithms
- Cormen, T.¹ Leiserson, C.² Rivest, R.³ Stein, C.⁴

11
- 0030635306
- Flexible transcription alignment
- M. Finke, and A. Waibel Flexible transcription alignment Proc. IEEE Workshop Automatic Speech Recognition and Understanding (ASRU'97) 1997 34 40
- (1997) Proc. IEEE Workshop Automatic Speech Recognition and Understanding (ASRU'97) , pp. 34-40
- Finke, M.¹ Waibel, A.²

12
- 47749152568
- The rich transcription 2007 meeting recognition evaluation
- J.G. Fiscus, J. Ajot, and J.S. Garofolo The rich transcription 2007 meeting recognition evaluation Multimodal Technologies for Perception of Humans: International Evaluation Workshops CLEAR'07 and RT'07 2008 373 389
- (2008) Multimodal Technologies for Perception of Humans: International Evaluation Workshops CLEAR'07 and RT'07 , pp. 373-389
- Fiscus, J.G.¹ Ajot, J.² Garofolo, J.S.³

13
- 33745224977
- The ESTER phase II evaluation campaign for the rich transcription of French broadcast news
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- S. Galliano, E. Geoffrois, D. Mostefa, K. Choukri, J.-F. Bonastre, and G. Gravier The ester phase 2 based evaluation campaign for the rich transcription of french broadcast news Proc. of the European Conference on Speech Communication and Technology (ICSLP'05) 2005 1149 1152 (Pubitemid 43908270)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 1149-1152
- Galliano, S.¹ Geoffrois, E.² Mostefa, D.³ Choukri, K.⁴ Bonastre, J.-F.⁵ Gravier, G.⁶

14
- 46449097482
- Alignment of speech to highly imperfect text transcriptions
- J.R. Kender
- A. Haubold, and J.R. Kender Alignment of speech to highly imperfect text transcriptions J.R. Kender, Proc. IEEE International Conference on Multimedia and Expo (ICME'07) 2007 224 227
- (2007) Proc. IEEE International Conference on Multimedia and Expo (ICME'07) , pp. 224-227
- Haubold, A.¹ Kender, J.R.²

15
- 0002910412
- Stemming algorithms: A case study for detailed evaluation
- D.A. Hull Stemming algorithms: a case study for detailed evaluation Journal of the American Society of Information Science 47 1996 70 84 (Pubitemid 126582657)
- (1996) Journal of the American Society for Information Science , vol.47 , Issue.1 , pp. 70-84
- Hull, D.A.¹

16
- 84944044707
- Clustering of imperfect transcripts using a novel similarity measure
- O. Ibrahimov, I.K. Sethi, and N. Dimitrova Clustering of imperfect transcripts using a novel similarity measure Information Retrieval Techniques for Speech Applications 1 2002 23 34
- (2002) Information Retrieval Techniques for Speech Applications , vol.1 , pp. 23-34
- Ibrahimov, O.¹ Sethi, I.K.² Dimitrova, N.³

17
- 0032785782
- Modeling long distance dependence in language: Topic mixtures versus dynamic cache models
- R. Iyer, and M. Ostendorf Modeling long distance dependence in language: topic mixtures versus dynamic cache models IEEE Transactions on Speech and Audio Processing 7 Jan 1999 30 39
- (1999) IEEE Transactions on Speech and Audio Processing , vol.7 , pp. 30-39
- Iyer, R.¹ Ostendorf, M.²

18
- 0001882615
- Self-organized language modeling for speech recognition
- F. Jelinek Self-organized language modeling for speech recognition Language Processing for Speech Recognition 1990 450 506
- (1990) Language Processing for Speech Recognition , pp. 450-506
- Jelinek, F.¹

19
- 85135261720
- Unsupervised training of a speech recognizer: Recent experiments
- T. Kemp, and A. Waibel Unsupervised training of a speech recognizer: recent experiments Eurospeech'99 1999 2725 2728
- (1999) Eurospeech'99 , pp. 2725-2728
- Kemp, T.¹ Waibel, A.²

20
- 35248838963
- Derivative dynamic time warping
- E. Keogh, and M. Pazzani Derivative dynamic time warping International Conference on Data Mining (SDM'01) 2001
- (2001) International Conference on Data Mining (SDM'01)
- Keogh, E.¹ Pazzani, M.²

21
- 0025446887
- Cache-based natural language model for speech recognition
- DOI 10.1109/34.56193
- R. Kuhn, and R. De Mori A cache-based natural language model for speech recognition IEEE Transactions on Pattern Analysis and Machine Intelligence 12 1990 570 583 (Pubitemid 20724489)
- (1990) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.12 , Issue.6 , pp. 570-583
- Kuhn Roland¹ De Mori Renato²

22
- 0036460908
- Lightly supervised and unsupervised acoustic models training
- L. Lamel, J.-L. Gauvain, and G. Adda Lightly supervised and unsupervised acoustic models training Computer Speech and Language 16 2002 115 129
- (2002) Computer Speech and Language , vol.16 , pp. 115-129
- Lamel, L.¹ Gauvain, J.-L.² Adda, G.³

23
- 53149086681
- Using prompts to produce quality corpus for training automatic speech recognition systems
- B. Lecouteux, and G. Linarés Using prompts to produce quality corpus for training automatic speech recognition systems Proc. 14th IEEE Mediterranean Electrotechnical Conference (MELECON'08) 2008 841 846
- (2008) Proc. 14th IEEE Mediterranean Electrotechnical Conference (MELECON'08) , pp. 841-846
- Lecouteux, B.¹ Linarés, G.²

24
- 80052124654
- Text island spotting in large speech databases
- B. Lecouteux, G. Linarés, F. Beaugendre, and P. Nocéra Text island spotting in large speech databases Interspeech'07 2007 1318 1321
- (2007) Interspeech'07 , pp. 1318-1321
- Lecouteux, B.¹ Linarés, G.² Beaugendre, F.³ Nocéra, P.⁴

25
- 44949259199
- Imperfect transcript driven speech recognition
- B. Lecouteux, G. Linarés, J. Bonastre, and P. Nocéra Imperfect transcript driven speech recognition InterSpeech'06 2006 1626 1629
- (2006) InterSpeech'06 , pp. 1626-1629
- Lecouteux, B.¹ Linarés, G.² Bonastre, J.³ Nocéra, P.⁴

26
- 34547506927
- System combination by driven decoding
- B. Lecouteux, G. Linarés, Y. Estève, and J. Mauclair System combination by driven decoding Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'07), vol. 4 2007 341 344
- (2007) Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'07), Vol. 4 , pp. 341-344
- Lecouteux, B.¹ Linarés, G.² Estève, Y.³ Mauclair, J.⁴

27
- 38049119734
- The lia speech recognition system: From 10xrt to 1xrt
- G. Linarés, P. Nocéra, D. Massonié, and D. Matrouf The lia speech recognition system: from 10xrt to 1xrt Proc. of the 10th international conference on Text, Speech and Dialogue (TSD'07) 2007 302 308
- (2007) Proc. of the 10th International Conference on Text, Speech and Dialogue (TSD'07) , pp. 302-308
- Linarés, G.¹ Nocéra, P.² Massonié, D.³ Matrouf, D.⁴

28
- 44849113247
- Dynamic language modeling for a daily broadcast news transcription system
- C. Martins, A. Teixeira, and J. Neto Dynamic language modeling for a daily broadcast news transcription system Proc. Automatic Speech Recognition & Understanding IEEE Workshop (ASRU'07) 2007 165 170
- (2007) Proc. Automatic Speech Recognition & Understanding IEEE Workshop (ASRU'07) , pp. 165-170
- Martins, C.¹ Teixeira, A.² Neto, J.³

29
- 33745202617
- Scalable language model look-ahead for LVCSR
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- D. Massonié, P. Nocéra, and G. Linarés Scalable language model look-ahead for lvcsr Proc. of InterSpeech'05 2005 569 572 (Pubitemid 43908126)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 569-572
- Massonie, D.¹ Nocera, P.² Linares, G.³

30
- 53149145640
- Edit-distance of weighted automata
- M. Mohri Edit-distance of weighted automata Conference on Implementation and Application of Automata (CIAA'02) 2002 1 23
- (2002) Conference on Implementation and Application of Automata (CIAA'02) , pp. 1-23
- Mohri, M.¹

31
- 84885726863
- A recursive algorithm for the forced alignment of very long audio segments
- P.J. Moreno, C. Joerg, J.-M.V. Thong, and O. Glickman A recursive algorithm for the forced alignment of very long audio segments International Conference on Spoken Language Processing (ICSLP'98) 1998
- (1998) International Conference on Spoken Language Processing (ICSLP'98)
- Moreno, P.J.¹ Joerg, C.² Thong, J.-M.V.³ Glickman, O.⁴

32
- 0027929445
- On structuring probabilistic dependencies in stochastic language modeling
- H. Ney, U. Essen, and R. Kneser On structuring probabilistic dependencies in stochastic language modeling Computer Speech and Language 8 1994 1 38
- (1994) Computer Speech and Language , vol.8 , pp. 1-38
- Ney, H.¹ Essen, U.² Kneser, R.³

33
- 4544273245
- Light supervision in acoustic model training
- L. Nguyen, and B. Xiang Light supervision in acoustic model training Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04), vol. 1 2004 185 188
- (2004) Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'04), Vol. 1 , pp. 185-188
- Nguyen, L.¹ Xiang, B.²

34
- 33745195228
- Document driven machine translation enhanced ASR
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- M. Paulik, C. Fügen, S. Stüker, T. Schultz, T. Schaaf, and A. Waibel Document driven machine translation enhanced ASR Proc. Interspeech'05 2005 2261 2264 (Pubitemid 43908546)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 2261-2264
- Paulik, M.¹ Fugen, C.² Stuker, S.³ Schultz, T.⁴ Schaaf, T.⁵ Waibel, A.⁶

35
- 84867216798
- Lightly supervised acoustic model training on epps recordings
- M. Paulik, and A. Waibel Lightly supervised acoustic model training on epps recordings Proc. Interspeech'08 2008 224 227
- (2008) Proc. Interspeech'08 , pp. 224-227
- Paulik, M.¹ Waibel, A.²

36
- 34547522348
- Reconstructing medical dictations from automatically recognized and non-literal transcripts with phonetic similarity matching
- S. Petrik, and G. Kubin Reconstructing medical dictations from automatically recognized and non-literal transcripts with phonetic similarity matching Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'07), vol. 4 2007 1125 1128
- (2007) Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'07), Vol. 4 , pp. 1125-1128
- Petrik, S.¹ Kubin, G.²

37
- 51449103680
- Automatic phonetics-driven reconstruction of medical dictations on multiple levels of segmentation
- S. Petrik, and F. Pernkopf Automatic phonetics-driven reconstruction of medical dictations on multiple levels of segmentation Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'08) 2008 4317 4320
- (2008) Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'08) , pp. 4317-4320
- Petrik, S.¹ Pernkopf, F.²

38
- 0008746009
- The 1996 hub-4 sphinx-3 system
- P. Placeway, S. Chen, M. Eskenazi, U. Jain, V. Parikh, B. Raj, M. Ravishankar, R. Rosenfeld, K. Seymore, M. Siegler, R. Stern, and E. Thayer The 1996 hub-4 sphinx-3 system Proc. of the 1997 ARPA Speech Recognition Workshop 1997 85 89
- (1997) Proc. of the 1997 ARPA Speech Recognition Workshop , pp. 85-89
- Placeway, P.¹ Chen, S.² Eskenazi, M.³ Jain, U.⁴ Parikh, V.⁵ Raj, B.⁶ Ravishankar, M.⁷ Rosenfeld, R.⁸ Seymore, K.⁹ Siegler, M.¹⁰ Stern, R.¹¹ Thayer, E.¹²

39
- 0030352958
- Cheating with imperfect transcripts
- P. Placeway, and J. Lafferty Cheating with imperfect transcripts Proc. International Conference on Spoken Language (ICSLP'96), vol. 4 1996 2115 2118
- (1996) Proc. International Conference on Spoken Language (ICSLP'96), Vol. 4 , pp. 2115-2118
- Placeway, P.¹ Lafferty, J.²

40
- 67649537413
- On-the-fly term spotting by phonetic filtering and request-driven decoding
- M. Rouvier, G. Linarés, and B. Lecouteux On-the-fly term spotting by phonetic filtering and request-driven decoding Proc. IEEE Spoken Language Technology Workshop (SLT'08) 2008 305 308
- (2008) Proc. IEEE Spoken Language Technology Workshop (SLT'08) , pp. 305-308
- Rouvier, M.¹ Linarés, G.² Lecouteux, B.³

41
- 0003882234
- Addison-Wesley Longman Publishing Company
- G. Salton Automatic Text Processing 1988 Addison-Wesley Longman Publishing Company
- (1988) Automatic Text Processing
- Salton, G.¹

42
- 45549117987
- Term-weighting approaches in automatic text retrieval
- G. Salton, and C. Buckley Term-weighting approaches in automatic text retrieval Information Processing & Management 24 1988 513 523
- (1988) Information Processing & Management , vol.24 , pp. 513-523
- Salton, G.¹ Buckley, C.²

43
- 0019887799
- Identification of common molecular subsequences
- T.F. Smith, and M.S. Waterman Identification of common molecular subsequences Journal of Molecular Biology 147 1981 195 197
- (1981) Journal of Molecular Biology , vol.147 , pp. 195-197
- Smith, T.F.¹ Waterman, M.S.²

44
- 0006273615
- Specifications of the 1996 hub-4 broadcast news evaluation
- R. Stern Specifications of the 1996 hub-4 broadcast news evaluation Proc. of the DARPA Speech Recognition Workshop 1997
- (1997) Proc. of the DARPA Speech Recognition Workshop
- Stern, R.¹

45
- 85124698057
- The architecture of the festival speech synthesis system
- P. Taylor, A. Black, and R. Caley The architecture of the festival speech synthesis system Proc. of the third ESCA Workshop in Speech Synthesis 1998 147 151
- (1998) Proc. of the Third ESCA Workshop in Speech Synthesis , pp. 147-151
- Taylor, P.¹ Black, A.² Caley, R.³

46
- 80055032960
- Aidar: Une architecture pour l'indexation de documents audio numériques
- B. Tshibasu-Kabeya, G. Bontempi, F. Beaugendre, and G. Marechal Aidar: Une architecture pour l'indexation de documents audio numériques Proc. Veille Stratégique Scientifique & Technologique (VSST'06) 2006
- (2006) Proc. Veille Stratégique Scientifique & Technologique (VSST'06)
- Tshibasu-Kabeya, B.¹ Bontempi, G.² Beaugendre, F.³ Marechal, G.⁴

47
- 0015960104
- The string-to-string correction problem
- R. Wagner, and M. Fischer The string-to-string correction problem Journal of the ACM 21 1974 168 173
- (1974) Journal of the ACM , vol.21 , pp. 168-173
- Wagner, R.¹ Fischer, M.²

48
- 11144239919
- Unsupervised training of acoustic models for large vocabulary continuous speech recognition
- DOI 10.1109/TSA.2004.838537
- F. Wessel, and H. Ney Unsupervised training of acoustic models for large vocabulary continuous speech recognition IEEE Transactions on Speech and Audio Processing 13 2005 23 31 (Pubitemid 40049937)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.1 , pp. 23-31
- Wessel, F.¹ Ney, H.²

49
- 0030676041
- Using words and phonetic strings for efficient information retrieval from imperfectly transcribed spoken documents
- M.J. Witbrock, and A.G. Hauptmann Using words and phonetic strings for efficient information retrieval from imperfectly transcribed spoken documents Proc. of the second ACM international conference on Digital libraries (DL'97) 1997 30 35
- (1997) Proc. of the Second ACM International Conference on Digital Libraries (DL'97) , pp. 30-35
- Witbrock, M.J.¹ Hauptmann, A.G.²

50
- 0343950213
- Improving acoustic models by watching television
- Carnegie Mellon University
- Witbrock, M.J., Hauptmann, A.G., 1998. Improving acoustic models by watching television. Tech. Rep., CMU-CS-98-110, Carnegie Mellon University.
- (1998) Tech. Rep., CMU-CS-98-110
- Witbrock, M.J.¹ Hauptmann, A.G.²

51
- 0036461035
- Large scale discriminative training of HMM for speech recognition
- P. Woodland, and D. Povey Large scale discriminative training of HMM for speech recognition Computer Speech and Language 16 2002 25 47
- (2002) Computer Speech and Language , vol.16 , pp. 25-47
- Woodland, P.¹ Povey, D.²

52
- 79951779719
- Unsupervised training and directed manual transcription for lvcsr
- K. Yu, M. Gales, L. Wang, and P.C. Woodland Unsupervised training and directed manual transcription for lvcsr Speech Communication 52 2010 652 663
- (2010) Speech Communication , vol.52 , pp. 652-663
- Yu, K.¹ Gales, M.² Wang, L.³ Woodland, P.C.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.