SCOPUS Information Retrieval Platform

Proceedings of the International Multiconference on Computer Science and Information Technology, IMCSIT 2010

Volume 5, 2010, Pages 567-574

APyCA: Towards the Automatic Subtitling of Television Content in Spanish

(3) Alvarez, Aitor a; Del Pozo, Arantza a; Arruti, Andoni b

a VICOMTECH (Spain)

b UNIVERSITY OF THE BASQUE COUNTRY UPV EHU (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

SOFTWARE PROTOTYPING;

AUTOMATIC SPEECH RECOGNITION; AUTOMATIC SUBTITLING; LANGUAGE TECHNOLOGY; SPEAKER DIARIZATION; SPEECH RECOGNITION MODULES; STATE OF THE ART; TELEVISION CONTENT; VOICE ACTIVITY DETECTION;

SPEECH RECOGNITION;

EID: 79551563690 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/imcsit.2010.5680055 Document Type: Conference Paper

Times cited : (10)

References (32)

1
- 79551572695
- Dublin City University
- M. Flanagan, "Human Evaluation of Example-Based MT of subtitles for DVD," Dublin City University, 2009.
- (2009) Human Evaluation of Example-based MT of Subtitles for DVD
- Flanagan, M.¹

2
- 79551520226
- 3.5.
- M. Carroll, "Subtitling: Changing standards for new media?", LISA Newsletter Global Insider, XIII, 3.5.2004. http://www.lisa.org/globalizationinsider/2004/09/subtitling-chan.htm.
- (2004) Subtitling: Changing Standards for New Media? , vol.12
- Carroll, M.¹

3
- 0042597840
- Ottawa: University of Ottawa Press
- L. Bowker, Computer-aided Translation Technology: A Practical Introduction, Ottawa: University of Ottawa Press, 2002.
- (2002) Computer-aided Translation Technology: A Practical Introduction
- Bowker, L.¹

4
- 77951493947
- Robust entropy-based endpoint detection for speech recognition in noisy environments
- paper 0232
- J.L. Shen, J.W. Hung, and L.S. Lee, "Robust Entropy-Based Endpoint Detection for Speech Recognition in Noisy Environments", Proc. Int. Conf. Spoken Language Process., paper 0232, 1998.
- (1998) Proc. Int. Conf. Spoken Language Process.
- Shen, J.L.¹ Hung, J.W.² Lee, L.S.³

5
- 0032301331
- A voice activity detection algorithm for communication systems with dynamically varying background acoustic noises
- I.D. Lee, H.P. Stern, S.A. Mahmoud, "A Voice Activity Detection Algorithm for Communication Systems with Dynamically Varying Background Acoustic Noises," Proc. Veh. Technol. Conf., 1998.
- (1998) Proc. Veh. Technol. Conf.
- Lee, I.D.¹ Stern, H.P.² Mahmoud, S.A.³

6
- 0032762471
- A statistical model-based voice activity detection
- J. Sohn, N.S. Kim, and W. Sung, "A Statistical Model-Based Voice Activity Detection", IEEE Signal Process. Lett., vol. 6, no. 1, pp. 1-3, 1999.
- (1999) IEEE Signal Process. Lett. , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

7
- 33846259282
- Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
- A. Davis, S. Nordholm, R. Togneri, "Statistical Voice Activity Detection Using Low-Variance Spectrum Estimation and an Adaptive Threshold", IEEE Trans. on Signal Proc., vol. 14, no. 2, pp. 412-424, 2006.
- (2006) IEEE Trans. on Signal Proc. , vol.14 , Issue.2 , pp. 412-424
- Davis, A.¹ Nordholm, S.² Togneri, R.³

8
- 0002531237
- Design and preparation of the 1996 hub-4 broadcast news benchmark test corpora
- J.S. Garofolo, J.G. Fiscus, W.M. Fisher, "Design and preparation of the 1996 hub-4 broadcast news benchmark test corpora," in Proceedings of the DARPA Speech Recognition Workshop, pp. 15-21, 1997.
- (1997) Proceedings of the DARPA Speech Recognition Workshop , pp. 15-21
- Garofolo, J.S.¹ Fiscus, J.G.² Fisher, W.M.³

9
- 84973386174
- Corpus description of the ESTER evaluation campaign for the rich transcription of french broadcast news
- S. Galliano, E. Geoffrois, G. Gravier, J.F. Bonastre, D. Mostefa, K. Choukri. "Corpus description of the ESTER Evaluation Campaign for the Rich Transcription of French Broadcast News". In Proceedings of the 5th International Conference on Language Resources and Evaluation 2006.
- (2006) Proceedings of the 5th International Conference on Language Resources and Evaluation
- Galliano, S.¹ Geoffrois, E.² Gravier, G.³ Bonastre, J.F.⁴ Mostefa, D.⁵ Choukri, K.⁶

10
- 33745196882
- AUDIMUS.MEDIA: A broadcast news speech recognition system for the European Portuguese language
- Portugal
- H. Meinedo, D. Caseiro, J. Neto, I. Trancoso. "AUDIMUS.MEDIA: a broadcast news speech recognition system for the European Portuguese language". In Proceedings of PROPOR 2003, Portugal, 2003.
- (2003) Proceedings of PROPOR 2003
- Meinedo, H.¹ Caseiro, D.² Neto, J.³ Trancoso, I.⁴

11
- 77949397809
- DiSCo - A speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain
- D. Baum, B. Samlowski, T. Winkler, R. Bardeli, Schneider: "DiSCo - a speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain". Proceedings of the GSCL Symposium 'Sprachtechnologie und eHumanities' 2009.
- (2009) Proceedings of the GSCL Symposium 'Sprachtechnologie und eHumanities'
- Baum, D.¹ Samlowski, B.² Winkler, T.³ Bardeli, R.⁴ Schneider⁵

12
- 56149126159
- The RWTH 2007 TC-STAR evaluation system for European English and Spanish
- J. Lööf, Ch. Gollan, S. Hahn, G. Heigold, B. Hoffmeister, Ch. Plahl, D. Rybach, R. Schlüter and H. Ney. "The RWTH 2007 TC-STAR Evaluation System for European English and Spanish". Interspeech 2007.
- Interspeech 2007
- Lööf, J.¹ Gollan, Ch.² Hahn, S.³ Heigold, G.⁴ Hoffmeister, B.⁵ Plahl, Ch.⁶ Rybach, D.⁷ Schlüter, R.⁸ Ney, H.⁹

13
- 84867198850
- Towards automatic learning in LVCSR: Rapid development of a Persian broadcast transcription system
- C. Gollan, H. Ney, "Towards automatic learning in LVCSR: Rapid development of a Persian broadcast transcription system," Interspeech '08.
- Interspeech '08
- Gollan, C.¹ Ney, H.²

14
- 77949404775
- Comparing automatic rich transcription for Portuguese, Spanish and English broadcast news
- F. Batista, I. Trancoso, N. J. Mamede. "Comparing Automatic Rich Transcription for Portuguese, Spanish and English Broadcast News". In Automatic Speech Recognition and Understanding Workshop, 2009.
- (2009) Automatic Speech Recognition and Understanding Workshop
- Batista, F.¹ Trancoso, I.² Mamede, N.J.³

15
- 0012577933
- The Limsi SDR system for TREC-9
- Gaithersburg, Md, USA
- J.-L. Gauvain, L. Lamel, C. Barras, G. Adda, and Y. de Kercadio, "The Limsi SDR system for TREC-9," in Proc. 9th Text Retrieval Conference, TREC-9, pp. 335-341, Gaithersburg, Md, USA, 2000.
- (2000) Proc. 9th Text Retrieval Conference, TREC-9 , pp. 335-341
- Gauvain, J.-L.¹ Lamel, L.² Barras, C.³ Adda, G.⁴ De Kercadio, Y.⁵

16
- 85009062679
- The ICSI-SRI-UW metadata extraction system
- Korea
- Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, B. Peskin, and M. Harper. "The ICSI-SRI-UW Metadata Extraction System". ICSLP 2004, International Conf. on Spoken Language Processing, Korea. 2004.
- (2004) ICSLP 2004, International Conf. on Spoken Language Processing
- Liu, Y.¹ Shriberg, E.² Stolcke, A.³ Hillard, D.⁴ Ostendorf, M.⁵ Peskin, B.⁶ Harper, M.⁷

17
- 79551563199
- Darwin College, University of Cambridge and Cambridge University Engineering Department
- J.H. Yim. "Named Entity Recognition from Speech and its Use in the Generation of Enhanced Speech Recognition Output". Darwin College, University of Cambridge and Cambridge University Engineering Department. 2001.
- (2001) Named Entity Recognition from Speech and its Use in the Generation of Enhanced Speech Recognition Output
- Yim, J.H.¹

18
- 0346921386
- Punctuation annotation using statistical prosody models
- H. Christensen, Y. Gotoh, and S. Renals, "Punctuation annotation using statistical prosody models," in Proc. of the ISCA Workshop on Prosody in Speech Recognition and Understanding, pp. 35-40, 2001.
- (2001) Proc. of the ISCA Workshop on Prosody in Speech Recognition and Understanding , pp. 35-40
- Christensen, H.¹ Gotoh, Y.² Renals, S.³

19
- 84919457977
- The use of prosody in a combined system for punctuation generation and speech recognition
- J. Kim, P. C. Woodland, "The use of prosody in a combined system for punctuation generation and speech recognition," Proc. Eurospeech '01.
- Proc. Eurospeech '01
- Kim, J.¹ Woodland, P.C.²

20
- 79551527933
- Sentence boundary detection in broadcast speech transcripts
- Y. Gotoh and S. Renals, "Sentence boundary detection in broadcast speech transcripts," in Proc. of the ISCA Workshop: ASR-2000.
- Proc. of the ISCA Workshop: ASR-2000
- Gotoh, Y.¹ Renals, S.²

21
- 0034275920
- Prosody based automatic segmentation of speech into sentences and topics
- E. Shriberg, A. Stolcke, D. Hakkani-Tür, and G. Tür, "Prosody based automatic segmentation of speech into sentences and topics," Speech Communication, vol. 32, no. 1-2, pp. 127-154, 2000.
- (2000) Speech Communication , vol.32 , Issue.1-2 , pp. 127-154
- Shriberg, E.¹ Stolcke, A.² Hakkani-Tür, D.³ Tür, G.⁴

22
- 70349218123
- Speaker diarization in meeting audio
- Taipei, April 19-24
- T. L. Nwe, H. Sun, H. Li, S. Rahardja, "Speaker Diarization in Meeting Audio", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, April 19-24, 2009.
- (2009) IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009)
- Nwe, T.L.¹ Sun, H.² Li, H.³ Rahardja, S.⁴

23
- 84956534244
- The IBM RT07 evaluation systems for speaker diarization on lecture meetings
- Springer
- J. Huang, E. Marcheret, K. Visweswariah, G. Potamianos, "The IBM RT07 Evaluation Systems for Speaker Diarization on Lecture Meetings", in Multimodal Technologies for Perception of Humans, Springer, 2008.
- (2008) Multimodal Technologies for Perception of Humans
- Huang, J.¹ Marcheret, E.² Visweswariah, K.³ Potamianos, G.⁴

24
- 51449087867
- The ICSI RT07s speaker diarization system
- C. Wooters, M. Huijbregts. "The ICSI RT07s Speaker Diarization System". In Rich Transcription 2007 Meeting Recognition Workshop.
- Rich Transcription 2007 Meeting Recognition Workshop
- Wooters, C.¹ Huijbregts, M.²

25
- 78650898482
- LIUM SpkDiarization: An open source toolkit for diarization
- Dallas
- S. Meignier, T. Merlin. "LIUM SpkDiarization: An Open Source Toolkit For Diarization". CMU Sphinx Workshop 2010, Dallas, 2010.
- (2010) CMU Sphinx Workshop 2010
- Meignier, S.¹ Merlin, T.²

26
- 0141589463
- Hidden Markov Model Toolkit (HTK) 3.2, Cambridge University Engineering Department, http://htk.eng.cam.ac.uk/, 2002.
- (2002) Hidden Markov Model Toolkit (HTK) 3.2

27
- 0001848274
- Development of Spanish Corpora for Speech Research (Albayzin)
- Italy, 1991
- F. Casacuberta, R. Garcia, J. Llisterri, C. Nadeu, J.M. Pardo, A. Rubio: "Development of Spanish Corpora for Speech Research (Albayzin)". Workshop on International Cooperation and Standardization of Speech Databases and Speech I/O Assessment Methods, Italy, 1991.
- Workshop on International Cooperation and Standardization of Speech Databases and Speech I/O Assessment Methods
- Casacuberta, F.¹ Garcia, R.² Llisterri, J.³ Nadeu, C.⁴ Pardo, J.M.⁵ Rubio, A.⁶

28
- 76749092270
- The WEKA data mining software: An update
- M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, I. H. Witten. "The WEKA Data Mining Software: An Update"; SIGKDD Explorations, Volume 11, Issue 1. 2009.
- (2009) SIGKDD Explorations , vol.11 , Issue.1
- Hall, M.¹ Frank, E.² Holmes, G.³ Pfahringer, B.⁴ Reutemann, P.⁵ Witten, I.H.⁶

29
- 79551543112
- Multext-prosody
- (Ed.), CD-ROM Distributed by ELRA/ELDA
- E. Campione, (Ed.) Multext-Prosody. A multilingual prosodic database. CD-ROM Distributed by ELRA/ELDA. 1999.
- (1999) A Multilingual Prosodic Database
- Campione, E.¹

30
- 79551569128
- Spoken Language Processing Lab, Purdue University
- Z. Huang, L. Chen, M. Harper. "Purdue Prosodic Feature Extraction Toolkit on Praat". Spoken Language Processing Lab, Purdue University. 2006.
- (2006) Purdue Prosodic Feature Extraction Toolkit on Praat
- Huang, Z.¹ Chen, L.² Harper, M.³

31
- 84910065251
- Sphinx-4. "A speech recognizer written entirely in the Java programming language", http://cmusphinx.sourceforge.net/sphinx4/.
- A Speech Recognizer Written Entirely in the Java Programming Language

32
- 84971360473
- FFmpeg. "A complete, cross-platform solution to record, convert and stream audio and video", http://www.ffmpeg.org/inbroad.
- A Complete, Cross-platform Solution to Record, Convert and Stream Audio and Video

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.